TY - GEN
T1 - Learning to Control Redundant Musculoskeletal Systems with Neural Networks and SQP
T2 - 2018 IEEE International Conference on Robotics and Automation, ICRA 2018
AU - Driess, Danny
AU - Zimmermann, Heiko
AU - Wolfen, Simon
AU - Suissa, Dan
AU - Haeufle, Daniel
AU - Hennes, Daniel
AU - Toussaint, Marc
AU - Schmitt, Syn
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/9/10
Y1 - 2018/9/10
N2 - Modeling biomechanical musculoskeletal systems reveals that the mapping from muscle stimulations to movement dynamics is highly nonlinear and complex, which makes it difficult to control those systems with classical techniques. In this work, we not only investigate whether machine learning approaches are capable of learning a controller for such systems. We are especially interested in the question if the structure of the musculoskeletal apparatus exhibits properties that are favorable for the learning task. In particular, we consider learning a control policy from target positions to muscle stimulations. To account for the high actuator redundancy of biomechanical systems, our approach uses a learned forward model represented by a neural network and sequential quadratic programming to obtain the control policy, which also enables us to alternate the co-contraction level and hence allows to change the stiffness of the system and to include optimality criteria like small muscle stimulations. Experiments on both a simulated musculoskeletal model of a human arm and a real biomimetic muscle-driven robot show that our approach is able to learn an accurate controller despite high redundancy and nonlinearity, while retaining sample efficiency.
AB - Modeling biomechanical musculoskeletal systems reveals that the mapping from muscle stimulations to movement dynamics is highly nonlinear and complex, which makes it difficult to control those systems with classical techniques. In this work, we not only investigate whether machine learning approaches are capable of learning a controller for such systems. We are especially interested in the question if the structure of the musculoskeletal apparatus exhibits properties that are favorable for the learning task. In particular, we consider learning a control policy from target positions to muscle stimulations. To account for the high actuator redundancy of biomechanical systems, our approach uses a learned forward model represented by a neural network and sequential quadratic programming to obtain the control policy, which also enables us to alternate the co-contraction level and hence allows to change the stiffness of the system and to include optimality criteria like small muscle stimulations. Experiments on both a simulated musculoskeletal model of a human arm and a real biomimetic muscle-driven robot show that our approach is able to learn an accurate controller despite high redundancy and nonlinearity, while retaining sample efficiency.
UR - http://www.scopus.com/inward/record.url?scp=85063138292&partnerID=8YFLogxK
U2 - 10.1109/ICRA.2018.8463160
DO - 10.1109/ICRA.2018.8463160
M3 - Conference contribution
AN - SCOPUS:85063138292
T3 - Proceedings - IEEE International Conference on Robotics and Automation
SP - 6461
EP - 6468
BT - 2018 IEEE International Conference on Robotics and Automation, ICRA 2018
PB - Institute of Electrical and Electronics Engineers
Y2 - 21 May 2018 through 25 May 2018
ER -