TY - GEN
T1 - Extending Policy from One-Shot Learning through Coaching
AU - Balakuntala, Mythra V.
AU - Venkatesh, Vishnunandan L.N.
AU - Bindu, Jyothsna Padmakumar
AU - Voyles, Richard M.
AU - Wachs, Juan
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10/1
Y1 - 2019/10/1
N2 - Humans generally teach their fellow collaborators to perform tasks through a small number of demonstrations, often followed by episodes of coaching that tune and refine the execution during practice. Adopting a similar framework for teaching robots through demonstrations makes teaching tasks highly intuitive and imitating the refinement of complex tasks through coaching improves the efficacy. Unlike traditional Learning from Demonstration (LfD) approaches which rely on multiple demonstrations to train a task, we present a novel one-shot learning from demonstration approach, augmented by coaching, to transfer the task from task expert to robot. The demonstration is automatically segmented into a sequence of a priori skills (the task policy) parametrized to match task goals. During practice, the robotic skills self-evaluate their performances and refine the task policy to locally optimize cumulative performance. Then, human coaching further refines the task policy to explore and globally optimize the net performance. Both the self-evaluation and coaching are implemented using reinforcement learning (RL) methods. The proposed approach is evaluated using the task of scooping and unscooping granular media. The self-evaluator of the scooping skill uses the realtime force signature and resistive force theory to minimize scooping resistance similar to how humans scoop. Coaching feedback focuses modifications to sub-domains of the action space, using RL to converge to desired performance. Thus, the proposed method provides a framework for learning tasks from one demonstration and generalizing it using human feedback through coaching achieving a success rate of ≈90%.
AB - Humans generally teach their fellow collaborators to perform tasks through a small number of demonstrations, often followed by episodes of coaching that tune and refine the execution during practice. Adopting a similar framework for teaching robots through demonstrations makes teaching tasks highly intuitive and imitating the refinement of complex tasks through coaching improves the efficacy. Unlike traditional Learning from Demonstration (LfD) approaches which rely on multiple demonstrations to train a task, we present a novel one-shot learning from demonstration approach, augmented by coaching, to transfer the task from task expert to robot. The demonstration is automatically segmented into a sequence of a priori skills (the task policy) parametrized to match task goals. During practice, the robotic skills self-evaluate their performances and refine the task policy to locally optimize cumulative performance. Then, human coaching further refines the task policy to explore and globally optimize the net performance. Both the self-evaluation and coaching are implemented using reinforcement learning (RL) methods. The proposed approach is evaluated using the task of scooping and unscooping granular media. The self-evaluator of the scooping skill uses the realtime force signature and resistive force theory to minimize scooping resistance similar to how humans scoop. Coaching feedback focuses modifications to sub-domains of the action space, using RL to converge to desired performance. Thus, the proposed method provides a framework for learning tasks from one demonstration and generalizing it using human feedback through coaching achieving a success rate of ≈90%.
UR - http://www.scopus.com/inward/record.url?scp=85078858892&partnerID=8YFLogxK
U2 - 10.1109/RO-MAN46459.2019.8956364
DO - 10.1109/RO-MAN46459.2019.8956364
M3 - Conference contribution
AN - SCOPUS:85078858892
T3 - 2019 28th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019
BT - 2019 28th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019
PB - Institute of Electrical and Electronics Engineers
T2 - 28th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN 2019
Y2 - 14 October 2019 through 18 October 2019
ER -