Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot

We present an automated learning environment for developing control policies directly on the hardware of a modular legged robot.

June 27, 2018
International Conference on Ubiquitous Robots 2018

Authors

Sehoon Ha (Disney Research)

Joohyung Kim (Disney Research)

Katsu Yamane (Disney Research)

Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot

Download Publication PDF

Abstract

In this paper, we present an automated learning environment for developing control policies directly on the hardware of a modular legged robot. This environment facilitates the reinforcement learning process by computing the rewards using a vision-based tracking system and relocating the robot to the initial position using a resetting mechanism. We employ two state-of-the-art deep reinforcement learning (DRL) algorithms, Trust Region Policy Optimization (TRPO) and Deep Deterministic Policy Gradient (DDPG), to train neural network policies for simple rowing and crawling motions. Using the developed environment, we demonstrate both learning algorithms can effectively learn policies for simple locomotion skills on highly stochastic hardware and environments. We further expedite learning by transferring policies learned on a single legged configuration to multi-legged ones.

Copyright Notice

The documents contained in these directories are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author’s copyright. These works may not be reposted without the explicit permission of the copyright holder.

Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot

We present an automated learning environment for developing control policies directly on the hardware of a modular legged robot.

Authors

Automated Deep Reinforcement Learning Environment for Hardware of a Modular Legged Robot

Abstract

Copyright Notice

Research at Disney

Legal

MORE