Bipedal wheeled robots, with their excellent mobility and flexibility, demonstrate broad application prospects. However, current research on bipedal wheeled robots mainly focuses on small mobile platforms, whose endurance and dynamic performance still cannot meet the requirements of practical applications. For high-inertia bipedal wheeled robots that meet the requirements and are equipped with robotic arms, the autonomous grasping stability of the robotic arm also faces significant challenges. To address these issues, we propose a local space-oriented decoupled high-inertia bipedal wheeled robot training method - DeLM ( De coupled L oco - M anipulation Methods), aiming to provide strong support for the body stability of this type of robot during grasping tasks. In this paper, we first introduce a randomization curriculum for the robotic arm to describe the impact of the real-world robotic arm on lower limb balance in different poses, which does not require additional consideration of robotic arm collision constraints; in addition, we only use a one-stage training paradigm to achieve a transparent learning process of the lower limbs for the upper limbs, enabling the lower limbs to actively infer the pose of the robotic arm and thus autonomously adjust the body pose to achieve stable balance; finally, we perform parametric modeling and randomization of the real robot motor in simulation, successfully achieving smooth grasping tasks on a 65kg bipedal wheeled robot, including picking up objects, grasping objects at high places, etc. The results show that the method we proposed in our bipedal wheeled robot grasping task success rate exceeds 95%, with the average shaking distance not exceeding 10cm, meeting the requirements of most applications. For specific details of the relevant demonstration, please refer to our project website:https://decoupled-loco-manipulation.github.io/DeLM.github.io/.
Fig. 1. Overview of DLM. In Arm Curriculumn in Circle 1 (blue area) we formulated a randomization course for robotic arms. This part of the robotic arms has no collision volume in the simulation. The grasping postures of different robotic arms are generated through various randomization methods. The difficulty gradually transitions from the simplest fixed starting posture to the most difficult random linear interpolation posture; in Arm Estimator in Circle 2 (green area) Arm Estimator, the above Arm Curriculumn is used to achieve different actions, and the two indicators of the end effector position of the robotic arm and the center of mass position of the robotic arm are used to better assist the lower limb balance; Circle 3 (grey area) is the Sim Train process of the overall solution. Historical observation information outputs the estimated value through historical MLP, and the robotic arm information in the privileged observation information (including the center of mass of the robotic arm and ee. Pos) and ontology information (base height and linear speed) are supervised and learned, and the motor parameters are restricted through the NP3O network, and the robot's balance is finally controlled; Circle 4 (white area) is the deployment process, and the upper limbs control the movement of the robotic arm through Diffusion Policy or remote operation, while the lower limbs use the above-mentioned training network for smooth control. During the deployment process, detailed calibrations such as motor properties, sensor delay and motor torque compensation are also required to reduce sim2real's gap.
BibTex Code Here