Reinforcement Learning for Robotic Assembly Using Non-Diagonal Stiffness Matrix
Authors: | Sho Sakaino, Toshiaki Tsuji, Masahide Oikawa, Tsukasa Kusakabe, Kyo Kutsuzawa |
---|---|
Year of publication: | 2021 |
Subject: | 0209 industrial biotechnology; Control and Optimization; Admittance; Computer science; Mechanical Engineering; Diagonal; Biomedical Engineering; Time constant; 02 engineering and technology; 021001 nanoscience & nanotechnology; Computer Science Applications; Human-Computer Interaction; Task (computing); 020901 industrial engineering & automation; Sampling (signal processing); Artificial Intelligence; Control and Systems Engineering; Control theory; Trajectory; Reinforcement learning; Computer Vision and Pattern Recognition; 0210 nano-technology; Stiffness matrix |
Source: | IEEE Robotics and Automation Letters. 6:2737-2744 |
ISSN: | 2377-3774 |
Description: | Contact-rich tasks, wherein multiple contact transitions occur in a series of operations, have been extensively studied for task automation. Precision assembly, a typical example of contact-rich tasks, requires high time constants to cope with the change in contact state. Therefore, this letter proposes a local trajectory planning method for precision assembly with high time constants. Because the non-diagonal component of a stiffness matrix can induce motion at high sampling frequencies, we use this concept to design a stiffness matrix to guide the motion of an object and propose a method to control it. We introduce reinforcement learning (RL) for the selection of the stiffness matrix because the relationship between the desired direction and the sensor response is difficult to model. An architecture with various sampling rates for RL and admittance control has the advantage of rapid response owing to a high time constant of the local trajectory modification. The effectiveness of the method is verified experimentally on two contact-rich tasks: inserting a peg into a hole and inserting a gear. Using the proposed method, the average total time needed to insert the peg in the hole is 1.64 s, which is less than half the time reported by the best of the existing state-of-the-art studies. |
Database: | OpenAIRE |
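
The abstract's remark that a non-diagonal stiffness component can induce motion may be easier to follow with a small numeric illustration. The sketch below is not the authors' controller: it assumes a simplified quasi-static relation K x = F in two dimensions (x lateral, z along the insertion axis), and every number and name in it is hypothetical. It only shows that an off-diagonal entry turns a force measured along z into a displacement along x, which a diagonal matrix cannot do.

```python
def solve_2x2(K, F):
    """Solve K x = F for a 2x2 matrix K using Cramer's rule."""
    (a, b), (c, d) = K
    det = a * d - b * c
    if det == 0:
        raise ValueError("stiffness matrix is singular")
    return [(d * F[0] - b * F[1]) / det, (a * F[1] - c * F[0]) / det]


# Hypothetical values (N/m and N); axes are ordered (x: lateral, z: insertion).
K_diag = [[100.0, 0.0], [0.0, 100.0]]      # diagonal: axes are decoupled
K_coupled = [[100.0, 50.0], [0.0, 100.0]]  # off-diagonal term couples z into x
F_contact = [0.0, 10.0]                    # contact force along z only

x_diag = solve_2x2(K_diag, F_contact)
x_coupled = solve_2x2(K_coupled, F_contact)

print("diagonal K    ->", x_diag)
print("non-diagonal K ->", x_coupled)
```

With the diagonal matrix the lateral displacement stays at zero, while the coupled matrix produces a nonzero lateral displacement from the same purely axial force. In the letter, choosing such a matrix is left to the RL agent; here it is fixed by hand purely for illustration.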