Distributed Non-Communicating Multi-Robot Collision Avoidance via Map-Based Deep Reinforcement Learning

Guangda Chen; Shunyi Yao; Jun Ma; Lifan Pan; Yu’an Chen; Pei Xu; Jianmin Ji; Xiaoping Chen

doi:10.3390/s20174836

2020

Abstract

It is challenging for multiple robots of different shapes to avoid obstacles safely and efficiently in distributed, communication-free scenarios, where robots do not communicate with each other and sense only the positions of other robots and the obstacles around them. Most existing multi-robot collision avoidance systems require either communication between robots or movement data of other robots that is expensive to obtain, such as velocities, accelerations, and paths. In this paper, we propose a map-based deep reinforcement learning (DRL) approach for multi-robot collision avoidance in a distributed and communication-free environment. We represent the environmental information around a robot with its egocentric local grid map, which includes the robot’s own shape and the observable appearances of other robots and obstacles, and which can be easily generated from multiple sensors or sensor fusion. We then apply the distributed proximal policy optimization (DPPO) algorithm to train a convolutional neural network that directly maps three frames of egocentric local grid maps and the robot’s relative local goal positions to low-level robot control commands. Compared to other methods, the map-based approach is more robust to noisy sensor data, does not require robots’ movement data, and considers the sizes and shapes of the robots involved, which makes it more efficient and easier to deploy on real robots. We first train the neural network with DPPO in a specified simulator of multiple mobile robots, using a multi-stage curriculum learning strategy over multiple scenarios to improve performance. We then deploy the trained model on real robots to perform collision avoidance during navigation without tedious parameter tuning. We evaluate the approach in multiple scenarios, both in the simulator and on four differential-drive mobile robots in the real world. Both qualitative and quantitative experiments show that our approach is efficient and outperforms existing DRL-based approaches on many metrics. We also conduct ablation studies showing the positive effects of using egocentric grid maps and multi-stage curriculum learning.
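
The abstract describes the network input as three stacked frames of an egocentric local grid map. The sketch below is only an illustration of that idea and is not taken from the paper: the function names `egocentric_grid` and `stack_frames`, the 6 m window, and the 60 x 60 resolution are assumptions chosen for the example, and it rasterizes bare 2D points rather than the robot shapes the authors encode.

```python
import numpy as np

def egocentric_grid(points_world, robot_xy, robot_yaw, size_m=6.0, cells=60):
    """Rasterize 2D world-frame points into a robot-centred occupancy grid.

    size_m and cells are illustrative values, not the paper's settings.
    """
    # Translate so the robot sits at the origin.
    pts = np.asarray(points_world, dtype=float).reshape(-1, 2)
    pts = pts - np.asarray(robot_xy, dtype=float)
    # Rotate by -yaw so the robot's heading becomes the +x axis.
    c, s = np.cos(-robot_yaw), np.sin(-robot_yaw)
    rot = np.array([[c, -s], [s, c]])
    local = pts @ rot.T
    # Convert metres to cell indices; the robot is at the grid centre.
    res = size_m / cells
    idx = np.floor((local + size_m / 2.0) / res).astype(int)
    inside = np.all((idx >= 0) & (idx < cells), axis=1)
    grid = np.zeros((cells, cells), dtype=np.float32)
    grid[idx[inside, 1], idx[inside, 0]] = 1.0  # row = y, column = x
    return grid

def stack_frames(frames):
    """Stack the three most recent grids into a (3, H, W) array."""
    return np.stack(frames[-3:], axis=0)

# Hypothetical usage: one nearby point and one point outside the window.
g = egocentric_grid([(1.05, 0.05), (100.0, 100.0)], robot_xy=(0.0, 0.0), robot_yaw=0.0)
obs = stack_frames([g, g, g])
```

A stacked array of this kind, together with the relative goal position, is the sort of input a convolutional policy would consume. Points outside the window are simply dropped, which is one possible design choice among several.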

Reference Key	chen2020sensorsdistributed
Authors	Guangda Chen; Shunyi Yao; Jun Ma; Lifan Pan; Yu’an Chen; Pei Xu; Jianmin Ji; Xiaoping Chen
Journal	Sensors
Year	2020
DOI	10.3390/s20174836
URL	https://www.mdpi.com/1424-8220/20/17/4836; https://doi.org/10.3390/s20174836
Keywords	deep reinforcement learning; multi-robot navigation; distributed collision avoidance
