TY - GEN
T1 - Safety-Driven Deep Reinforcement Learning Framework for Cobots
T2 - 10th International Conference on Control, Decision and Information Technologies, CoDIT 2024
AU - Abbas, Ammar N.
AU - Mehak, Shakra
AU - Chasparis, Georgios C.
AU - Kelleher, John D.
AU - Guilfoyle, Michael
AU - Leva, Maria Chiara
AU - Ramasubramanian, Aswin K.
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This study presents a novel methodology incorporating safety constraints into a robotic simulation during the training of deep reinforcement learning (DRL). The framework integrates specific parts of the safety requirements, such as velocity constraints, as specified by ISO 10218, directly within the DRL model that becomes a part of the robot's learning algorithm. The study then evaluated the efficiency of these safety constraints by subjecting the DRL model to various scenarios, including grasping tasks with and without obstacle avoidance. The validation process involved comprehensive simulation-based testing of the DRL model's responses to potential hazards and its compliance. Also, the performance of the system is carried out by the functional safety standards IEC 61508 to determine the safety integrity level. The study indicated a significant improvement in the safety performance of the robotic system. The proposed DRL model anticipates and mitigates hazards while maintaining operational efficiency. This study was validated in a testbed with a collaborative robotic arm with safety sensors and assessed with metrics such as the average number of safety violations, obstacle avoidance, and the number of successful grasps. The proposed approach outperforms the conventional method by a 16.5% average success rate on the tested scenarios in the simulations and 2.5% in the testbed without safety violations.
AB - This study presents a novel methodology incorporating safety constraints into a robotic simulation during the training of deep reinforcement learning (DRL). The framework integrates specific parts of the safety requirements, such as velocity constraints, as specified by ISO 10218, directly within the DRL model that becomes a part of the robot's learning algorithm. The study then evaluated the efficiency of these safety constraints by subjecting the DRL model to various scenarios, including grasping tasks with and without obstacle avoidance. The validation process involved comprehensive simulation-based testing of the DRL model's responses to potential hazards and its compliance. Also, the performance of the system is carried out by the functional safety standards IEC 61508 to determine the safety integrity level. The study indicated a significant improvement in the safety performance of the robotic system. The proposed DRL model anticipates and mitigates hazards while maintaining operational efficiency. This study was validated in a testbed with a collaborative robotic arm with safety sensors and assessed with metrics such as the average number of safety violations, obstacle avoidance, and the number of successful grasps. The proposed approach outperforms the conventional method by a 16.5% average success rate on the tested scenarios in the simulations and 2.5% in the testbed without safety violations.
KW - Collaborative Robots
KW - Functional Safety
KW - ISO standards
KW - Safe Deep Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85208225124&partnerID=8YFLogxK
U2 - 10.1109/CoDIT62066.2024.10708345
DO - 10.1109/CoDIT62066.2024.10708345
M3 - Conference contribution
AN - SCOPUS:85208225124
T3 - 10th 2024 International Conference on Control, Decision and Information Technologies, CoDIT 2024
SP - 2917
EP - 2923
BT - 10th 2024 International Conference on Control, Decision and Information Technologies, CoDIT 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 1 July 2024 through 4 July 2024
ER -