Container Unloading via Reinforcement Learning: Picking Order, Deadlock Avoidance, and Proof-of-Concept Simulation

2605.27143

Total: 1

#1 Container Unloading via Reinforcement Learning: Picking Order, Deadlock Avoidance, and Proof-of-Concept Simulation [PDF] [Copy] [Kimi] [REL]

Authors: Jan Rüdiger, Max Schenke, Daniel Weber

Unloading containers in the courier, express and parcel industry is a physically demanding and labor-intensive work. Automatizing this process is an important step towards increasing the efficiency of parcel-handling systems. This work investigates the potential of reinforcement learning to learn a policy for item selection in container unloading scenarios. For that, a simulation environment is created and a masked deep Q-learning with a specially designed neural network architecture is implemented. The results indicate that the agent can learn to select items with an average success rate of 60 %, which is significantly better than a random policy at a random chance of 20 %. The findings suggest that RL could be a promising approach for automatizing item unloading tasks in the future.

Subject: Systems and Control

Publish: 2026-05-26 15:08:22 UTC