Deep Reinforcement Learning in Mining Fleet Optimization
Reinforcement Learning (RL) can drastically improve fleet assignment decisions in mining. In my project Vivania, a DRL agent learned to reduce queue times and exploit system dynamics to increase throughput with only CPU training.
Key takeaways
- Reward shaping should balance travel time, queue avoidance, and throughput.
- Sim fidelity matters: include realistic constraints to avoid degenerate tricks.
- Evaluate with domain KPIs: tons/hour, queue length, idle time, fuel burn.
Explore the repo for a starting point to experiment with DRL in mining operations.