Google Scholar

Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows

Z Iklassov, I Sobirov, R Solozabal, M Takac - arXiv preprint arXiv …, 2024 - arxiv.org

Z Iklassov, I Sobirov, R Solozabal, M Takac

arXiv preprint arXiv:2402.09765, 2024•arxiv.org

This paper introduces a reinforcement learning approach to optimize the Stochastic Vehicle Routing Problem with Time Windows (SVRP), focusing on reducing travel costs in goods delivery. We develop a novel SVRP formulation that accounts for uncertain travel costs and demands, alongside specific customer time windows. An attention-based neural network trained through reinforcement learning is employed to minimize routing costs. Our approach addresses a gap in SVRP research, which traditionally relies on heuristic methods, by leveraging machine learning. The model outperforms the Ant-Colony Optimization algorithm, achieving a 1.73% reduction in travel costs. It uniquely integrates external information, demonstrating robustness in diverse environments, making it a valuable benchmark for future SVRP studies and industry application.

arxiv.org

Show moreShow less

Cite

Advanced search

Saved to My library

Reinforcement Learning for Solving Stochastic Vehicle Routing Problem with Time Windows