Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

FrugalLight : Symmetry-Aware Cyclic Heterogeneous Intersection Control using Deep Reinforcement Learning with Model Compression, Distillation and Domain Knowledge

Published: 13 May 2024 Publication History

Abstract

Developing countries need to better manage fast increasing traffic flows, owing to rapid urbanization. Else, increasing traffic congestion would increase fatalities due to reckless driving, as well as keep vehicular emissions and air pollution critically high in cities like New Delhi. State-of-the-art traffic signal control methods in developed countries, however, use expensive sensing, computation, and communication resources. How far can control algorithms go, under resource constraints, is explored through the design and evaluation of FrugalLight (FL) in this article. We also captured and processed a real traffic dataset at a busy intersection in New Delhi, India, using efficient techniques on low cost embedded devices. This dataset (https://delhi-trafficdensity-dataset.github.io) contains traffic density information at fine time granularity of one measurement every second, from all approaches of the intersection for 40 days. FrugalLight (https://github.com/sachin-iitd/FrugalLight) is evaluated on the collected traffic dataset from New Delhi and another open source traffic dataset from New York. FrugalLight matches the performance of state-of-the-art Convolutional Neural Network (CNN) based sensing and Deep Reinforcement Learning (DRL) based control algorithms, while utilizing resources less by an order of magnitude. We further explore improvements using a careful combination of knowledge distillation and domain knowledge based DRL model compression, with employing Model-Agnostic Meta-Learning to quickly adapt to traffic at new intersections. The collected real dataset and FrugalLight therefore opens up opportunities for resource efficient RL based intersection control design for the ML research community, where the controller should have limited carbon footprint. Such intelligent, green, intersection controllers can help reduce traffic congestion and associated vehicular emissions, even if compute and communication infrastructure is limited in low resource regions. This is a critical step toward achieving two of the United Nations Sustainable Development Goals (SDG), namely sustainable cities and communities and climate action.

References

[1]
2021. Meta learning (computer science). (2021). Retrieved from https://en.wikipedia.org/wiki/Meta_learning_(computer_science)
[2]
Mohammad Aslani, Mohammad Saadi Mesgari, and Marco Wiering. 2017. Adaptive traffic signal control with actor-critic methods in a real-world traffic network with different traffic disruption events. Transportation Research Part C: Emerging Technologies 85 (2017), 732–752. DOI:
[3]
Mohammad Aslani, Stefan Seipel, Mohammad Saadi Mesgari, and Marco Wiering. 2018. Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown tehran. Adv. Eng. Inform. 38, C (2018), 639–655. DOI:
[4]
U. K. Bewiser. 2016. Traffic lights cause traffic jams, new research suggests. (2016). Retrieved from https://www.bewiser.co.uk/news/car-insurance/traffic-lights-cause-traffic-jams-new-research-suggests
[5]
Ankit Bhardwaj, Shiva R. Iyer, Sriram Ramesh, Jerome White, and Lakshminarayanan Subramanian. 2023. Understanding sudden traffic jams: From emergence to impact. Development Engineering 8 (2023), 100105.
[6]
Thierry Bouwmans. 2014. Background modeling and Foreground Detection for video surveillance: Traditional and Recent Approaches, Benchmarking and Evaluation. (2014). Retrieved from http://www.crcpress.com/product/isbn/9781482205374.
[7]
CC-by4. 2013. Attribution 4.0 International (CC BY 4.0). (2013). Retrieved June 7, 2023 from https://creativecommons.org/licenses/by/4.0
[8]
Mayank Singh Chauhan, Arshdeep Singh, Mansi Khemka, Arneish Prateek, and Rijurekha Sen. 2019. Embedded CNN based vehicle classification and counting in non-laned road traffic. In Proceedings of the 10th International Conference on Information and Communication Technologies and Development.
[9]
Sachin Chauhan, Kashish Bansal, and Rijurekha Sen. 2020. EcoLight: Intersection control in developing regions under extreme budget and network constraints. In Proceedings of the 34th Conference on Neural Information Processing Systems. Retrieved from https://proceedings.neurips.cc/paper/2020/file/97e49161287e7a4f9b745366e4f9431b-Paper.pdf
[10]
Sachin Chauhan, Sayan Ranu, Rijurekha Sen, Zeel B. Patel, and Nipun Batra. 2023. AirDelhi: Fine-grained spatio-temporal particulate matter dataset from delhi for ML based modeling. In Proceedings of the 37th Conference on Neural Information Processing Systems Track on Datasets and Benchmarks. Retrieved from https://openreview.net/pdf?id=n2wW7goGky
[11]
Sachin Chauhan and Rijurekha Sen. 2023. RealLight: DRL based intersection control in developing countries without traffic simulators. In Proceedings of the NeurIPS 2023 Computational Sustainability: Promises and Pitfalls from Theory to Deployment. Retrieved from https://openreview.net/pdf?id=dmjT841VuV
[12]
Seung-Bae Cools, Carlos Gershenson, and Bart D’Hooghe. 2006. Self-organizing traffic lights: A realistic simulation. Advances in Applied Self-Organizing Systems (2006). DOI:
[13]
Anthony Davis. 2019. Inductive loops or wireless magnetometers for traffic signal control. (2019). Retrieved from https://highways.today/2019/03/22/inductive-loops-wireless-magnetometers
[14]
Samah El-Tantawy and Baher Abdulhai. 2010. An agent-based learning towards decentralized and coordinated traffic signal control. In Proceedings of the 13th International IEEE Conference on Intelligent Transportation Systems. 665–670. DOI:
[15]
Samah El-Tantawy, Baher Abdulhai, and Hossam Abdelgawad. 2013. Multiagent reinforcement learning for integrated network of adaptive traffic signal controllers (MARLIN-ATSC): Methodology and large-scale application on downtown toronto. IEEE Transactions on Intelligent Transportation Systems 14, 3 (2013), 1140–1150. DOI:
[16]
FHWA. 2006. Traffic Detector Handbook: Chapter 4. In-Roadway Sensor Design. (2006). Retrieved from https://www.fhwa.dot.gov/publications/research/operations/its/06108/04.cfm)
[17]
Chelsea Finn, Pieter Abbeel, and Sergey Levine. 2017. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. (2017). arXiv:1703.03400. Retrieved from https://arxiv.org/abs/1703.03400
[18]
Qiushan Guo, Xinjiang Wang, Yichao Wu, Zhipeng Yu, Ding Liang, Xiaolin Hu, and Ping Luo. 2020. Online knowledge distillation via collaborative learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.
[19]
Philipp Gysel, Jon Pimentel, M. Motamedi, and Soheil Ghiasi. 2018. Ristretto: A framework for empirical study of resource-efficient inference in convolutional neural networks. IEEE Transactions on Neural Networks and Learning Systems (2018). DOI:
[20]
Jiang Han, John W. Polak, Javier Barria, and Rajesh Krishnan. 2010. On the estimation of space-mean-speed from inductive loop detector data. Transportation Planning and Technology 33, 1 (2010), 91–104. DOI:arXiv:https://doi.org/10.1080/03081060903429421
[21]
Song Han, Jeff Pool, John Tran, and William J. Dally. 2015. Learning both Weights and Connections for Efficient Neural Networks. (2015). arXiv:1506.02626. Retrieved from https://arxiv.org/abs/1506.02626
[22]
Geoffrey Hinton, Nitish Srivastava, and Kevin Swersky. 2012. Neural networks for machine learning lecture 6a overview of mini-batch gradient descent. Cited on 14, 8 (2012), 2.
[23]
Shiva R. Iyer, Ulzee An, and Lakshminarayanan Subramanian. 2020. Forecasting sparse traffic congestion patterns using message-passing rnns. In Proceedings of the ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing.IEEE, 3772–3776.
[24]
Vipin Jain, Ashlesh Sharma, and Lakshminarayanan Subramanian. 2012. Road traffic congestion in the developing world. In Proceedings of the 2nd ACM Symposium on Computing for Development. 1–10.
[25]
Nicole Kobie. 2018. London is hacking its traffic lights to slash waiting times. (2018). Retrieved from https://www.wired.co.uk/article/traffic-lights-uk-london
[26]
Daniel Krajzewicz, Georg Hertkorn, Christian Feld, and Peter Wagner. 2002. SUMO (simulation of urban mobility); An open-source traffic simulation. In Proceedings of the 4th Middle East Symposium on Simulation and Modelling. 183–187.
[27]
Xu lan, Xiatian Zhu, and Shaogang Gong. 2018. Knowledge distillation by on-the-fly native ensemble. In Proceedings of the Advances in Neural Information Processing Systems 31.S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett (Eds.), Curran Associates, Inc., 7517–7527. Retrieved from http://papers.nips.cc/paper/7980-knowledge-distillation-by-on-the-fly-native-ensemble.pdf
[28]
John D. C. Little, M. D. Kelson, and N. H. Gartner. 1981. MAXBAND: A versatile program for setting signals on arteries and triangular networks. (1981).
[29]
P. Lowrie. 1990. SCATS: Sydney co-ordinated adaptive traffic system: A traffic responsive method of controlling urban traffic.
[30]
Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, and William J. Dally. 2017. Exploring the Regularity of Sparse Structure in Convolutional Neural Networks. (2017). arXiv:1705.08922. Retrieved from https://arxiv.org/abs/1705.08922
[31]
Vaishnavh Nagarajan, Aditya Krishna Menon, Srinadh Bhojanapalli, Hossein Mobahi, and Sanjiv Kumar. 2023. On student-teacher deviations in distillation: Does it pay to disobey. In Proceedings of the 37th Conference on Neural Information Processing Systems.
[32]
Tomoki Nishi, Keisuke Otaki, Keiichiro Hayakawa, and Takayoshi Yoshimura. 2018. Traffic signal control based on reinforcement learning with graph convolutional neural nets. In Proceedings of the 2018 21st International Conference on Intelligent Transportation Systems. 877–883. DOI:
[33]
OpenCV. 2021. Optical Flow. (2021). Retrieved Oct 29, 2021 fromhttps://docs.opencv.org/3.4.15/d4/dee/tutorial_optical_flow.html
[34]
S. Peeta and P. Zhang. 2002. Counting device selection and reliability: Synthesis study. Joint Transportation Research Program (2002).
[35]
L. A. Prashanth and Shalabh Bhatnagar. 2011. Reinforcement learning with average cost for adaptive control of traffic lights at intersections. In Proceedings of the 2011 14th International IEEE Conference on Intelligent Transportation Systems. 1640–1645. DOI:
[36]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. 779–788. DOI:
[37]
Stefano Giovanni Rizzo, Giovanna Vantini, and Sanjay Chawla. 2019. Time critic policy gradient methods for traffic signal control in complex and congested scenarios. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.Association for Computing Machinery, New York, NY, 1654–1664. DOI:
[38]
Roger P. Roess, Elena S. Prassas, and William R. McShane. 2004. Traffic Engineering. Pearson/Prentice Hall. 04295239Retrieved from https://books.google.co.in/books?id=OYNPAAAAMAAJ
[39]
Andrea Sassella, Francesco Abbr., Simone Formentin, Andrea G. Bianchessi, and Sergio M. Savaresi. 2023. On queue length estimation in urban traffic intersections via inductive loops. In Proceedings of the 2023 American Control Conference. 1135–1140. DOI:
[40]
Omais Shafi, Sachin Chauhan, Gayathri Ananthanarayanan, and Rijurekha Sen. 2022. DynCNN: Application dynamism and ambient temperature aware neural network scheduler in edge devices for traffic control. In Proceedings of the ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies.Association for Computing Machinery, New York, NY, 513–528. DOI:
[41]
Elise van der Pol and Frans A. Oliehoek. 2016. Coordinated deep reinforcement learners for traffic light control. Retrieved from https://api.semanticscholar.org/CorpusID:198950131
[42]
Pravin Varaiya. 2013. Max pressure control of a network of signalized intersections. Transportation Research Part C: Emerging Technologies 36 (2013), 177–195. DOI:
[43]
Shiva Verma. 2019. Understanding different Loss Functions for Neural Networks. (2019). Retrieved from https://towardsdatascience.com/loss-functions-neural-networks-dd1ed0274718
[44]
Yanan Wang, Tong Xu, Xin Niu, Chang Tan, Enhong Chen, and Hui Xiong. 2019. STMARL: A spatio-temporal multi-agent reinforcement learning approach for cooperative traffic light control. IEEE Transactions on Mobile Computing 21 (2019), 2228–2242.
[45]
Hua Wei, Chacha Chen, Guanjie Zheng, Kan Wu, Vikash Gayah, Kai Xu, and Zhenhui Li. 2019. PressLight: Learning max pressure control to coordinate traffic signals in arterial network. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.1290–1298.
[46]
Hua Wei, Nan Xu, Huichu Zhang, Guanjie Zheng, Xinshi Zang, Chacha Chen, Weinan Zhang, Yamin Zhu, Kai Xu, and Zhenhui Li. 2019. CoLight: Learning network-level cooperation for traffic signal control. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management.
[47]
Hua Wei, Guanjie Zheng, Vikash Gayah, and Zhenhui Li. 2019. A Survey on Traffic Signal Control Methods. (2019). arXiv:1904.08117. Retrieved from https://arxiv.org/abs/1904.08117
[48]
David Williams. 2012. Too many traffic lights make congestion worse. (2012). Retrieved from https://www.standard.co.uk/hp/front/too-many-traffic-lights-make-congestion-worse-6676646.html)
[49]
Tien-Ju Yang, Yu-Hsin Chen, and Vivienne Sze. 2017. Designing energy-efficient convolutional neural networks using energy-aware pruning. 2017 IEEE Conference on Computer Vision and Pattern Recognition (2017). DOI:
[50]
Jiecao Yu, Andrew Lukefahr, David Palframan, Ganesh Dasika, Reetuparna Das, and Scott Mahlke. 2017. Scalpel: Customizing DNN pruning to the underlying hardware parallelism. SIGARCH Comput. Archit. News 45, 2 (2017), 548–560. DOI:
[51]
Xinshi Zang, Huaxiu Yao, Guanjie Zheng, Nan Xu, Kai Xu, and Zhenhui Li. 2020. MetaLight: Value-based meta-reinforcement learning for traffic signal control. Proceedings of the AAAI Conference on Artificial Intelligence 34, 01 (2020), 1153–1160. DOI:
[52]
Huichu Zhang, Zhenhui Li, Siyuan Feng, Chang Liu, Yaoyao Ding, Yichen Zhu, Zihan Zhou, Weinan Zhang, Yong Yu, and Haiming Jin. 2019. CityFlow: A multi-agent reinforcement learning environment for large scale city traffic scenario. The World Wide Web Conference on (2019). DOI:

Cited By

View all
  • (2024)WebLight: DRL based Intersection Control in Developing Countries without Reliable CamerasProceedings of the 7th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies10.1145/3674829.3675075(201-210)Online publication date: 8-Jul-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Journal on Computing and Sustainable Societies
ACM Journal on Computing and Sustainable Societies  Volume 2, Issue 2
June 2024
421 pages
EISSN:2834-5533
DOI:10.1145/3613748
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 May 2024
Online AM: 19 February 2024
Accepted: 26 January 2024
Revised: 09 January 2024
Received: 09 January 2024
Published in ACMJCSS Volume 2, Issue 2

Check for updates

Author Tags

  1. Traffic signal control
  2. real traffic dataset

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)134
  • Downloads (Last 6 weeks)11
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)WebLight: DRL based Intersection Control in Developing Countries without Reliable CamerasProceedings of the 7th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies10.1145/3674829.3675075(201-210)Online publication date: 8-Jul-2024

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media