Qian, T., Chen, J., Zhuo, L., Jiao, Y., & Jiang, Y.-G. (2024). NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario. Proceedings of the AAAI Conference on Artificial Intelligence, 38(5), 4542-4550. https://doi.org/10.1609/aaai.v38i5.28253