default search action
18th ECCV 2024: Milan, Italy - Part LXII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXII. Lecture Notes in Computer Science 15120, Springer 2025, ISBN 978-3-031-73032-0 - Aayam Shrestha, Pan Liu, Germán Ros, Kai Yuan, Alan Fern:
Generating Physically Realistic and Directable Human Motions from Multi-modal Inputs. 1-17 - Nikita Karaev, Ignacio Rocco, Benjamin Graham, Natalia Neverova, Andrea Vedaldi, Christian Rupprecht:
CoTracker: It Is Better to Track Together. 18-35 - Ziyi Lin, Dongyang Liu, Renrui Zhang, Peng Gao, Longtian Qiu, Han Xiao, Han Qiu, Wenqi Shao, Keqin Chen, Jiaming Han, Siyuan Huang, Yichi Zhang, Xuming He, Yu Qiao, Hongsheng Li:
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models. 36-55 - Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Dan Wan, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang:
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology. 56-73 - Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu:
Improving Adversarial Transferability via Model Alignment. 74-92 - Wenhao Ding, Yulong Cao, Ding Zhao, Chaowei Xiao, Marco Pavone:
RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios. 93-110 - Hao Tang, Weiyao Wang, Pierre Gleize, Matt Feiszli:
ADen: Adaptive Density Representations for Sparse-View Camera Pose Estimation. 111-128 - Yunsong Zhou, Linyan Huang, Qingwen Bu, Jia Zeng, Tianyu Li, Hang Qiu, Hongzi Zhu, Minyi Guo, Yu Qiao, Hongyang Li:
Embodied Understanding of Driving Scenarios. 129-148 - Chris Zhang, Sourav Biswas, Kelvin Wong, Kion Fallah, Lunjun Zhang, Dian Chen, Sergio Casas, Raquel Urtasun:
Learning to Drive via Asymmetric Self-Play. 149-168 - Zhening Huang, Xiaoyang Wu, Xi Chen, Hengshuang Zhao, Lei Zhu, Joan Lasenby:
OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation. 169-185 - Xijun Wang, Junbang Liang, Chun-Kai Wang, Kenan Deng, Yu Lou, Ming C. Lin, Shan Yang:
ViLA: Efficient Video-Language Alignment for Video Question Answering. 186-204 - Rohit Girdhar, Mannat Singh, Andrew Brown, Quentin Duval, Samaneh Azadi, Sai Saketh Rambhatla, Akbar Shah, Xi Yin, Devi Parikh, Ishan Misra:
Factorizing Text-to-Video Generation by Explicit Image Conditioning. 205-224 - Yang Zhao, Yanwu Xu, Zhisheng Xiao, Haolin Jia, Tingbo Hou:
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices. 225-242 - Yiyang Su, Minchul Kim, Feng Liu, Anil K. Jain, Xiaoming Liu:
Open-Set Biometrics: Beyond Good Closed-Set Models. 243-261 - Siyuan Cheng, Guangyu Shen, Kaiyuan Zhang, Guanhong Tao, Shengwei An, Hanxi Guo, Shiqing Ma, Xiangyu Zhang:
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening. 262-281 - Fengyuan Liu, Haochen Luo, Yiming Li, Philip Torr, Jindong Gu:
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution. 282-301 - Opher Bar Nathan, Deborah Levy, Tali Treibitz, Dan Rosenbaum:
Osmosis: RGBD Diffusion Prior for Underwater Image Restoration. 302-319 - Feixiang Zhou, Bryan M. Williams, Hossein Rahmani:
Towards Adaptive Pseudo-Label Learning for Semi-Supervised Temporal Action Localization. 320-338 - Anders Holst, Niels Chr. Overgaard:
Computing the Lipschitz Constant Needed for Fast Scene Recovery from CASSI Measurements. 339-353 - Yu Chi, Fangneng Zhan, Sibo Wu, Christian Theobalt, Adam Kortylewski:
DatasetNeRF: Efficient 3D-Aware Data Factory with Generative Radiance Fields. 354-372 - Mikhail Okunev, Marc Mapeke, Benjamin Attal, Christian Richardt, Matthew O'Toole, James Tompkin:
Flowed Time of Flight Radiance Fields. 373-389 - Haoran Li, Long Ma, Haolin Shi, Yanbin Hao, Yong Liao, Lechao Cheng, Peng Yuan Zhou:
3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing. 390-406 - Chaitanya Patel, Shaojie Bai, Te-Li Wang, Jason M. Saragih, Shih-En Wei:
Fast Registration of Photorealistic Avatars for VR Facial Animation. 407-423 - Cristina Mata, Kanchana Ranasinghe, Michael S. Ryoo:
CoPT: Unsupervised Domain Adaptive Segmentation Using Domain-Agnostic Text Embeddings. 424-440 - Ziwei Yao, Ruiping Wang, Xilin Chen:
HiFi-Score: Fine-Grained Image Description Evaluation with Hierarchical Parsing Graphs. 441-458 - Anas Mahmoud, Ali Harakeh, Steven L. Waslander:
Image-to-Lidar Relational Distillation for Autonomous Driving Data. 459-475 - Gemma Canet Tarres, Zhe Lin, Zhifei Zhang, Jianming Zhang, Yizhi Song, Dan Ruta, Andrew Gilbert, John P. Collomosse, Soo Ye Kim:
Thinking Outside the BBox: Unconstrained Generative Object Compositing. 476-495
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.