default search action
18th ECCV 2024: Milan, Italy - Part V
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part V. Lecture Notes in Computer Science 15063, Springer 2025, ISBN 978-3-031-72651-4 - Zhengdi Yu, Shaoli Huang, Yongkang Cheng, Tolga Birdal:
SignAvatars: A Large-Scale 3D Sign Language Holistic Motion Dataset and Benchmark. 1-19 - Lujun Li, Zimian Wei, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo:
AttnZero: Efficient Attention Discovery for Vision Transformers. 20-37 - Lujun Li, Haosen Sun, Shiwen Li, Peijie Dong, Wenhan Luo, Wei Xue, Qifeng Liu, Yike Guo:
Auto-GAS: Automated Proxy Discovery for Training-Free Generative Architecture Search. 38-55 - Haosen Sun, Lujun Li, Peijie Dong, Zimian Wei, Shitong Shao:
Auto-DAS: Automated Proxy Discovery for Training-Free Distillation-Aware Architecture Search. 56-73 - ZeXiang Liu, Yangguang Li, Youtian Lin, Xin Yu, Sida Peng, Yan-Pei Cao, Xiaojuan Qi, Xiaoshui Huang, Ding Liang, Wanli Ouyang:
UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation. 74-91 - Huabin Liu, Xiao Ma, Cheng Zhong, Yang Zhang, Weiyao Lin:
TimeCraft: Navigate Weakly-Supervised Temporal Grounded Video Question Answering via Bi-directional Reasoning. 92-107 - Haejoon Lee, Aswin C. Sankaranarayanan:
Spectral Subsurface Scattering for Material Classification. 108-124 - Benjin Zhu, Zhe Wang, Hongsheng Li:
nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding. 125-141 - Xianrui Luo, Huiqiang Sun, Juewen Peng, Zhiguo Cao:
Dynamic Neural Radiance Field from Defocused Monocular Video. 142-159 - Yang Liu, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang:
PiTe: Pixel-Temporal Alignment for Large Video-Language Model. 160-176 - Shadi Hamdan, Fatma Güney:
CarFormer: Self-driving with Learned Object-Centric Representations. 177-193 - Wei Wu, Qingnan Fan, Shuai Qin, Hong Gu, Ruoyu Zhao, Antoni B. Chan:
FreeDiff: Progressive Frequency Truncation for Image Editing with Diffusion Models. 194-209 - Cheng Shi, Yuchen Zhu, Sibei Yang:
Plain-Det: A Plain Multi-dataset Object Detector. 210-226 - Zhen Zhao, Zicheng Wang, Longyue Wang, Dian Yu, Yixuan Yuan, Luping Zhou:
Alternate Diverse Teaching for Semi-supervised Medical Image Segmentation. 227-243 - Wei Cong, Yang Cong, Yuyang Liu, Gan Sun:
Cs2K: Class-Specific and Class-Shared Knowledge Guidance for Incremental Semantic Segmentation. 244-261 - Dongliang Cao, Zorah Lähner, Florian Bernard:
Synchronous Diffusion for Unsupervised Smooth Non-rigid 3D Shape Matching. 262-281 - David Fan, Jue Wang, Shuai Liao, Zhikang Zhang, Vimal Bhat, Xinyu Li:
Text-Guided Video Masked Autoencoder. 282-298 - Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht:
Diffusion Models for Open-Vocabulary Segmentation. 299-317 - Peixi Xiong, Michael Kozuch, Nilesh Jain:
Textual-Visual Logic Challenge: Understanding and Reasoning in Text-to-Image Generation. 318-334 - Pengyu Zhang, Hao Yin, Zeren Wang, Wenyue Chen, Shengming Li, Dong Wang, Huchuan Lu, Xu Jia:
EvSign: Sign Language Recognition and Translation with Streaming Events. 335-351 - Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang:
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots. 352-367 - Huilin Zhu, Jingling Yuan, Zhengwei Yang, Yu Guo, Zheng Wang, Xian Zhong, Shengfeng He:
Zero-Shot Object Counting with Good Exemplars. 368-385 - Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei:
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering. 386-402 - Yanbo Wang, Wentao Zhao, Chuan Cao, Tianchen Deng, Jingchuan Wang, Weidong Chen:
SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds. 403-421 - Hyunjin Kim, Minhyuk Sung:
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation. 422-439 - Rajeev Yasarla, Manish Kumar Singh, Hong Cai, Yunxiao Shi, Jisoo Jeong, Yinhao Zhu, Shizhong Han, Risheek Garrepalli, Fatih Porikli:
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation. 440-458 - Yanyuan Qiao, Qianyi Liu, Jiajun Liu, Jing Liu, Qi Wu:
LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. 459-476
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.