default search action
18th ECCV 2024: Milan, Italy - Part XXXVIII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXXVIII. Lecture Notes in Computer Science 15096, Springer 2025, ISBN 978-3-031-72919-5 - Luchuan Song, Pinxin Liu, Lele Chen, Guojun Yin, Chenliang Xu:
Tri2-plane: Thinking Head Avatar via Feature Pyramid. 1-20 - Yuzhong Zhao, Yue Liu, Zonghao Guo, Weijia Wu, Chen Gong, Qixiang Ye, Fang Wan:
ControlCap: Controllable Region-Level Captioning. 21-38 - Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, Tianzhu Zhang, Liang Wang:
Free Lunch for Gait Recognition: A Novel Relation Descriptor. 39-56 - Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. 57-75 - Xiaoran Zhang, John C. Stendahl, Lawrence H. Staib, Albert J. Sinusas, Alex Wong, James S. Duncan:
Adaptive Correspondence Scoring for Unsupervised Medical Image Registration. 76-92 - Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel:
MaxFusion: Plug&Play Multi-modal Generation in Text-to-Image Diffusion Models. 93-110 - Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski:
Watch Your Steps: Local Image and Scene Editing by Text Instructions. 111-129 - Hritam Basak, Zhaozheng Yin:
Forget More to Learn More: Domain-Specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation. 130-148 - Anh Thai, Weiyao Wang, Hao Tang, Stefan Stojanov, James M. Rehg, Matt Feiszli:
3˟ 2: 3D Object Part Segmentation by 2D Semantic Correspondences. 149-166 - Zhengyuan Yang, Jianfeng Wang, Linjie Li, Kevin Lin, Chung-Ching Lin, Zicheng Liu, Lijuan Wang:
Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation. 167-184 - Gustavo Pérez, Daniel Sheldon, Grant Van Horn, Subhransu Maji:
Human-in-the-Loop Visual Re-ID for Population Size Estimation. 185-202 - Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:
SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation. 203-220 - Weiwei Sun, Eduard Trulls, Yang-Che Tseng, Sneha Sambandam, Gopal Sharma, Andrea Tagliasacchi, Kwang Moo Yi:
PointNeRF++: A Multi-scale, Point-Based Neural Radiance Field. 221-238 - Junfei Xiao, Ziqi Zhou, Wenxuan Li, Shiyi Lan, Jieru Mei, Zhiding Yu, Bingchen Zhao, Alan L. Yuille, Yuyin Zhou, Cihang Xie:
A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties. 239-258 - Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang:
UMG-CLIP: A Unified Multi-granularity Vision Generalist for Open-World Understanding. 259-277 - Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen, Simon Niklaus, Jianming Zhang, Jia-Bin Huang, Feng Liu:
Fast View Synthesis of Casual Videos with Soup-of-Planes. 278-296 - Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy, Jitendra Malik:
Adaptive Human Trajectory Prediction via Latent Corridors. 297-314 - Rohan Choudhury, Koichiro Niinuma, Kris M. Kitani, László A. Jeni:
Video Question Answering with Procedural Programs. 315-332 - Wenhui Zhu, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang:
DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification. 333-351 - Dong Huo, Zixin Guo, Xinxin Zuo, Zhihao Shi, Juwei Lu, Peng Dai, Songcen Xu, Li Cheng, Yee-Hong Yang:
TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling. 352-368 - Rongchang Li, Zhenhua Feng, Tianyang Xu, Linze Li, Xiaojun Wu, Muhammad Awais, Sara Atito Ali Ahmed, Josef Kittler:
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition. 369-388 - Bin Xia, Shiyin Wang, Yingfan Tao, Yitong Wang, Jiaya Jia:
LLMGA: Multimodal Large Language Model Based Generation Assistant. 389-406 - Mi Luo, Zihui Xue, Alex Dimakis, Kristen Grauman:
Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos. 407-425 - Sriram Narayanan, Mani Ramanagopal, Mark Sheinin, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan:
Shape from Heat Conduction. 426-444 - Moritz Heep, Eduard Zell:
An Adaptive Screen-Space Meshing Approach for Normal Integration. 445-461 - Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang:
Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation. 462-478 - Eugene Valassakis, Guillermo Garcia-Hernando:
HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning. 479-496
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.