default search action
18th ECCV 2024: Milan, Italy - Part XVIII
- Ales Leonardis
, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XVIII. Lecture Notes in Computer Science 15076, Springer 2025, ISBN 978-3-031-72648-4 - Cheng Shi, Yulin Zhang, Bin Yang, Jiajin Tang, Yuexin Ma, Sibei Yang:
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation. 1-18 - Risa Shinoda
, Kaede Shiohara
:
PetFace: A Large-Scale Dataset and Benchmark for Animal Identification. 19-36 - Tianqi Liu
, Guangcong Wang
, Shoukang Hu
, Liao Shen
, Xinyi Ye
, Yuhang Zang
, Zhiguo Cao
, Wei Li
, Ziwei Liu
:
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo. 37-53 - Davide Cozzolino
, Giovanni Poggi
, Matthias Nießner
, Luisa Verdoliva
:
Zero-Shot Detection of AI-Generated Images. 54-72 - Kecheng Zheng, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen:
DreamLIP: Language-Image Pre-training with Long Captions. 73-90 - Ruijie Yao
, Sheng Jin
, Lumin Xu
, Wang Zeng
, Wentao Liu
, Chen Qian
, Ping Luo
, Ji Wu
:
GKGNet: Group K-Nearest Neighbor Based Graph Convolutional Network for Multi-label Image Recognition. 91-107 - Xinyu Xu
, Shengcheng Luo
, Yanchao Yang
, Yong-Lu Li
, Cewu Lu
:
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-Level Control. 108-125 - Sheng Jin
, Shuhuai Li
, Tong Li
, Wentao Liu
, Chen Qian
, Ping Luo
:
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception. 126-146 - Jiaqi Xu, Mengyang Wu, Xiaohu You, Chi-Wing Fu, Qi Dou, Pheng-Ann Heng:
Towards Real-World Adverse Weather Image Restoration: Enhancing Clearness and Semantics with Vision-Language Models. 147-164 - Yifan Li, Anh Dao, Wentao Bao, Zhen Tan, Tianlong Chen, Huan Liu, Yu Kong:
Facial Affective Behavior Analysis with Instruction Tuning. 165-186 - Xiaoyi Bao, Siyang Sun, Shuailei Ma, Kecheng Zheng, Yuxin Guo, Guosheng Zhao, Yun Zheng, Xingang Wang:
CoReS: Orchestrating the Dance of Reasoning and Segmentation. 187-204 - Haoyu Zhao, Tianyi Lu, Jiaxi Gu, Xing Zhang, Qingping Zheng, Zuxuan Wu, Hang Xu, Yu-Gang Jiang:
MagDiff: Multi-alignment Diffusion for High-Fidelity Video Generation and Editing. 205-221 - Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia:
MambaIR: A Simple Baseline for Image Restoration with State-Space Model. 222-241 - Ishan Khatri, Kyle Vedder, Neehar Peri, Deva Ramanan, James Hays:
I Can't Believe It's Not Scene Flow! 242-257 - Zhonghang Liu
, Panzhong Lu
, Guoyang Xie
, Zhichao Lu
, Wen-Yan Lin
:
Rethinking Unsupervised Outlier Detection via Multiple Thresholding. 258-275 - Bowen Zhang
, Tianyu Yang
, Yu Li
, Lei Zhang
, Xi Zhao
:
Compress3D: A Compressed Latent Space for 3D Generation from a Single Image. 276-292 - Nhat Le
, Khoa Do
, Xuan Bui
, Tuong Do
, Erman Tjiputra
, Quang D. Tran
, Anh Nguyen
:
Scalable Group Choreography via Variational Phase Manifold Learning. 293-311 - Mingfang Zhang
, Yifei Huang
, Ruicong Liu
, Yoichi Sato
:
Masked Video and Body-Worn IMU Autoencoder for Egocentric Action Recognition. 312-330 - Jian Ma, Wenguan Wang, Yi Yang, Feng Zheng:
Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-Driven Diffusion. 331-349 - Huankang Guan
, Rynson W. H. Lau
:
PoseSOR: Human Pose Can Guide Our Attention. 350-366 - Bu Jin, Yupeng Zheng, Pengfei Li, Weize Li, Yuhang Zheng, Sujie Hu, Xinyu Liu, Jinwei Zhu, Zhijie Yan, Haiyang Sun, Kun Zhan, Peng Jia, Xiaoxiao Long, Yilun Chen, Hao Zhao:
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes. 367-384 - Minjung Kim
, Hyung Suk Lim
, Soonyoung Lee, Bumsoo Kim, Gunhee Kim:
Bi-directional Contextual Attention for 3D Dense Captioning. 385-401 - Peng Xiao
, Yi Xie
, Xuemiao Xu
, Weihong Chen
, Huaidong Zhang
:
Multi-person Pose Forecasting with Individual Interaction Perceptron and Prior Learning. 402-419 - Fangcen Liu, Chenqiang Gao, Yaming Zhang, Junjie Guo, Jinghao Wang, Deyu Meng:
InfMAE: A Foundation Model in the Infrared Modality. 420-437 - Bin-Shih Wu
, Hong-En Chen
, Sheng-Yu Huang
, Yu-Chiang Frank Wang
:
TPA3D: Triplane Attention for Fast Text-to-3D Generation. 438-455 - Jiangming Shi
, Xiangbo Yin
, Yeyun Chen
, Yachao Zhang
, Zhizhong Zhang
, Yuan Xie
, Yanyun Qu
:
Multi-memory Matching for Unsupervised Visible-Infrared Person Re-identification. 456-474 - Xi Chen, Zhiheng Liu, Mengting Chen, Yutong Feng, Yu Liu, Yujun Shen, Hengshuang Zhao:
LivePhoto: Real Image Animation with Text-Guided Motion Control. 475-491
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.