default search action

combined dblp search
author search
venue search
publication search

ask others

Di Hu 0001

> Home > Persons

Person information

affiliation: Baidu Research, Big Data Laboratory, Beijing, China
affiliation: Renmin University of China, Gaoling School of Artificial Intelligence, Beijing, China
affiliation (former): Northwestern Polytechnical University, School of Computer Science and Engineering, OPTIMAL, Xi'an, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ml/LiHLXXD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/LiHLXXD24
Xingjian Li, Di Hu, Xuhong Li, Haoyi Xiong, Cheng-Zhong Xu, Dejing Dou:
Towards accurate knowledge transfer via target-awareness representation disentanglement. Mach. Learn. 113(2): 699-723 (2024)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/YangZWWNH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/YangZWWNH24
Zequn Yang, Han Zhang, Yake Wei, Zheng Wang, Feiping Nie, Di Hu:
Geometric-inspired graph-based Incomplete Multi-view Clustering. Pattern Recognit. 147: 110082 (2024)
[c34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangLLD0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangLLD0L24
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer. AAAI 2024: 5669-5677
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WeiF0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WeiF0024
Yake Wei, Ruoxuan Feng, Zihe Wang, Di Hu:
Enhancing Multimodal Cooperation via Sample-Level Modality Valuation. CVPR 2024: 27328-27337
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/WeiLFH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/WeiLFH24
Yake Wei, Siwei Li, Ruoxuan Feng, Di Hu:
Diagnosing and Re-learning for Balanced Multimodal Learning. ECCV (64) 2024: 71-86
[c31]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/YangWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YangWL024
Zequn Yang, Yake Wei, Ce Liang, Di Hu:
Quantifying and Enhancing Multi-modal Robustness with Modality Preference. ICLR 2024
[c30]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/Wei024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Wei024
Yake Wei, Di Hu:
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance. ICML 2024
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/XiaWPWZHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/XiaWPWZHL24
Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li:
Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs. ICRA 2024: 2073-2080
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiD024
Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. ACM Multimedia 2024: 5997-6005
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/SunZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/SunZ024
Peiwen Sun, Honggang Zhang, Di Hu:
Unveiling and Mitigating Bias in Audio Visual Segmentation. ACM Multimedia 2024: 7259-7268
[i48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-06244
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-06244
Zequn Yang, Yake Wei, Ce Liang, Di Hu:
Quantifying and Enhancing Multi-modal Robustness with Modality Preference. CoRR abs/2402.06244 (2024)
[i47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-18947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-18947
Qingyang Zhang, Yake Wei, Zongbo Han, Huazhu Fu, Xi Peng, Cheng Deng, Qinghua Hu, Cai Xu, Jie Wen, Di Hu, Changqing Zhang:
Multimodal Fusion on Low-quality Data: A Comprehensive Survey. CoRR abs/2404.18947 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17730
Yake Wei, Di Hu:
MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance. CoRR abs/2405.17730 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-00439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-00439
Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, Ping Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li:
Learning Manipulation by Predicting Interaction. CoRR abs/2406.00439 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-19853
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19853
Yutao Zhu, Kun Zhou, Kelong Mao, Wentong Chen, Yiding Sun, Zhipeng Chen, Qian Cao, Yihan Wu, Yushuo Chen, Feng Wang, Lei Zhang, Junyi Li, Xiaolei Wang, Lei Wang, Beichen Zhang, Zican Dong, Xiaoxue Cheng, Yuhan Chen, Xinyu Tang, Yupeng Hou, Qiangqiang Ren, Xincheng Pang, Shufang Xie, Wayne Xin Zhao, Zhicheng Dou, Jiaxin Mao, Yankai Lin, Ruihua Song, Jun Xu, Xu Chen, Rui Yan, Zhewei Wei, Di Hu, Wenbing Huang, Ze-Feng Gao, Yueguo Chen, Weizheng Lu, Ji-Rong Wen:
YuLan: An Open-source Large Language Model. CoRR abs/2406.19853 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-09705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-09705
Yake Wei, Siwei Li, Ruoxuan Feng, Di Hu:
Diagnosing and Re-learning for Balanced Multimodal Learning. CoRR abs/2407.09705 (2024)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10947
Yaoting Wang, Peiwen Sun, Yuanchao Li, Honggang Zhang, Di Hu:
Can Textual Semantics Mitigate Sounding Object Segmentation Preference? CoRR abs/2407.10947 (2024)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10957
Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. CoRR abs/2407.10957 (2024)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-16638
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-16638
Peiwen Sun, Honggang Zhang, Di Hu:
Unveiling and Mitigating Bias in Audio Visual Segmentation. CoRR abs/2407.16638 (2024)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-18743
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-18743
Jie Chen, Zhipeng Chen, Jiapeng Wang, Kun Zhou, Yutao Zhu, Jinhao Jiang, Yingqian Min, Wayne Xin Zhao, Zhicheng Dou, Jiaxin Mao, Yankai Lin, Ruihua Song, Jun Xu, Xu Chen, Rui Yan, Zhewei Wei, Di Hu, Wenbing Huang, Ji-Rong Wen:
Towards Effective and Efficient Continual Pre-training of Large Language Models. CoRR abs/2407.18743 (2024)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-01366
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-01366
Ruoxuan Feng, Di Hu, Wenke Ma, Xuelong Li:
Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation. CoRR abs/2408.01366 (2024)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-02912
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02912
Jingxian Lu, Wenke Xia, Dong Wang, Zhigang Wang, Bin Zhao, Di Hu, Xuelong Li:
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance. CoRR abs/2408.02912 (2024)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-05107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-05107
Xincheng Pang, Wenke Xia, Zhigang Wang, Bin Zhao, Di Hu, Dong Wang, Xuelong Li:
Depth Helps: Improving Pre-trained RGB-based Policy with Depth Information Injection. CoRR abs/2408.05107 (2024)
2023
[j6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/aeog/HeidlerMHJLGWZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aeog/HeidlerMHJLGWZ23
Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised audiovisual representation learning for remote sensing data. Int. J. Appl. Earth Obs. Geoinformation 116: 103130 (2023)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/HuWNWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/HuWNWL23
Di Hu, Zheng Wang, Feiping Nie, Rong Wang, Xuelong Li:
Self-Supervised Learning for Heterogeneous Audiovisual Scene Analysis. IEEE Trans. Multim. 25: 3534-3545 (2023)
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XuFZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XuFZH23
Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu:
MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning. ICASSP 2023: 1-5
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/Deng00WX023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/Deng00WX023
Andong Deng, Xingjian Li, Di Hu, Tianyang Wang, Haoyi Xiong, Cheng-Zhong Xu:
Towards Inadequately Pre-trained Models in Transfer Learning. ICCV 2023: 19340-19351
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiX023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiX023
Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. INTERSPEECH 2023: 3442-3446
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinRX0WX0SZJ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinRX0WX0SZJ023
Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu:
TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World. ACM Multimedia 2023: 1303-1313
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiH023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiH023
Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. ACM Multimedia 2023: 7808-7816
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/ZhouZOZH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/ZhouZOZH23
Xinchi Zhou, Dongzhan Zhou, Wanli Ouyang, Hang Zhou, Di Hu:
SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance. WACV 2023: 5157-5166
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/ZhouZHZO23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/ZhouZHZO23
Xinchi Zhou, Dongzhan Zhou, Di Hu, Hang Zhou, Wanli Ouyang:
Exploiting Visual Context Semantics for Sound Source Localization. WACV 2023: 5188-5197
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-05880
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-05880
Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu:
TikTalk: A Multi-Modal Dialogue Dataset for Real-World Chitchat. CoRR abs/2301.05880 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-03533
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03533
Ruoxuan Feng, Wenke Xia, Di Hu:
Revisiting Pre-training in Audio-Visual Learning. CoRR abs/2302.03533 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-10912
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10912
Wenke Xia, Xu Zhao, Xincheng Pang, Changqing Zhang, Di Hu:
Balanced Audiovisual Dataset for Imbalance Analysis. CoRR abs/2302.10912 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-05338
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-05338
Ruize Xu, Ruoxuan Feng, Shi-Xiong Zhang, Di Hu:
MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning. CoRR abs/2303.05338 (2023)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-07775
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-07775
Wenke Xia, Xingjian Li, Andong Deng, Haoyi Xiong, Dejing Dou, Di Hu:
Robust Cross-Modal Knowledge Distillation for Unconstrained Videos. CoRR abs/2304.07775 (2023)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17993
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17993
Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. CoRR abs/2305.17993 (2023)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09431
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09431
Wenxuan Hou, Guangyao Li, Yapeng Tian, Di Hu:
Towards Long Form Audio-visual Video Understanding. CoRR abs/2306.09431 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05421
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05421
Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. CoRR abs/2308.05421 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-06255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-06255
Yake Wei, Ruoxuan Feng, Zihe Wang, Di Hu:
Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation. CoRR abs/2309.06255 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07929
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07929
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer. CoRR abs/2309.07929 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-02847
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-02847
Wenke Xia, Dong Wang, Xincheng Pang, Zhigang Wang, Bin Zhao, Di Hu:
Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs. CoRR abs/2311.02847 (2023)
2022
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/HuWQLSW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/HuWQLSW22
Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen:
Class-Aware Sounding Objects Localization via Audiovisual Correspondence. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9844-9859 (2022)
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiuQZ0L0ZZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiuQZ0L0ZZ22
Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou:
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing. AAAI 2022: 1801-1809
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZhouZ0Z00O22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhouZ0Z00O22
Dongzhan Zhou, Xinchi Zhou, Di Hu, Hang Zhou, Lei Bai, Ziwei Liu, Wanli Ouyang:
SepFusion: Finding Optimal Fusion Structures for Visual Sound Separation. AAAI 2022: 3544-3552
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/PengWD0H22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/PengWD0H22
Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu:
Balanced Multimodal Learning via On-the-fly Gradient Modulation. CVPR 2022: 8228-8237
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LiWTXW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiWTXW022
Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CVPR 2022: 19086-19096
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/FanHZCXH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/FanHZCXH22
Yingzi Fan, Longfei Han, Yue Zhang, Lechao Cheng, Chen Xia, Di Hu:
Dual Domain-Adversarial Learning for Audio-Visual Saliency Prediction. HCMA@MM 2022: 15-23
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-06406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-06406
Xian Liu, Rui Qian, Hang Zhou, Di Hu, Weiyao Lin, Ziwei Liu, Bolei Zhou, Xiaowei Zhou:
Visual Sound Localization in the Wild by Cross-Modal Interference Erasing. CoRR abs/2202.06406 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-04668
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-04668
Andong Deng, Xingjian Li, Zhibing Li, Di Hu, Chengzhong Xu, Dejing Dou:
Inadequately Pre-trained Models are Better Feature Extractors. CoRR abs/2203.04668 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-13535
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-13535
Xinchi Zhou, Dongzhan Zhou, Wanli Ouyang, Hang Zhou, Ziwei Liu, Di Hu:
SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance. CoRR abs/2203.13535 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-14072
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-14072
Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CoRR abs/2203.14072 (2022)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15332
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15332
Xiaokang Peng, Yake Wei, Andong Deng, Dong Wang, Di Hu:
Balanced Multimodal Learning via On-the-fly Gradient Modulation. CoRR abs/2203.15332 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-05220
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-05220
Yingzi Fan, Longfei Han, Yue Zhang, Lechao Cheng, Chen Xia, Di Hu:
Dual Domain-Adversarial Learning for Audio-Visual Saliency Prediction. CoRR abs/2208.05220 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-09579
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-09579
Yake Wei, Di Hu, Yapeng Tian, Xuelong Li:
Learning in Audio-visual Context: A Review, Analysis, and New Perspective. CoRR abs/2208.09579 (2022)
2021
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/kais/YangXHXWZS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kais/YangXHXWZS21
Sijia Yang, Haoyi Xiong, Di Hu, Kaibo Xu, Licheng Wang, Peizhen Zhu, Zeyi Sun:
Generalising combinatorial discriminant analysis through conditioning truncated Rayleigh flow. Knowl. Inf. Syst. 63(8): 2189-2208 (2021)
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/002800D21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/002800D21
Dong Wang, Di Hu, Xingjian Li, Dejing Dou:
Temporal Relational Modeling with Self-Supervision for Action Segmentation. AAAI 2021: 2729-2737
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/TianHX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/TianHX21
Yapeng Tian, Di Hu, Chenliang Xu:
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation. CVPR 2021: 2745-2754
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/BaiWW0D21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/BaiWW0D21
Zechen Bai, Zhigang Wang, Jian Wang, Di Hu, Errui Ding:
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification. CVPR 2021: 12914-12923
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02026
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02026
Yapeng Tian, Di Hu, Chenliang Xu:
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation. CoRR abs/2104.02026 (2021)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-12961
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-12961
Zechen Bai, Zhigang Wang, Jian Wang, Di Hu, Errui Ding:
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification. CoRR abs/2104.12961 (2021)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-00688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-00688
Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised Audiovisual Representation Learning for Remote Sensing Data. CoRR abs/2108.00688 (2021)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2112-11749
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-11749
Di Hu, Yake Wei, Rui Qian, Weiyao Lin, Ruihua Song, Ji-Rong Wen:
Class-aware Sounding Objects Localization via Audiovisual Correspondence. CoRR abs/2112.11749 (2021)
2020
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/HuLMJCJZD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/HuLMJCJZD20
Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou:
Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition. ECCV (24) 2020: 68-84
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/QianHDWXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/QianHDWXL20
Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. ECCV (20) 2020: 292-308
[c9]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/HuQJTWDLD20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HuQJTWDLD20
Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou:
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching. NeurIPS 2020
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2001-09414
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-09414
Di Hu, Zheng Wang, Haoyi Xiong, Dong Wang, Feiping Nie, Dejing Dou:
Curriculum Audiovisual Learning. CoRR abs/2001.09414 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07097
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07097
Di Hu, Lichao Mou, Qingzhong Wang, Junyu Gao, Yuansheng Hua, Dejing Dou, Xiao Xiang Zhu:
Ambient Sound Helps: Audiovisual Crowd Counting in Extreme Conditions. CoRR abs/2005.07097 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-08449
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-08449
Di Hu, Xuhong Li, Lichao Mou, Pu Jin, Dong Chen, Liping Jing, Xiaoxiang Zhu, Dejing Dou:
Cross-Task Transfer for Multimodal Aerial Scene Recognition. CoRR abs/2005.08449 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-06355
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-06355
Rui Qian, Di Hu, Heinrich Dinkel, Mengyue Wu, Ning Xu, Weiyao Lin:
Multiple Sound Sources Localization from Coarse to Fine. CoRR abs/2007.06355 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-05466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-05466
Di Hu, Rui Qian, Minyue Jiang, Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou:
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching. CoRR abs/2010.05466 (2020)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-08532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-08532
Xingjian Li, Di Hu, Xuhong Li, Haoyi Xiong, Zhi Ye, Zhipeng Wang, Chengzhong Xu, Dejing Dou:
Towards Accurate Knowledge Transfer via Target-awareness Representation Disentanglement. CoRR abs/2010.08532 (2020)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07508
Dong Wang, Di Hu, Xingjian Li, Dejing Dou:
Temporal Relational Modeling with Self-Supervision for Action Segmentation. CoRR abs/2012.07508 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/tip/HuNL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tip/HuNL19
Di Hu, Feiping Nie, Xuelong Li:
Discrete Spectral Hashing for Efficient Similarity Retrieval. IEEE Trans. Image Process. 28(3): 1080-1091 (2019)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/HuNL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/HuNL19
Di Hu, Feiping Nie, Xuelong Li:
Deep Binary Reconstruction for Cross-Modal Hashing. IEEE Trans. Multim. 21(4): 973-985 (2019)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/Hu0LN019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Hu0LN019
Di Hu, Dong Wang, Xuelong Li, Feiping Nie, Qi Wang:
Listen to the Image. CVPR 2019: 7972-7981
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HuNL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HuNL19
Di Hu, Feiping Nie, Xuelong Li:
Deep Multimodal Clustering for Unsupervised Audiovisual Learning. CVPR 2019: 9248-9257
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuWNL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuWNL19
Di Hu, Chengze Wang, Feiping Nie, Xuelong Li:
Dense Multimodal Fusion for Hierarchically Joint Representation. ICASSP 2019: 3941-3945
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-09115
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-09115
Di Hu, Dong Wang, Xuelong Li, Feiping Nie, Qi Wang:
Listen to the Image. CoRR abs/1904.09115 (2019)
2018
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1807-03094
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-03094
Di Hu, Feiping Nie, Xuelong Li:
Deep Co-Clustering for Unsupervised Audiovisual Learning. CoRR abs/1807.03094 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-03402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-03402
Di Hu, Feiping Nie, Xuelong Li:
Deep LDA Hashing. CoRR abs/1810.03402 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-03414
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-03414
Di Hu, Feiping Nie, Xuelong Li:
Dense Multimodal Fusion for Hierarchically Joint Representation. CoRR abs/1810.03414 (2018)
2017
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiHN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiHN17
Xuelong Li, Di Hu, Feiping Nie:
Large Graph Hashing with Spectral Rotation. AAAI 2017: 2203-2209
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/LiHL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LiHL17
Xuelong Li, Di Hu, Xiaoqiang Lu:
Image2song: Song Retrieval via Bridging Image Content and Lyric Words. ICCV 2017: 5650-5659
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiHN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiHN17
Xuelong Li, Di Hu, Feiping Nie:
Deep Binary Reconstruction for Cross-modal Hashing. ACM Multimedia 2017: 1398-1406
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1708-05127
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-05127
Xuelong Li, Di Hu, Feiping Nie:
Deep Binary Reconstruction for Cross-modal Hashing. CoRR abs/1708.05127 (2017)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1708-05851
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-05851
Xuelong Li, Di Hu, Xiaoqiang Lu:
Image2song: Song Retrieval via Bridging Image Content and Lyric Words. CoRR abs/1708.05851 (2017)
2016
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HuLL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HuLL16
Di Hu, Xuelong Li, Xiaoqiang Lu:
Temporal Multimodal Learning in Audiovisual Speech Recognition. CVPR 2016: 3574-3582
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/HuLL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/HuLL16
Di Hu, Xiaoqiang Lu, Xuelong Li:
Multimodal Learning via Exploring Deep Semantic Similarity. ACM Multimedia 2016: 342-346

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.