default search action
Hehe Fan
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Books and Theses
- 2020
- [b1]Hehe Fan:
From Video Classification to Video Prediction: Deep Learning Approaches to Video Modelling. University of Technology Sydney, Australia, 2020
Journal Articles
- 2024
- [j15]Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan S. Kankanhalli, Joo-Hwee Lim:
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering. IEEE Trans. Multim. 26: 6131-6141 (2024) - [j14]Ming Li, Huazhu Fu, Shengfeng He, Hehe Fan, Jun Liu, Jussi Keppo, Mike Zheng Shou:
DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition. IEEE Trans. Multim. 26: 6297-6309 (2024) - [j13]Xiaobo Hu, Youfang Lin, Hehe Fan, Shuo Wang, Zhihao Wu, Kai Lv:
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation. ACM Trans. Multim. Comput. Commun. Appl. 20(7): 217:1-217:22 (2024) - 2023
- [j12]Hehe Fan, Yi Yang, Mohan S. Kankanhalli:
Point Spatio-Temporal Transformer Networks for Point Cloud Video Modeling. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2181-2192 (2023) - 2022
- [j11]Hehe Fan, Xin Yu, Yi Yang, Mohan S. Kankanhalli:
Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9918-9930 (2022) - [j10]Yi Cheng, Ying Sun, Hehe Fan, Tao Zhuo, Joo-Hwee Lim, Mohan S. Kankanhalli:
Entropy guided attention network for weakly-supervised action localization. Pattern Recognit. 129: 108718 (2022) - [j9]Hehe Fan, Tao Zhuo, Xin Yu, Yi Yang, Mohan S. Kankanhalli:
Understanding Atomic Hand-Object Interaction With Human Intention. IEEE Trans. Circuits Syst. Video Technol. 32(1): 275-285 (2022) - [j8]Hehe Fan, Ping Liu, Mingliang Xu, Yi Yang:
Unsupervised Visual Representation Learning via Dual-Level Progressive Similar Instance Selection. IEEE Trans. Cybern. 52(9): 8851-8861 (2022) - [j7]Linchao Zhu, Hehe Fan, Yawei Luo, Mingliang Xu, Yi Yang:
Temporal Cross-Layer Correlation Mining for Action Recognition. IEEE Trans. Multim. 24: 668-676 (2022) - 2021
- [j6]Linchao Zhu, Hehe Fan, Yawei Luo, Mingliang Xu, Yi Yang:
Few-Shot Common-Object Reasoning Using Common-Centric Localization Network. IEEE Trans. Image Process. 30: 4253-4262 (2021) - 2020
- [j5]Qianyu Feng, Yu Wu, Hehe Fan, Chenggang Yan, Mingliang Xu, Yi Yang:
Cascaded Revision Network for Novel Object Captioning. IEEE Trans. Circuits Syst. Video Technol. 30(10): 3413-3421 (2020) - [j4]Yuhang Ding, Hehe Fan, Mingliang Xu, Yi Yang:
Adaptive Exploration for Unsupervised Person Re-identification. ACM Trans. Multim. Comput. Commun. Appl. 16(1): 3:1-3:19 (2020) - [j3]Hehe Fan, Linchao Zhu, Yi Yang, Fei Wu:
Recurrent Attention Network with Reinforced Generator for Visual Dialog. ACM Trans. Multim. Comput. Commun. Appl. 16(3): 78:1-78:16 (2020) - 2018
- [j2]Hehe Fan, Liang Zheng, Chenggang Yan, Yi Yang:
Unsupervised Person Re-identification: Clustering and Fine-tuning. ACM Trans. Multim. Comput. Commun. Appl. 14(4): 83:1-83:18 (2018) - 2016
- [j1]Hong Zhang, Wenping Zhang, Wenhe Liu, Xin Xu, Hehe Fan:
Multiple kernel visual-auditory representation learning for retrieval. Multim. Tools Appl. 75(15): 9169-9184 (2016)
Conference and Workshop Papers
- 2024
- [c24]Yuze Hao, Jianrong Zhang, Tao Zhuo, Fuan Wen, Hehe Fan:
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling. AAAI 2024: 2076-2084 - [c23]Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang:
DocMSU: A Comprehensive Benchmark for Document-Level Multimodal Sarcasm Understanding. AAAI 2024: 17933-17941 - [c22]Ruijie Quan, Wenguan Wang, Fan Ma, Hehe Fan, Yi Yang:
Clustering for Protein Representation Learning. CVPR 2024: 319-329 - [c21]Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui Xu, Hangyu Liu, Sicong Leng, Jiangming Liu, Hehe Fan, Dajiu Huang, Jing Feng, Linli Chen, Can Zhang, Xuhuan Li, Hao Zhang, Jianhang Chen, Qimei Cui, Xiaofeng Tao:
Uncovering what, why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly. CVPR 2024: 18793-18803 - [c20]Wei Li, Hehe Fan, Yongkang Wong, Yi Yang, Mohan S. Kankanhalli:
Improving Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning. ICML 2024 - [c19]Wu Chen, Hehe Fan, Qiuping Jiang, Chao Huang, Yi Yang:
Progressive Point Cloud Denoising with Cross-Stage Cross-Coder Adaptive Edge Graph Convolution Network. ACM Multimedia 2024: 6578-6587 - 2023
- [c18]Xiaoyu Feng, Heming Du, Hehe Fan, Yueqi Duan, Yongpan Liu:
SEFormer: Structure Embedding Transformer for 3D Object Detection. AAAI 2023: 632-640 - [c17]Guangzhi Wang, Hehe Fan, Mohan S. Kankanhalli:
Text to Point Cloud Localization with Relation-Enhanced Transformer. AAAI 2023: 2501-2509 - [c16]Hehe Fan, Linchao Zhu, Yi Yang, Mohan S. Kankanhalli:
PointListNet: Deep Learning on 3D Point Lists. CVPR 2023: 17692-17701 - [c15]Ming Li, Xiangyu Xu, Hehe Fan, Pan Zhou, Jun Liu, Jia-Wei Liu, Jiahe Li, Jussi Keppo, Mike Zheng Shou, Shuicheng Yan:
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition. ICCV 2023: 5083-5092 - [c14]Xiaoxiao Sheng, Zhiqiang Shen, Gang Xiao, Longguang Wang, Yulan Guo, Hehe Fan:
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos. ICCV 2023: 16469-16478 - [c13]Zhiqiang Shen, Xiaoxiao Sheng, Hehe Fan, Longguang Wang, Yulan Guo, Qiong Liu, Hao Wen, Xi Zhou:
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos. ICCV 2023: 16534-16543 - [c12]Hehe Fan, Zhangyang Wang, Yi Yang, Mohan S. Kankanhalli:
Continuous-Discrete Convolution for Geometry-Sequence Modeling in Proteins. ICLR 2023 - 2022
- [c11]Hehe Fan, Xiaojun Chang, Wanyue Zhang, Yi Cheng, Ying Sun, Mohan S. Kankanhalli:
Self-Supervised Global-Local Structure Modeling for Point Cloud Domain Adaptation with Reliable Voted Pseudo Labels. CVPR 2022: 6367-6376 - [c10]Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tianlong Chen, Yu Cheng, Zhangyang Wang:
Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction. ECCV (3) 2022: 156-172 - 2021
- [c9]Hehe Fan, Yi Yang, Mohan S. Kankanhalli:
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud Videos. CVPR 2021: 14204-14213 - [c8]Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan S. Kankanhalli:
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. ICLR 2021 - [c7]Hehe Fan, Mohan S. Kankanhalli:
Motion = Video - Content: Towards Unsupervised Learning of Motion Representation from Videos. MMAsia 2021: 2:1-2:7 - 2020
- [c6]Hehe Fan, Yi Yang:
Person Tube Retrieval via Language Description. AAAI 2020: 10754-10761 - 2019
- [c5]Hehe Fan, Linchao Zhu, Yi Yang:
Cubic LSTMs for Video Prediction. AAAI 2019: 8263-8270 - [c4]Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang:
Attract or Distract: Exploit the Margin of Open Set. ICCV 2019: 7989-7998 - 2018
- [c3]Hehe Fan, Zhongwen Xu, Linchao Zhu, Chenggang Yan, Jianjun Ge, Yi Yang:
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification. IJCAI 2018: 705-711 - 2017
- [c2]Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann:
Complex Event Detection by Identifying Reliable Shots from Untrimmed Videos. ICCV 2017: 736-744 - 2016
- [c1]Junwei Liang, Jia Chen, Poyao Huang, Xuanchong Li, Lu Jiang, Zhenzhong Lan, Pingbo Pan, Hehe Fan, Qin Jin, Jiande Sun, Yang Chen, Yi Yang, Alexander G. Hauptmann:
Informedia @ TRECVID 2016. TRECVID 2016
Informal and Other Publications
- 2024
- [i33]Yuze Hao, Jianrong Zhang, Tao Zhuo, Fuan Wen, Hehe Fan:
Hand-Centric Motion Refinement for 3D Hand-Object Interaction via Hierarchical Spatial-Temporal Modeling. CoRR abs/2401.15987 (2024) - [i32]Zhenglin Zhou, Fan Ma, Hehe Fan, Yi Yang:
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting. CoRR abs/2402.06149 (2024) - [i31]Chao Wang, Hehe Fan, Ruijie Quan, Yi Yang:
ProtChatGPT: Towards Understanding Proteins with Large Language Models. CoRR abs/2402.09649 (2024) - [i30]Xiangpeng Yang, Linchao Zhu, Hehe Fan, Yi Yang:
EVA: Zero-shot Accurate Attributes and Multi-Object Video Editing. CoRR abs/2403.16111 (2024) - [i29]Ruijie Quan, Wenguan Wang, Fan Ma, Hehe Fan, Yi Yang:
Clustering for Protein Representation Learning. CoRR abs/2404.00254 (2024) - [i28]Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui Xu, Hangyu Liu, Sicong Leng, Jiangming Liu, Hehe Fan, Dajiu Huang, Jing Feng, Linli Chen, Can Zhang, Xuhuan Li, Hao Zhang, Jianhang Chen, Qimei Cui, Xiaofeng Tao:
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly. CoRR abs/2405.00181 (2024) - [i27]Wei Li, Hehe Fan, Yongkang Wong, Mohan S. Kankanhalli, Yi Yang:
TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment. CoRR abs/2405.13911 (2024) - [i26]Yue Zhang, Hehe Fan, Yi Yang:
Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models. CoRR abs/2405.15684 (2024) - [i25]Wenjie Zhuo, Fan Ma, Hehe Fan, Yi Yang:
VividDreamer: Invariant Score Distillation For Hyper-Realistic Text-to-3D Generation. CoRR abs/2407.09822 (2024) - [i24]Guoliang Chen, Fei Wang, Kun Li, Zhiliang Wu, Hehe Fan, Yi Yang, Meng Wang, Dan Guo:
Prototype Learning for Micro-gesture Classification. CoRR abs/2408.03097 (2024) - [i23]Wenjin Hou, Dingjie Fu, Kun Li, Shiming Chen, Hehe Fan, Yi Yang:
ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning. CoRR abs/2408.14868 (2024) - [i22]Yuxuan Hou, Jianrong Zhang, Hua Chen, Min Zhou, Faxin Yu, Hehe Fan, Yi Yang:
CktGen: Specification-Conditioned Analog Circuit Generation. CoRR abs/2410.00995 (2024) - 2023
- [i21]Ming Li, Jun Liu, Hehe Fan, Jiawei Liu, Jiahe Li, Mike Zheng Shou, Jussi Keppo:
STPrivacy: Spatio-Temporal Tubelet Sparsification and Anonymization for Privacy-preserving Action Recognition. CoRR abs/2301.03046 (2023) - [i20]Guangzhi Wang, Hehe Fan, Mohan S. Kankanhalli:
Text to Point Cloud Localization with Relation-Enhanced Transformer. CoRR abs/2301.05372 (2023) - [i19]Yi Cheng, Ziwei Xu, Fen Fang, Dongyun Lin, Hehe Fan, Yongkang Wong, Ying Sun, Mohan S. Kankanhalli:
A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023. CoRR abs/2307.06569 (2023) - [i18]Yi Cheng, Hehe Fan, Dongyun Lin, Ying Sun, Mohan S. Kankanhalli, Joo-Hwee Lim:
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering. CoRR abs/2307.13250 (2023) - [i17]Yue Zhang, Hehe Fan, Yi Yang, Mohan S. Kankanhalli:
DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation. CoRR abs/2307.16803 (2023) - [i16]Zhiqiang Shen, Xiaoxiao Sheng, Hehe Fan, Longguang Wang, Yulan Guo, Qiong Liu, Hao Wen, Xi Zhou:
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos. CoRR abs/2308.09245 (2023) - [i15]Xiaoxiao Sheng, Zhiqiang Shen, Gang Xiao, Longguang Wang, Yulan Guo, Hehe Fan:
Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos. CoRR abs/2308.09247 (2023) - [i14]Tao Zhuo, Zhiyong Cheng, Hehe Fan, Mohan S. Kankanhalli:
Prior-Free Continual Learning with Unlabeled Data in the Wild. CoRR abs/2310.10417 (2023) - [i13]Yu Lu, Linchao Zhu, Hehe Fan, Yi Yang:
FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax. CoRR abs/2311.15813 (2023) - [i12]Xiaobo Hu, Youfang Lin, Yue Liu, Jinwen Wang, Shuo Wang, Hehe Fan, Kai Lv:
A Reliable Representation with Bidirectional Transition Model for Visual Reinforcement Learning Generalization. CoRR abs/2312.01915 (2023) - [i11]Xiaobo Hu, Youfang Lin, Hehe Fan, Shuo Wang, Zhihao Wu, Kai Lv:
Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation. CoRR abs/2312.03327 (2023) - [i10]Hang Du, Guoshun Nan, Sicheng Zhang, Binzhu Xie, Junrui Xu, Hehe Fan, Qimei Cui, Xiaofeng Tao, Xudong Jiang:
DocMSU: A Comprehensive Benchmark for Document-level Multimodal Sarcasm Understanding. CoRR abs/2312.16023 (2023) - 2022
- [i9]Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan S. Kankanhalli:
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. CoRR abs/2205.13713 (2022) - [i8]Xiaoyu Feng, Heming Du, Yueqi Duan, Yongpan Liu, Hehe Fan:
SEFormer: Structure Embedding Transformer for 3D Object Detection. CoRR abs/2209.01745 (2022) - [i7]Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zhangyang Wang:
Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer? CoRR abs/2209.07026 (2022) - 2019
- [i6]Hehe Fan, Linchao Zhu, Yi Yang:
Cubic LSTMs for Video Prediction. CoRR abs/1904.09412 (2019) - [i5]Yuhang Ding, Hehe Fan, Mingliang Xu, Yi Yang:
Adaptive Exploration for Unsupervised Person Re-Identification. CoRR abs/1907.04194 (2019) - [i4]Qianyu Feng, Guoliang Kang, Hehe Fan, Yi Yang:
Attract or Distract: Exploit the Margin of Open Set. CoRR abs/1908.01925 (2019) - [i3]Qianyu Feng, Yu Wu, Hehe Fan, Chenggang Yan, Yi Yang:
Cascaded Revision Network for Novel Object Captioning. CoRR abs/1908.02726 (2019) - [i2]Hehe Fan, Yi Yang:
PointRNN: Point Recurrent Neural Network for Moving Point Cloud Processing. CoRR abs/1910.08287 (2019) - 2017
- [i1]Hehe Fan, Liang Zheng, Yi Yang:
Unsupervised Person Re-identification: Clustering and Fine-tuning. CoRR abs/1705.10444 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-08 20:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint