Xitong Yang
2020 – today
- 2024
- [j6] Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang: Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 4747-4762 (2024)
- [j5] Tianshuo Bai, Wanru Mao, Guangyao Wang, Hanjie Liu, Aifei Zhang, Shihang Fu, Shuaikai Liu, Jianchao Hu, Xitong Yang, Biao Pan, Wei W. Xing, Wang Kang: An End-to-End In-Memory Computing System Based on a 40-nm eFlash-Based IMC SoC: Circuits, Toolchains, and Systems Co-Design Framework. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 43(6): 1729-1740 (2024)
- [c22] Chaoyi Zhang, Xitong Yang, Ji Hou, Kris Kitani, Weidong Cai, Fu-Jen Chu: EgoSG: Learning 3D Scene Graphs from Egocentric RGB-D Sequences. CVPR Workshops 2024: 2535-2545
- [c21] Yuhan Shen, Huiyu Wang, Xitong Yang, Matt Feiszli, Ehsan Elhamifar, Lorenzo Torresani, Effrosyni Mavroudi: Learning to Segment Referred Objects from Narrated Egocentric Videos. CVPR 2024: 14510-14520
- [c20] Md Mohaiminul Islam, Ngan Ho, Xitong Yang, Tushar Nagarajan, Lorenzo Torresani, Gedas Bertasius: Video ReCap: Recursive Captioning of Hour-Long Videos. CVPR 2024: 18198-18208
- [c19] Kristen Grauman, Andrew Westbury, Lorenzo Torresani, Kris Kitani, Jitendra Malik, Triantafyllos Afouras, Kumar Ashutosh, Vijay Baiyya, Siddhant Bansal, Bikram Boote, Eugene Byrne, Zachary Chavis, Joya Chen, Feng Cheng, Fu-Jen Chu, Sean Crane, Avijit Dasgupta, Jing Dong, María Escobar, Cristhian Forigua, Abrham Gebreselasie, Sanjay Haresh, Jing Huang, Md Mohaiminul Islam, Suyog Dutt Jain, Rawal Khirodkar, Devansh Kukreja, Kevin J. Liang, Jia-Wei Liu, Sagnik Majumder, Yongsen Mao, Miguel Martin, Effrosyni Mavroudi, Tushar Nagarajan, Francesco Ragusa, Santhosh Kumar Ramakrishnan, Luigi Seminara, Arjun Somayazulu, Yale Song, Shan Su, Zihui Xue, Edward Zhang, Jinxu Zhang, Angela Castillo, Changan Chen, Xinzhu Fu, Ryosuke Furuta, Cristina González, Prince Gupta, Jiabo Hu, Yifei Huang, Yiming Huang, Weslie Khoo, Anush Kumar, Robert Kuo, Sach Lakhavani, Miao Liu, Mi Luo, Zhengyi Luo, Brighid Meredith, Austin Miller, Oluwatumininu Oguntola, Xiaqing Pan, Penny Peng, Shraman Pramanick, Merey Ramazanova, Fiona Ryan, Wei Shan, Kiran Somasundaram, Chenan Song, Audrey Southerland, Masatoshi Tateno, Huiyu Wang, Yuchen Wang, Takuma Yagi, Mingfei Yan, Xitong Yang, Zecheng Yu, Shengxin Cindy Zha, Chen Zhao, Ziwei Zhao, Zhifan Zhu, Jeff Zhuo, Pablo Arbeláez, Gedas Bertasius, Dima Damen, Jakob Engel, Giovanni Maria Farinella, Antonino Furnari, Bernard Ghanem, Judy Hoffman, C. V. Jawahar, Richard A. Newcombe, Hyun Soo Park, James M. Rehg, Yoichi Sato, Manolis Savva, Jianbo Shi, Mike Zheng Shou, Michael Wray: Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. CVPR 2024: 19383-19400
- [i24] Md Mohaiminul Islam, Ngan Ho, Xitong Yang, Tushar Nagarajan, Lorenzo Torresani, Gedas Bertasius: Video ReCap: Recursive Captioning of Hour-Long Videos. CoRR abs/2402.13250 (2024)
- [i23] Zi-Yi Dou, Xitong Yang, Tushar Nagarajan, Huiyu Wang, Jing Huang, Nanyun Peng, Kris Kitani, Fu-Jen Chu: Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning. CoRR abs/2408.03567 (2024)
- [i22] Zejia Weng, Xitong Yang, Zhen Xing, Zuxuan Wu, Yu-Gang Jiang: GenRec: Unifying Video Generation and Recognition with Diffusion Models. CoRR abs/2408.15241 (2024)
- [i21] Md Mohaiminul Islam, Tushar Nagarajan, Huiyu Wang, Fu-Jen Chu, Kris Kitani, Gedas Bertasius, Xitong Yang: Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos. CoRR abs/2409.20557 (2024)
- 2023
- [j4] Huaqiu Chen, Rong Ma, Bingjie Zhou, Xitong Yang, Fuhui Duan, Guangming Wang: Integrated immunological analysis of single-cell and bulky tissue transcriptomes reveals the role of interactions between M0 macrophages and naïve CD4+ T cells in the immunosuppressive microenvironment of cervical cancer. Comput. Biol. Medicine 163: 107151 (2023)
- [c18] Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava: Towards Scalable Neural Representation for Diverse Videos. CVPR 2023: 6132-6142
- [c17] Xitong Yang, Fu-Jen Chu, Matt Feiszli, Raghav Goyal, Lorenzo Torresani, Du Tran: Relational Space-Time Query in Long-Form Videos. CVPR 2023: 6398-6408
- [c16] Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar: Vision Transformers are Good Mask Auto-Labelers. CVPR 2023: 23745-23755
- [c15] Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang: Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. ICML 2023: 36978-36989
- [i20] Shiyi Lan, Xitong Yang, Zhiding Yu, Zuxuan Wu, José M. Álvarez, Anima Anandkumar: Vision Transformers Are Good Mask Auto-Labelers. CoRR abs/2301.03992 (2023)
- [i19] Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization. CoRR abs/2302.00624 (2023)
- [i18] Raghav Goyal, Effrosyni Mavroudi, Xitong Yang, Sainbayar Sukhbaatar, Leonid Sigal, Matt Feiszli, Lorenzo Torresani, Du Tran: MINOTAUR: Multi-task Video Grounding From Multimodal Queries. CoRR abs/2302.08063 (2023)
- [i17] Bo He, Xitong Yang, Hanyu Wang, Zuxuan Wu, Hao Chen, Shuaiyi Huang, Yixuan Ren, Ser-Nam Lim, Abhinav Shrivastava: Towards Scalable Neural Representation for Diverse Videos. CoRR abs/2303.14124 (2023)
- [i16] Zuxuan Wu, Zejia Weng, Wujian Peng, Xitong Yang, Ang Li, Larry S. Davis, Yu-Gang Jiang: Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data. CoRR abs/2310.05010 (2023)
- 2022
- [c14] Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava: ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization. CVPR 2022: 13915-13925
- [c13] Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang: Efficient Video Transformers with Spatial-Temporal Token Selection. ECCV (35) 2022: 69-86
- [c12] Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang: Semi-supervised Vision Transformers. ECCV (30) 2022: 605-620
- [i15] Bo He, Xitong Yang, Le Kang, Zhiyu Cheng, Xin Zhou, Abhinav Shrivastava: ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization. CoRR abs/2203.15187 (2022)
- 2021
- [b1] Xitong Yang: Long-Term Temporal Modeling for Video Action Understanding. University of Maryland, College Park, MD, USA, 2021
- [c11] Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz: Hierarchical Contrastive Motion Learning for Video Action Recognition. BMVC 2021: 109
- [c10] Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava: GTA: Global Temporal Attention for Video Action Understanding. BMVC 2021: 292
- [c9] Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry S. Davis, Heng Wang: Beyond Short Clips: End-to-End Video-Level Learning With Collaborative Memories. CVPR 2021: 7567-7576
- [i14] Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry Davis, Heng Wang: Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories. CoRR abs/2104.01198 (2021)
- [i13] Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang: Semi-Supervised Vision Transformers. CoRR abs/2111.11067 (2021)
- [i12] Junke Wang, Xitong Yang, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang: Efficient Video Transformers with Spatial-Temporal Token Selection. CoRR abs/2111.11591 (2021)
- 2020
- [c8] Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis: A Generic Visualization Approach for Convolutional Neural Networks. ECCV (17) 2020: 734-750
- [i11] Ahmed Taha, Xitong Yang, Abhinav Shrivastava, Larry Davis: A Generic Visualization Approach for Convolutional Neural Networks. CoRR abs/2007.09748 (2020)
- [i10] Xitong Yang, Xiaodong Yang, Sifei Liu, Deqing Sun, Larry Davis, Jan Kautz: Hierarchical Contrastive Motion Learning for Video Action Recognition. CoRR abs/2007.10321 (2020)
- [i9] Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava: GTA: Global Temporal Attention for Video Action Understanding. CoRR abs/2012.08510 (2020)
2010 – 2019
- 2019
- [j3] Wei Qian, Wending Li, Yasuhiro Sogawa, Ryohei Fujimaki, Xitong Yang, Ji Liu: An Interactive Greedy Approach to Group Sparsity in High Dimensions. Technometrics 61(3): 409-421 (2019)
- [c7] Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry S. Davis, Jan Kautz: STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CVPR 2019: 264-272
- [c6] Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry Davis, Jun Li, Jian Yang, Ser-Nam Lim: Cross-X Learning for Fine-Grained Visual Categorization. ICCV 2019: 8241-8250
- [i8] Ahmed Taha, Yi-Ting Chen, Xitong Yang, Teruhisa Misu, Larry Davis: Exploring Uncertainty in Conditional Multi-Modal Retrieval Systems. CoRR abs/1901.07702 (2019)
- [i7] Xitong Yang, Xiaodong Yang, Ming-Yu Liu, Fanyi Xiao, Larry Davis, Jan Kautz: STEP: Spatio-Temporal Progressive Learning for Video Action Detection. CoRR abs/1904.09288 (2019)
- [i6] Wei Luo, Xitong Yang, Xianjie Mo, Yuheng Lu, Larry S. Davis, Jun Li, Jian Yang, Ser-Nam Lim: Cross-X Learning for Fine-Grained Visual Categorization. CoRR abs/1909.04412 (2019)
- 2018
- [j2] Edgar A. Bernal, Xitong Yang, Qun Li, Jayant Kumar, Sriganesh Madhvanath, Palghat Ramesh, Raja Bala: Deep Temporal Multimodal Fusion for Medical Procedure Monitoring Using Wearable Sensors. IEEE Trans. Multim. 20(1): 107-118 (2018)
- [c5] Xitong Yang, Zheng Xu, Jiebo Luo: Towards Perceptual Image Dehazing by Physics-Based Disentanglement and Adversarial Training. AAAI 2018: 7485-7492
- [c4] Zheng Xu, Xitong Yang, Xue Li, Xiaoshuai Sun: Strong Baseline for Single Image Dehazing with Deep Features and Instance Normalization. BMVC 2018: 243
- [i5] Zheng Xu, Xitong Yang, Xue Li, Xiaoshuai Sun: The Effectiveness of Instance Normalization: a Strong Baseline for Single Image Dehazing. CoRR abs/1805.03305 (2018)
- [i4] Ahmed Taha, Moustafa Meshry, Xitong Yang, Yi-Ting Chen, Larry Davis: Two Stream Self-Supervised Learning for Action Recognition. CoRR abs/1806.07383 (2018)
- 2017
- [j1] Xitong Yang, Jiebo Luo: Tracking Illicit Drug Dealing and Abuse on Instagram Using Multimodal Analysis. ACM Trans. Intell. Syst. Technol. 8(4): 58:1-58:15 (2017)
- [c3] Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo: Deep Multimodal Representation Learning from Temporal Data. CVPR 2017: 5066-5074
- [i3] Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo: Deep Multimodal Representation Learning from Temporal Data. CoRR abs/1704.03152 (2017)
- 2016
- [i2] Xitong Yang, Jiebo Luo: Tracking Illicit Drug Dealing and Abuse on Instagram using Multimodal Analysis. CoRR abs/1605.02710 (2016)
- 2015
- [c2] Yuncheng Li, Xitong Yang, Jiebo Luo: Semantic Video Entity Linking Based on Visual Content and Metadata. ICCV 2015: 4615-4623
- [c1] Xitong Yang, Yuncheng Li, Jiebo Luo: Pinterest Board Recommendation for Twitter Users. ACM Multimedia 2015: 963-966
- [i1] Xitong Yang, Yuncheng Li, Jiebo Luo: Pinterest Board Recommendation for Twitter Users. CoRR abs/1509.00511 (2015)
last updated on 2024-10-30 20:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license