Dhiraj Joshi
2020 – today
- 2024
  - [i8] Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang: HASSOD: Hierarchical Adaptive Self-Supervised Object Detection. CoRR abs/2402.03311 (2024)
  - [i7] David Wood, Boris Lublinsky, Alexy Roytman, Shivdeep Singh, Abdulhamid Adebayo, Revital Eres, Mohammad Nassar, Hima Patel, S. Yousaf Shah, Constantin Adam, Petros Zerfos, Nirmit Desai, Daiki Tsuzuku, Takuya Goto, Michele Dolfi, Saptha Surendran, Paramesvaran Selvam, Sungeun An, Yuan Chi Chang, Dhiraj Joshi, Hajar Emami-Gohari, Xuan-Hong Dang, Yan Koyfman, Shahrokh Daijavad: Data-Prep-Kit: getting your data ready for LLM application development. CoRR abs/2409.18164 (2024)
- 2023
  - [c44] Hanjing Wang, Dhiraj Joshi, Shiqiang Wang, Qiang Ji: Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning. CVPR 2023: 12044-12053
  - [c43] Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang: Contrastive Mean Teacher for Domain Adaptive Object Detectors. CVPR 2023: 23839-23848
  - [c42] Shengcao Cao, Dhiraj Joshi, Liangyan Gui, Yu-Xiong Wang: HASSOD: Hierarchical Adaptive Self-Supervised Object Detection. NeurIPS 2023
  - [i6] Hanjing Wang, Dhiraj Joshi, Shiqiang Wang, Qiang Ji: Gradient-based Uncertainty Attribution for Explainable Bayesian Deep Learning. CoRR abs/2304.04824 (2023)
  - [i5] Shengcao Cao, Dhiraj Joshi, Liang-Yan Gui, Yu-Xiong Wang: Contrastive Mean Teacher for Domain Adaptive Object Detectors. CoRR abs/2305.03034 (2023)
- 2021
  - [c41] Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass: AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588
- 2020
  - [i4] Andrew Rouditchenko, Angie W. Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogério Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass: AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. CoRR abs/2006.09199 (2020)
2010 – 2019
- 2019
  - [j12] Michele Merler, Khoi-Nguyen C. Mac, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogério Schmidt Feris: Automatic Curation of Sports Highlights Using Multimodal Excitement Features. IEEE Trans. Multim. 21(5): 1147-1160 (2019)
  - [c40] Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogério Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James R. Glass: Grounding Spoken Words in Unlabeled Video. CVPR Workshops 2019: 29-32
  - [c39] Khoi-Nguyen C. Mac, Dhiraj Joshi, Raymond A. Yeh, Jinjun Xiong, Rogério Schmidt Feris, Minh N. Do: Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection. ICCV 2019: 6281-6290
  - [i3] Sicheng Zhao, Shangfei Wang, Mohammad Soleymani, Dhiraj Joshi, Qiang Ji: Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey. CoRR abs/1911.05609 (2019)
- 2018
  - [c38] Michele Merler, Dhiraj Joshi, Khoi-Nguyen C. Mac, Quoc-Bao Nguyen, Stephen Hammer, John Kent, Jinjun Xiong, Minh N. Do, John R. Smith, Rogério Schmidt Feris: The Excitement of Sports: Automatic Highlights Using Audio/Visual Cues. CVPR Workshops 2018: 2520-2523
  - [i2] Khoi-Nguyen C. Mac, Dhiraj Joshi, Raymond A. Yeh, Jinjun Xiong, Rogério Schmidt Feris, Minh N. Do: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection. CoRR abs/1811.08815 (2018)
- 2017
  - [c37] Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogério Schmidt Feris: Automatic Curation of Golf Highlights Using Multimodal Excitement Features. CVPR Workshops 2017: 57-65
  - [c36] Ryosuke Shigenaka, Yan-Ying Chen, Francine Chen, Dhiraj Joshi, Yukihiro Tsuboshita: Image-based user profiling of frequent and regular venue categories. ICME 2017: 541-546
  - [c35] Dhiraj Joshi, Michele Merler, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogério Schmidt Feris: IBM High-Five: Highlights From Intelligent Video Engine. ACM Multimedia 2017: 1249-1250
  - [c34] John R. Smith, Dhiraj Joshi, Benoit Huet, Winston H. Hsu, Jozef Cota: Harnessing A.I. for Augmenting Creativity: Application to Movie Trailer Creation. ACM Multimedia 2017: 1799-1808
  - [i1] Michele Merler, Dhiraj Joshi, Quoc-Bao Nguyen, Stephen Hammer, John Kent, John R. Smith, Rogério Schmidt Feris: Automatic Curation of Golf Highlights using Multimodal Excitement Features. CoRR abs/1707.07075 (2017)
- 2016
  - [c33] Bor-Chun Chen, Yan-Ying Chen, Francine Chen, Dhiraj Joshi: Business-Aware Visual Concept Discovery from Social Media for Multimodal Business Venue Recognition. AAAI 2016: 101-107
  - [c32] Yan-Ting Chen, Francine Chen, Matthew Cooper, Dhiraj Joshi: Using business-aware latent topics for image captioning in social media. ICME 2016: 1-6
- 2015
  - [c31] Bokai Cao, Francine Chen, Dhiraj Joshi, Philip S. Yu: Inferring crowd-sourced venues for tweets. IEEE BigData 2015: 639-648
  - [c30] Dhiraj Joshi, Matthew Cooper, Francine Chen, Yan-Ying Chen: Building User Profiles from Shared Photos. MMCommons@ACM Multimedia 2015: 37-42
- 2014
  - [c29] Junjie Cai, Qiong Liu, Francine Chen, Dhiraj Joshi, Qi Tian: Scalable Image Search with Multiple Index Tables. ICMR 2014: 407
  - [c28] Francine Chen, Dhiraj Joshi, Yasuhide Miura, Tomoko Ohkuma: Social Media-based Profiling of Business Locations. GeoMM 2014: 1-6
  - [c27] Huizhong Chen, Matthew Cooper, Dhiraj Joshi, Bernd Girod: Multi-modal Language Models for Lecture Video Retrieval. ACM Multimedia 2014: 1081-1084
  - [c26] Dhiraj Joshi, Francine Chen, Lynn Wilcox: Finding selfies of users in microblogged photos. SoMeRA@SIGIR 2014: 33-34
- 2013
  - [j11] Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Joshi, Jiawei Han: Reinforced Similarity Integration in Image-Rich Information Networks. IEEE Trans. Knowl. Data Eng. 25(2): 448-460 (2013)
- 2012
  - [j10] Dhiraj Joshi, Andrew C. Gallagher, Jie Yu, Jiebo Luo: Inferring photographic location using geotagged web images. Multim. Tools Appl. 56(1): 131-153 (2012)
  - [c25] Hua Wang, Dhiraj Joshi, Jiebo Luo, Heng Huang, Minwoo Park: Simultaneous Image Annotation and Geo-Tag Prediction via Correlation Guided Multi-task Learning. ISM 2012: 69-72
  - [c24] Minwoo Park, Dhiraj Joshi, Alexander C. Loui: Tag Cloud++ - Scalable Tag Clouds for Arbitrary Layouts. ISM 2012: 318-325
- 2011
  - [j9] Jiebo Luo, Dhiraj Joshi, Jie Yu, Andrew C. Gallagher: Geotagging in multimedia and computer vision - a survey. Multim. Tools Appl. 51(1): 187-211 (2011)
  - [j8] Dhiraj Joshi, Ritendra Datta, Elena A. Fedorovskaya, Quang-Tuan Luong, James Ze Wang, Jia Li, Jiebo Luo: Aesthetics and Emotions in Images. IEEE Signal Process. Mag. 28(5): 94-115 (2011)
  - [c23] Charles Parker, Dhiraj Joshi, Phoury Lei, Jiebo Luo: Finding geographically representative music via social media. MIRUM 2011: 27-32
  - [c22] Vivek K. Singh, Jiebo Luo, Dhiraj Joshi, Phoury Lei, Madirakshi Das, Peter O. Stubler: Reliving on demand: a total viewer experience. ACM Multimedia 2011: 333-342
  - [c21] Vivek K. Singh, Jiebo Luo, Dhiraj Joshi, Madirakshi Das, Phoury Lei, Peter O. Stubler: Dynamic media show drivable by semantics. ACM Multimedia 2011: 815-816
  - [p1] Dhiraj Joshi, Jiebo Luo, Jie Yu, Phoury Lei, Andrew C. Gallagher: Using Geotags to Derive Rich Tag-Clouds for Image Annotation. Social Media Modeling and Computing 2011: 239-256
- 2010
  - [c20] Dhiraj Joshi, Andrew C. Gallagher, Jie Yu, Jiebo Luo: Exploring user image tags for geo-location inference. ICASSP 2010: 5598-5601
  - [c19] Dhiraj Joshi, Mark D. Wood, Jiebo Luo: Suggesting Songs for Media Creation Using Semantics. ICPR 2010: 3208-3211
  - [c18] Dhiraj Joshi: Semantic understanding of geotagged pictures. Multimedia Information Retrieval 2010: 5-6
  - [c17] Xin Jin, Jiebo Luo, Jie Yu, Gang Wang, Dhiraj Joshi, Jiawei Han: iRIN: image retrieval in image-rich information networks. WWW 2010: 1261-1264
2000 – 2009
- 2009
  - [j7] Matthieu Cord, Padraig Cunningham, Dhiraj Joshi: Machine Learning Techniques for Multimedia: Case Studies on Organization and Retrieval. J. Electronic Imaging 18(3): 039901 (2009)
  - [c16] Andrew C. Gallagher, Dhiraj Joshi, Jie Yu, Jiebo Luo: Geo-location inference from image content and user tags. CVPR Workshops 2009: 55-62
  - [c15] Jie Yu, Dhiraj Joshi, Jiebo Luo: Connecting people in photo-sharing sites by photo content and user annotations. ICME 2009: 1464-1467
- 2008
  - [j6] Ritendra Datta, Dhiraj Joshi, Jia Li, James Ze Wang: Image retrieval: Ideas, influences, and trends of the new age. ACM Comput. Surv. 40(2): 5:1-5:60 (2008)
  - [j5] Henk M. Blanken, Arjen P. de Vries, Henk Ernst Blok, Ling Feng, Dhiraj Joshi: Multimedia Retrieval. J. Electronic Imaging 17(3): 039901 (2008)
  - [c14] Dhiraj Joshi, Jiebo Luo: Inferring generic activities and events from image content and bags of geo-tags. CIVR 2008: 37-46
  - [c13] Jiebo Luo, Wei Hao, Dale McIntyre, Dhiraj Joshi, Jie Yu: Recognizing picture-taking environment from satellite images: A feasibility study. ICPR 2008: 1-4
  - [c12] Jiebo Luo, Jie Yu, Dhiraj Joshi, Wei Hao: Event recognition: viewing the world with a third eye. ACM Multimedia 2008: 1071-1080
- 2007
  - [c11] Dhiraj Joshi, Milind R. Naphade, Apostol Natsev: Semantics reinforcement and fusion learning for multimedia streams. CIVR 2007: 309-316
  - [c10] Dhiraj Joshi, Milind R. Naphade, Apostol Natsev: A Greedy Performance Driven Algorithm for Decision Fusion Learning. ICIP (6) 2007: 25-28
  - [c9] Ritendra Datta, Dhiraj Joshi, Jia Li, James Ze Wang: Tagging over time: real-world image annotation by lightweight meta-learning. ACM Multimedia 2007: 393-402
- 2006
  - [j4] Dhiraj Joshi, Jia Li, James Ze Wang: A Computationally Efficient Approach to the Estimation of Two- and Three-Dimensional Hidden Markov Models. IEEE Trans. Image Process. 15(7): 1871-1886 (2006)
  - [j3] Dhiraj Joshi, James Ze Wang, Jia Li: The Story Picturing Engine - a system for automatic text illustration. ACM Trans. Multim. Comput. Commun. Appl. 2(1): 68-89 (2006)
  - [c8] Ritendra Datta, Dhiraj Joshi, Jia Li, James Ze Wang: Studying Aesthetics in Photographic Images Using a Computational Approach. ECCV (3) 2006: 288-301
  - [c7] Dhiraj Joshi, Daniel Gatica-Perez: Discovering groups of people in Google news. HCM@MM 2006: 55-64
  - [c6] Murray Campbell, Alexander Haubold, Shahram Ebadollahi, Dhiraj Joshi, Milind R. Naphade, Apostol Natsev, Joachim Seidl, John R. Smith, Katya Scheinberg, Jelena Tesic, Lexing Xie: IBM Research TRECVID-2006 Video Retrieval System. TRECVID 2006
  - [c5] Dhiraj Joshi, Ritendra Datta, Ziming Zhuang, W. P. Weiss, Marc Friedenberg, Jia Li, James Ze Wang: PARAgrab: A Comprehensive Architecture for Web Image Management and Multimodal Querying. VLDB 2006: 1163-1166
- 2005
  - [j2] Dhiraj Joshi, James Ze Wang: Multimedia Systems and Content-Based Image Retrieval. By Sagarmay Deb, Idea Group Publishing, 2004, $79.95 ISBN 1-59140-156-9. Inf. Process. Manag. 41(2): 406-407 (2005)
  - [c4] Dhiraj Joshi, Jia Li, James Ze Wang: Parameter estimation of multi-dimensional hidden Markov models - a scalable approach. ICIP (3) 2005: 149-152
- 2004
  - [c3] Jia Li, Dhiraj Joshi, James Ze Wang: Stochastic modeling of volume images with a 3-d hidden markov model. ICIP 2004: 2359-2362
  - [c2] Dhiraj Joshi, James Ze Wang, Jia Li: The story picturing engine: finding elite images to illustrate a story using mutual reinforcement. Multimedia Information Retrieval 2004: 119-126
- 2002
  - [j1] Kalyanmoy Deb, Ashish Anand, Dhiraj Joshi: A Computationally Efficient Evolutionary Algorithm for Real-Parameter Optimization. Evol. Comput. 10(4): 345-369 (2002)
  - [c1] Kalyanmoy Deb, Dhiraj Joshi, Ashish Anand: Real-coded evolutionary algorithms with parent-centric recombination. IEEE Congress on Evolutionary Computation 2002: 61-66
last updated on 2024-10-21 21:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license