Author: Jin, Tao

research-article

Open Access

Deadline and Period Assignment for Guaranteeing Timely Response of the Cyber-Physical System

ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 30, Issue 1, Article No.: 1, Pages 1–26, https://doi.org/10.1145/3689048

Cyber-physical systems (CPSs) need to respond to each change of each monitored object in time. The entire response process can be divided into two stages: the update stage and the control stage. Tasks in CPSs can thus be divided into two kinds: update ...

research-article

Calibrating Prompt from History for Continual Vision-Language Retrieval and Grounding

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 4302–4311, https://doi.org/10.1145/3664647.3681387

In the field of machine learning, continual learning is a crucial concept that allows models to adapt to non-stationary data distributions. However, most of the existing works focus on uni-modal settings and ignore the multi-modal data. In this paper, to ...

research-article

Boosting Speech Recognition Robustness to Modality-Distortion with Contrast-Augmented Prompts

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 3838–3847, https://doi.org/10.1145/3664647.3681347

In the burgeoning field of Audio-Visual Speech Recognition (AVSR), extant research has predominantly concentrated on the training paradigms tailored for high-quality resources. However, owing to the challenges inherent in real-world data collection, ...

research-article

Low-rank Prompt Interaction for Continual Vision-Language Retrieval

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 8257–8266, https://doi.org/10.1145/3664647.3681264

Research on continual learning in multi-modal tasks has been receiving increasing attention. However, most existing work overlooks the explicit cross-modal and cross-task interactions. In this paper, we innovatively propose the Low-rank Prompt I...

research-article

PIRN: Phase Invariant Reconstruction Network for infrared image super-resolution

Neurocomputing (NEUROC), Volume 599, Issue C, https://doi.org/10.1016/j.neucom.2024.128221

Abstract

Single image super-resolution (SR) reconstruction plays a crucial role in various fields, including surveillance and remote sensing. However, the majority of available SR reconstruction methods are designed primarily for visible images, making it ...

research-article

A method for image–text matching based on semantic filtering and adaptive adjustment

Journal on Image and Video Processing (JIVP), Volume 2024, Issue 1, https://doi.org/10.1186/s13640-024-00639-y

Abstract

As image–text matching (a critical task in the field of computer vision) links cross-modal data, it has captured extensive attention. Most of the existing methods intended for matching images and texts explore the local similarity levels between ...

research-article

EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration

KDD '24: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Pages 3245–3254, https://doi.org/10.1145/3637528.3671775

Generative retrieval has recently emerged as a promising approach to sequential recommendation, framing candidate item retrieval as an autoregressive sequence generation problem. However, existing generative methods typically focus solely on either ...

research-article

Multi-Granularity Relational Attention Network for Audio-Visual Question Answering

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 34, Issue 8, Pages 7080–7094, https://doi.org/10.1109/TCSVT.2023.3264524

Recent methods for video question answering (VideoQA), aiming to generate answers based on given questions and video content, have made significant progress in cross-modal interaction. From the perspective of video understanding, these existing frameworks ...

Article

Development of a Bistable Multi-joint Modular Gripper with Enhanced Adaptability and Speed

Intelligent Robotics and Applications, Pages 201–213, https://doi.org/10.1007/978-981-96-0798-3_16

Abstract

Handling dynamic objects is a significant challenge in robotics, necessitating the development of grippers capable of safe and reliable manipulation without compromising speed. Traditional rigid grippers, when used in dynamic environments, often ...

research-article

Borda regret minimization for generalized linear dueling bandits

ICML'24: Proceedings of the 41st International Conference on Machine Learning, Article No.: 2195, Pages 53571–53596

Dueling bandits are widely used to model preferential feedback prevalent in many applications such as recommendation systems and ranking. In this paper, we study the Borda regret minimization problem for dueling bandits, which aims to identify the item ...

research-article

FreeBind: free lunch in unified multimodal space via knowledge fusion

ICML'24: Proceedings of the 41st International Conference on Machine Learning, Article No.: 2139, Pages 52233–52246

Unified multi-modal representation spaces are the foundation of multimodal understanding and generation. However, the billions of model parameters and catastrophic forgetting problems make it challenging to further enhance pre-trained unified spaces. In ...

research-article

Non-confusing generation of customized concepts in diffusion models

ICML'24: Proceedings of the 41st International Conference on Machine Learning, Article No.: 1206, Pages 29935–29948

We tackle the common challenge of inter-concept visual confusion in compositional concept generation using text-guided diffusion models (TGDMs). It becomes even more pronounced in the generation of customized concepts, due to the scarcity of user-...

research-article

A three-field based finite element analysis for a class of magnetoelastic materials

Tao Jin

Finite Elements in Analysis and Design (FEAD), Volume 233, Issue C, https://doi.org/10.1016/j.finel.2024.104126

Abstract

A simple yet effective material model was proposed by Zhao et al. (2019) and demonstrated to be capable of modeling the shape transformations of various planar and three-dimensional material samples programmed with the so-called “hard-magnetic ...

research-article

Reputation incentives with public supervision promote cooperation in evolutionary games

Applied Mathematics and Computation (APMC), Volume 466, Issue C, https://doi.org/10.1016/j.amc.2023.128445

Abstract

Public supervision, as a source of social behavioral norms and moral guidelines, exerts important guidance and influence on individuals. To maintain public order, in this study, we propose a reputation incentive mechanism with public supervision,...

Highlights

We propose a reputation incentive mechanism with public supervision.
The dynamic process takes into account individual differences and incorporates diverse evaluation standards.
We study the impact of the evaluation intensity ...

research-article

Research on remote control system of tracking trolley based on ESP8266

CCEAI '24: Proceedings of the 2024 8th International Conference on Control Engineering and Artificial Intelligence, Pages 171–177, https://doi.org/10.1145/3640824.3640851

Traditional line-tracking cars usually use wired transmission to perform real-time AD (Analog to Digital) signal sampling and control, which has many disadvantages, such as cumbersome steps, multiple constraints, poor real-time performance, and low time ...

research-article

Assembly Action Recognition based on Dual Stream Fusion of Skeleton and Video Data

ICIGP '24: Proceedings of the 2024 7th International Conference on Image and Graphics Processing, Pages 159–165, https://doi.org/10.1145/3647649.3647676

To address the low accuracy of workers' assembly action recognition when relying only on human skeletal data in environments with complex, changing backgrounds, an assembly action recognition network based on dual stream fusion of skeleton and video ...

research-article

Trust-aware conditional adversarial domain adaptation with feature norm alignment

Neural Networks (NENE), Volume 168, Issue C, Pages 518–530, https://doi.org/10.1016/j.neunet.2023.10.002

Abstract

Adversarial learning has proven to be an effective method for capturing transferable features for unsupervised domain adaptation. However, some existing conditional adversarial domain adaptation methods assign equal importance to different ...

Highlights

The feature norms of each domain usually follow a complex distribution.
Data transferability is precisely quantified by a Gaussian-uniform mixture model.
Mixed information can better guide features away from the decision boundary.

...

research-article

Rethinking Missing Modality Learning from a Decoding Perspective

MM '23: Proceedings of the 31st ACM International Conference on Multimedia, Pages 4431–4439, https://doi.org/10.1145/3581783.3612291

The conventional pipeline of multimodal learning consists of three stages: encoding, fusion, and decoding. Most existing methods under the missing-modality condition focus on the first stage and aim to learn the modality invariant representation or ...

demonstration

Public Access

RadarHD: Demonstrating Lidar-like Point Clouds from mmWave Radar

ACM MobiCom '23: Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, Article No.: 106, Pages 1–3, https://doi.org/10.1145/3570361.3614077

Millimeter wave radars can perceive through occlusions like dust, fog, smoke and clothes. But compared to cameras and lidars, their perception quality is orders of magnitude poorer. RadarHD [3] tackles this problem of poor quality by creating a machine ...

Article

NegT5: A Cross-Task Text-to-Text Framework for Negation in Question Answering

Intelligent Information and Database Systems, Pages 272–285, https://doi.org/10.1007/978-981-99-5837-5_23

Abstract

Negation is a fundamental grammatical construct that plays a crucial role in understanding QA tasks. It has been revealed that models trained with SQuAD1 still produce original responses when presented with negated sentences. To mitigate this ...
