Author: Xie, Hong

research-article

A Meta-Learning Approach to Mitigating the Estimation Bias of Q-Learning

ACM Transactions on Knowledge Discovery from Data (TKDD), Volume 18, Issue 9, Article No.: 226, Pages 1–23, https://doi.org/10.1145/3688849

It is a longstanding problem that Q-learning suffers from the overestimation bias. This issue originates from the fact that Q-learning uses the expectation of maximum Q-value to approximate the maximum expected Q-value. A number of algorithms, such as ...

research-article

Asynchronous SGD with stale gradient dynamic adjustment for deep learning training

Information Sciences: an International Journal (ISCI), Volume 681, Issue C, https://doi.org/10.1016/j.ins.2024.121220

Abstract

Asynchronous stochastic gradient descent (ASGD) is a computationally efficient algorithm, which speeds up deep learning training and plays an important role in distributed deep learning. However, ASGD suffers from the stale gradient problem, i.e., ...

research-article

Free

JUST ACCEPTED

Online Incentive Protocol Design for Reposting Service in Online Social Networks

ACM Transactions on the Web (TWEB), Just Accepted https://doi.org/10.1145/3696473

Reposting plays an essential role in boosting visibility on online social networks (OSNs). In this paper, we study the problem of designing “reposting service” in an OSN to incentivize “transactions” between requesters (users who seek to enhance ...

research-article

Adaptive moving average Q-learning

Knowledge and Information Systems (KAIS), Volume 66, Issue 12, Pages 7389–7417, https://doi.org/10.1007/s10115-024-02190-8

Abstract

A variety of algorithms have been proposed to address the long-standing overestimation bias problem of Q-learning. Reducing this overestimation bias may lead to an underestimation bias, as in double Q-learning. However, it is still unclear how ...

Article

Minimizing Survey Questions for PTSD Prediction Following Acute Trauma

Artificial Intelligence in Medicine, Pages 90–100, https://doi.org/10.1007/978-3-031-66538-7_11

Abstract

Traumatic experiences have the potential to give rise to posttraumatic stress disorder (PTSD), a debilitating psychiatric condition associated with impairments in both social and occupational functioning. There has been great interest in utilizing ...

Article

False Negative Sample Aware Negative Sampling for Recommendation

Advances in Knowledge Discovery and Data Mining, Pages 195–206, https://doi.org/10.1007/978-981-97-2262-4_16

Abstract

Negative sampling plays a key role in implicit feedback collaborative filtering. It draws high-quality negative samples from a large number of uninteracted samples. Existing methods primarily focus on hard negative samples, while overlooking the ...

research-article

Robust and efficient algorithms for conversational contextual bandit

Information Sciences: an International Journal (ISCI), Volume 657, Issue C, https://doi.org/10.1016/j.ins.2023.119993

Abstract

Conversational contextual bandit is one of the notable variants of contextual bandit and it is shown to have superior performance in recommendation applications. The core idea of conversational contextual bandits is utilizing conversational ...

research-article

Q-learning with heterogeneous update strategy

Information Sciences: an International Journal (ISCI), Volume 656, Issue C, https://doi.org/10.1016/j.ins.2023.119902

Abstract

A variety of algorithms has been proposed to mitigate the overestimation bias of Q-learning. These algorithms reduce the estimation of maximum Q-value, i.e., homogeneous update. As a result, some of these algorithms such as Double Q-learning ...

research-article

Uncertainty-aware instance reweighting for off-policy learning

NIPS '23: Proceedings of the 37th International Conference on Neural Information Processing Systems, Article No.: 3224, Pages 73691–73718

Off-policy learning, referring to the procedure of policy optimization with access only to logged feedback data, has shown importance in various real-world applications, such as search engines and recommender systems. While the ground-truth logging ...

research-article

Open Access

Relieving Popularity Bias in Interactive Recommendation: A Diversity-Novelty-Aware Reinforcement Learning Approach

ACM Transactions on Information Systems (TOIS), Volume 42, Issue 2, Article No.: 52, Pages 1–30, https://doi.org/10.1145/3618107

While personalization increases the utility of item recommendation, it also suffers from the issue of popularity bias. However, previous methods emphasize adopting supervised learning models to relieve popularity bias in the static recommendation, ...

Article

TSTD: A Cross-modal Two Stages Network with New Trans-decoder for Point Cloud Semantic Segmentation

Pattern Recognition and Computer Vision, Pages 130–141, https://doi.org/10.1007/978-981-99-8543-2_11

Abstract

In recent years, exploring integrated heterogeneous features architecture has become one of the hot spots in 3D point cloud understanding. However, the efficacy of end-to-end training in enhancing the precision of multi-view fusion for point cloud ...

Article

A Voxel-Based Multiview Point Cloud Refinement Method via Factor Graph Optimization

Pattern Recognition and Computer Vision, Pages 234–245, https://doi.org/10.1007/978-981-99-8432-9_19

Abstract

Lidar enables fast reconstruction of the real world using high-precision point cloud maps. It usually requires the pose information (also called trajectory) of point clouds obtained by lidar at different times so that all scans are unified in the ...

research-article

Open Access

Contrastive Learning based Item Representation with Asymmetric Augmentation for Sequential Recommendation

ADMIT '23: Proceedings of the 2023 2nd International Conference on Algorithms, Data Mining, and Information Technology, Pages 68–73, https://doi.org/10.1145/3625403.3625418

Contrastive learning has been widely applied in sequential recommendation to improve the recommendation performance. Existing contrastive learning methods focus on adjusting the number of views of positive and negative samples to enhance the item ...

research-article

Optimizing recommendations under abandonment risks: Models and algorithms

Performance Evaluation (PEVA), Volume 161, Issue C, https://doi.org/10.1016/j.peva.2023.102351

Abstract

User abandonment behaviors are quite common in recommendation applications such as online shopping recommendation and news recommendation. To maximize its total “reward” under the risk of user abandonment, the online platform needs to carefully ...

Highlights

Model the user abandonment behavior in recommendation systems via Markov decision processes.
Transfer other similar users’ information to optimize future decisions.
An algorithmic framework with two components and theoretical ...

Article

Estimating Dynamic Posttraumatic Stress Symptom Trajectories with Functional Data Analysis

Brain Informatics, Pages 348–356, https://doi.org/10.1007/978-3-031-43075-6_30

Abstract

Posttraumatic stress disorder (PTSD) is a mental health condition that may develop following exposure to trauma, with diverse and complex longitudinal trajectories of symptoms during the days to months after a traumatic event. To supplement ...

research-article

Probabilistic Modeling of Assimilate-Contrast Effects in Online Rating Systems

IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 36, Issue 2, Pages 795–808, https://doi.org/10.1109/TKDE.2023.3292352

Online rating systems serve as an indispensable building block for many web applications. Previous studies showed that due to assimilate-contrast effects, historical ratings could significantly distort users’ ratings, leading to low accuracy of ...

research-article

A New Outlier Removal Strategy Based on Reliability of Correspondence Graph for Fast Point Cloud Registration

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 45, Issue 7, Pages 7986–8002, https://doi.org/10.1109/TPAMI.2022.3226498

Registration is a basic yet crucial task in point cloud processing. In correspondence-based point cloud registration, matching correspondences by point feature techniques may lead to an extremely high outlier (false correspondence) ratio. Current outlier ...

research-article

Efficient algorithms for multi-armed bandits with additional feedbacks: Modeling and algorithms

Information Sciences: an International Journal (ISCI), Volume 633, Issue C, Pages 453–468, https://doi.org/10.1016/j.ins.2023.03.060

Abstract

Multi-armed bandits (MAB) are widely applied to optimize networking applications such as crowdsensing and mobile edge computing. Additional feedbacks (or partial feedbacks) on some arms can usually be collected in many networking ...

Article

A Thompson Sampling Approach to Unifying Causal Inference and Bandit Learning

Advances in Knowledge Discovery and Data Mining, Pages 255–266, https://doi.org/10.1007/978-3-031-33377-4_20

Abstract

Offline logged data is quite common in many web applications such as recommendation, Internet advertising, etc., which offers great potential to improve online decision making. It is a non-trivial task to utilize offline logged data for online ...

Article

A Multi-player MAB Approach for Distributed Selection Problems

Advances in Knowledge Discovery and Data Mining, Pages 243–254, https://doi.org/10.1007/978-3-031-33377-4_19

Abstract

Motivated by distributed selection problems, we formulate a new variant of multi-player multi-armed bandit (MAB) model, which captures stochastic arrival of requests to each arm and the policy of allocating requests to players. The challenge is ...
