Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleOctober 2024
3D Question Answering with Scene Graph Reasoning
MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 1370–1378https://doi.org/10.1145/3664647.36815173DQA has gained considerable attention due to its enhanced spatial understanding capabilities compared to image-based VQA. However, existing 3DQA methods have explicitly focused on integrating text and color-coded point cloud features, thereby ...
- research-articleSeptember 2024
KF-VTON: Keypoints-Driven Flow Based Virtual Try-On Network
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 9Article No.: 293, Pages 1–23https://doi.org/10.1145/3673903Image-based virtual try-on aims to fit a target garment to a reference person. Most existing methods are limited to solving the Garment-To-Person (G2P) try-on task that transfers a garment from a clean product image to the reference person and do not ...
- research-articleJuly 2024
Multi2Human: Controllable human image generation with multimodal controls
AbstractGenerating high-quality and diverse human images presents a substantial difficulty within the field of computer vision, especially in developing controllable generative models that can utilize input from various modalities. Such models could ...
Highlights- A novel framework for controllable human image generation with multimodal controls is proposed. The framework’s flexibility lies in its ability to incorporate different types of control signals, which can be modified to suit the ...
- research-articleJune 2024
Recurrent Appearance Flow for Occlusion-Free Virtual Try-On
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 20, Issue 8Article No.: 239, Pages 1–17https://doi.org/10.1145/3659581Image-based virtual try-on aims at transferring a target in-shop garment onto a reference person, and has garnered significant attention from the research communities recently. However, previous methods have faced severe challenges in handling occlusion ...
- research-articleJuly 2024
H 2GCN: A hybrid hypergraph convolution network for skeleton-based action recognition
Journal of King Saud University - Computer and Information Sciences (JKSUCIS), Volume 36, Issue 5https://doi.org/10.1016/j.jksuci.2024.102072AbstractRecent GCN-based works have achieved remarkable results for skeleton-based human action recognition. Nevertheless, while existing approaches extensively investigate pairwise joint relationships, only a limited number of models explore the ...
-
- research-articleApril 2024
PointCMC: cross-modal multi-scale correspondences learning for point cloud understanding
AbstractExisting cross-modal frameworks have achieved impressive performance in point cloud object representations learning, where a 2D image encoder is employed to transfer knowledge to a 3D point cloud encoder. However, the local structures between ...
- research-articleFebruary 2024
PU-GAT: Point cloud upsampling with graph attention network
AbstractPoint cloud upsampling has been extensively studied, however, the existing approaches suffer from the losing of structural information due to neglect of spatial dependencies between points. In this work, we propose PU-GAT, a novel 3D point cloud ...
Graphical abstractDisplay Omitted
- research-articleOctober 2023
Structure-Aware Subspace Clustering
IEEE Transactions on Knowledge and Data Engineering (IEEECS_TKDE), Volume 35, Issue 10Pages 10569–10582https://doi.org/10.1109/TKDE.2023.3249765Subspace clustering has attracted much attention because of its ability to group unlabeled high-dimensional data into multiple subspaces. Existing graph-based subspace clustering methods focus on either the sparsity of data affinity or the low rank of ...
- ArticleDecember 2023
PCCNet: A Few-Shot Patch-Wise Contrastive Colorization Network
AbstractFew-shot colorization aims to learn a model to colorize images with little training data. Yet, existing models often fail to keep color consistency due to ignored patch correlations of the images. In this paper, we propose PCCNet, a novel Patch-...
- research-articleJuly 2023
Position‐aware spatio‐temporal graph convolutional networks for skeleton‐based action recognition
AbstractGraph Convolutional Networks (GCNs) have been widely used in skeleton‐based action recognition. Though significant performance has been achieved, it is still challenging to effectively model the complex dynamics of skeleton sequences. A novel ...
A novel position‐aware graph convolutional networks for skeleton‐based action recognition is proposed. Our model investigated the position encoding modules and the subgraph masking operator to address some defects holding in existing models, such as the ...
- research-articleMarch 2023
StyleBERT: Text-audio sentiment analysis with Bi-directional Style Enhancement
AbstractRecent multimodal sentiment analysis works focus on establishing sophisticated fusion strategies for better performance. However, a major limitation of these works is that they ignore effective modality representation learning before ...
Highlights- Before multimodal fusion, we must perform modality representation learning.
- ...
- research-articleDecember 2022
PVT: Point‐voxel transformer for point cloud learning
International Journal of Intelligent Systems (IJIS), Volume 37, Issue 12Pages 11985–12008https://doi.org/10.1002/int.23073AbstractThe recently developed pure transformer architectures have attained promising accuracy on point cloud learning benchmarks compared to convolutional neural networks. However, existing point cloud Transformers are computationally expensive because ...
- research-articleJanuary 2022
Graph-PBN: Graph-based parallel branch network for efficient point cloud learning
Graphical abstractDisplay Omitted
AbstractIn recent years, approaches based on graph convolutional networks (GCNs) have achieved state-of-the-art performance in point cloud learning. The typical pipeline of GCNs is modeled as a two-stage learning process: graph construction ...
- ArticleOctober 2020
VH3D-LSFM: Video-Based Human 3D Pose Estimation with Long-Term and Short-Term Pose Fusion Mechanism
AbstractFollowing the success of 2D human pose estimation from a single image, a lot of work focus on video-based 3D human pose estimation by exploiting temporal information. In this scenario, several recent works have achieved significant advances via ...
- ArticleOctober 2020
Multi-human Parsing with Pose and Boundary Guidance
AbstractIn this work, we present a novel end-to-end semantic segmentation framework for multi-human parsing, which integrates both the high- and low-level features. Our framework includes three modules: segmentation module, pose estimation module, and ...
- research-articleAugust 2020
Co-skeletons: Consistent curve skeletons for shape families
Computers and Graphics (CGRS), Volume 90, Issue CPages 62–72https://doi.org/10.1016/j.cag.2020.05.006Highlights- We present co-skeletons, a new method that computes consistent curve skeletons for 3D shapes from a given family.
Display Omitted
AbstractWe present co-skeletons, a new method that computes consistent curve skeletons for 3D shapes from a given family. We compute co-skeletons in terms of sampling density and semantic relevance, while preserving the desired characteristics ...
- research-articleJanuary 2018
Integrated Modeling, Simulation, and Visualization for Nanomaterials
Computer aided modeling and simulation of nanomaterials can describe the correlation between the material’s microstructure and its macroscopic properties quantitatively. In this paper, we propose an integrated modeling, simulation, and visualization ...
- research-articleNovember 2014
Interactive optimization of near-isometric shape correspondence
VRCAI '14: Proceedings of the 13th ACM SIGGRAPH International Conference on Virtual-Reality Continuum and its Applications in IndustryPages 43–46https://doi.org/10.1145/2670473.2670484In this paper, we present an interactive approach for near-isometric shape correspondence. Our key motivation is that the intention of the users in the correspondence problem is valuable, which helps to not only reduce search space for finding the ...
- articleFebruary 2014
CAD/Graphics 2013: Interactive shape co-segmentation via label propagation
In this paper, we present an interactive approach for shape co-segmentation via label propagation. Our intuitive approach is able to produce error-free results and is very effective at handling out-of-sample data. Specifically, we start by over-...
- articleOctober 2013
SMI 2013: Unsupervised co-segmentation of 3D shapes via affinity aggregation spectral clustering
Computers and Graphics (CGRS), Volume 37, Issue 6Pages 628–637https://doi.org/10.1016/j.cag.2013.05.015Many shape co-segmentation methods employ multiple descriptors to measure the similarities between parts of a set of shapes in a descriptor space. Different shape descriptors characterize a shape in different aspects. Simply concatenating them into a ...