- research-article, October 2024
All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism
- Xinji Mai,
- Junxiong Lin,
- Haoran Wang,
- Zeng Tao,
- Yan Wang,
- Shaoqi Yan,
- Xuan Tong,
- Jiawen Yu,
- Boyang Wang,
- Ziheng Zhou,
- Qing Zhao,
- Shuyong Gao,
- Wenqiang Zhang
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Pages 632–641, https://doi.org/10.1145/3664647.3681228
In the field of affective computing, fully leveraging information from a variety of sensory modalities is essential for the comprehensive understanding and processing of human emotions. Inspired by the process through which the human brain handles ...
- Article, September 2024
Contrastive Learning Enhanced Diffusion Model for Improving Tropical Cyclone Intensity Estimation with Test-Time Adaptation
Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, Pages 418–434, https://doi.org/10.1007/978-3-031-70378-2_26
Abstract: Tropical cyclone (TC) intensity estimation from satellite images is the very first and critical step of making TC forecasts, whose SOTA performance is achieved by methods built upon CNN based regression models. Unlike discriminative models trained ...
- Article, May 2024
Ranking Enhanced Supervised Contrastive Learning for Regression
Advances in Knowledge Discovery and Data Mining, Pages 15–27, https://doi.org/10.1007/978-981-97-2253-2_2
Abstract: Supervised contrastive learning has shown promising results in image classification tasks where the representations are pulled together if they share the same labels or otherwise pushed apart. Such dispersion process in the representation space ...
- review-article, January 2023
GNSS antispoofing method using the intersection angle between two directions of arrival (IA-DOA) for multiantenna receivers
Abstract: Given the increasing number of spoofing attacks, keeping global navigation satellite system transmissions secure has recently become a focus. Many approaches have been proposed to defend against spoofing. Typical antispoofing methods against a ...
- research-article, May 2021
AlphaBlock: An Evaluation Framework for Blockchain Consensus Algorithms
SBC '21: Proceedings of the Ninth International Workshop on Security in Blockchain and Cloud Computing, Pages 17–22, https://doi.org/10.1145/3457977.3460297
Consensus algorithm is the core of blockchain and it plays a crucial role in the performance of the blockchain. In general, there are two types of blockchain consensus algorithms: the Bitcoin-like Nakamoto consensus (NC) algorithms and the Byzantine ...
- research-article, February 2020
No-reference image quality assessment based on neighborhood co-occurrence matrix
Abstract: No-reference image quality assessment (NR-IQA) aims to develop models that can predict the quality of distorted images automatically and accurately in the absence of a reference image. Previous NR-IQA methods based on natural scene ...
Highlights: The significance of the spatial correlation of pixels for quality evaluation is analyzed.
- research-article, December 2019
Characterizing Subtle Facial Movements via Riemannian Manifold
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 15, Issue 3s, Article No.: 94, Pages 1–24, https://doi.org/10.1145/3342227
Characterizing subtle facial movements from videos is one of the most intensive topics in computer vision research. It is, however, challenging, since (1) the intensity of subtle facial muscle movement is usually low, (2) the duration may be transient, ...
- Article, November 2019
No-Reference Image Quality Assessment via Multi-order Perception Similarity
Abstract: No-reference image quality assessment (NR-IQA) aims to develop models that can predict the quality of distorted images automatically and accurately without the reference. Lack of reference makes NR-IQA based on feature learning difficult to avoid ...
- research-article, August 2018
Background Subtraction Using Spatio-Temporal Group Sparsity Recovery
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 28, Issue 8, Pages 1737–1751, https://doi.org/10.1109/TCSVT.2017.2697972
Background subtraction is a key step in a wide spectrum of video applications, such as object tracking and human behavior analysis. Compressive sensing-based methods, which make little specific assumptions about the background, have recently attracted ...
- research-article, April 2018
Blind Image Quality Assessment Based on Visuo-Spatial Series Statistics
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Pages 3161–3165, https://doi.org/10.1109/ICASSP.2018.8462303
Existing blind image quality assessment (BIQA) methods based on statistics attach limited attention to the relative position of pixels. Features in these BIQA methods are too flimsy to characterize quite a few distortions with strong locality or ...
- research-article, March 2017
Image denoising via group sparsity residual constraint
- Zhiyuan Zha,
- Xin Liu,
- Ziheng Zhou,
- Xiaohua Huang,
- Jingang Shi,
- Zhenhong Shang,
- Lan Tang,
- Yechao Bai,
- Qiong Wang,
- Xinggan Zhang
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Pages 1787–1791, https://doi.org/10.1109/ICASSP.2017.7952464
Group sparsity has shown great potential in various low-level vision tasks (e.g., image denoising, deblurring and inpainting). In this paper, we propose a new prior model for image denoising via group sparsity residual constraint (GSRC). To enhance the ...
- research-article, January 2016
Depth estimation for image dehazing of surveillance on education
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 31, Issue 5, Pages 2629–2636, https://doi.org/10.3233/JIFS-169103
Foggy weather brings lots of inconvenience for outdoor safety surveillance in the densely populated school education area. Research on image and video dehazing is able to solve this problem. Most existing methods recover the haze-free scenes relying on ...
- research-article, December 2014
3D Visual Speech Animation from Image Sequences
ICVGIP '14: Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing, Article No.: 47, Pages 1–7, https://doi.org/10.1145/2683483.2683530
In this paper we describe an early version of our system which synthesizes 3D visual speech including tongue and teeth from frontal facial image sequences. This system is developed for 3D Visual Speech Animation (VSA) using images generated by an ...
- Article, August 2014
Facial 3D Shape Estimation from Images for Visual Speech Animation
ICPR '14: Proceedings of the 2014 22nd International Conference on Pattern Recognition, Pages 40–45, https://doi.org/10.1109/ICPR.2014.17
In this paper we describe the first version of our system for estimating 3D shape sequences from images of the frontal face. This approach is developed with 3D Visual Speech Animation (VSA) as the target application. In particular, the focus is on the ...
- Article, January 2014
A Compact Representation of Visual Speech Data Using Latent Variables
IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 36, Issue 1, Pages 181–187, https://doi.org/10.1109/TPAMI.2013.173
The problem of visual speech recognition involves the decoding of the video dynamics of a talking mouth in a high-dimensional visual space. In this paper, we propose a generative latent variable model to provide a compact representation of visual speech ...
- research-article, October 2013
Video Texture Synthesis With Multi-Frame LBP-TOP and Diffeomorphic Growth Model
IEEE Transactions on Image Processing (TIP), Volume 22, Issue 10, Pages 3879–3891, https://doi.org/10.1109/TIP.2013.2263148
Video texture synthesis is the process of providing a continuous and infinitely varying stream of frames, which plays an important role in computer vision and graphics. However, it still remains a challenging problem to generate high-quality synthesis ...
- research-article, October 2012
An Image-Based Visual Speech Animation System
IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 22, Issue 10, Pages 1420–1432, https://doi.org/10.1109/TCSVT.2012.2199399
An image-based visual speech animation system is presented in this paper. A video model is proposed to preserve the video dynamics of a talking face. The model represents a video sequence by a low-dimensional continuous curve embedded in a path graph ...
- Article, June 2011
Towards a practical lipreading system
CVPR '11: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition, Pages 137–144, https://doi.org/10.1109/CVPR.2011.5995345
A practical lipreading system can be considered either as subject-dependent (SD) or subject-independent (SI). An SD system is user-specific, i.e., customized for some particular user, while an SI system has to cope with a large number of users. These two ...
- research-article, December 2010
Synthesizing a talking mouth
ICVGIP '10: Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing, Pages 211–218, https://doi.org/10.1145/1924559.1924588
This paper presents a visually realistic animation system for synthesizing a talking mouth. Video synthesis is achieved by first learning generative models from the recorded speech videos and then using the learned models to generate videos for novel ...