Author: Zhou, Ziheng : Search

research-article

Free

All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism

MM '24: Proceedings of the 32nd ACM International Conference on MultimediaPages 632–641https://doi.org/10.1145/3664647.3681228

In the field of affective computing, fully leveraging information from a variety of sensory modalities is essential for the comprehensive understanding and processing of human emotions. Inspired by the process through which the human brain handles ...

Article

Contrastive Learning Enhanced Diffusion Model for Improving Tropical Cyclone Intensity Estimation with Test-Time Adaptation

Machine Learning and Knowledge Discovery in Databases. Applied Data Science TrackPages 418–434https://doi.org/10.1007/978-3-031-70378-2_26

Abstract

Tropical cyclone (TC) intensity estimation from satellite images is the very first and critical step of making TC forecasts, whose SOTA performance is achieved by methods built upon CNN based regression models. Unlike discriminative models trained ...

research-article

$C / N_{0}$ estimation based on acquisition correlation ratio for short GNSS data

GPS Solutions (SPGPS), Volume 28, Issue 3https://doi.org/10.1007/s10291-024-01666-y

Abstract

Carrier-to-noise ratio ( $C / N_{0}$ ) represents the ratio of signal power and noise power density, and it is of great significance in many applications of Global Navigation Satellite System (GNSS), such as satellite signal quality monitoring, spoofing ... $_{}$ $_{}$ $_{}$ $_{}$ $_{}$ $_{}$ $_{}$ $_{}$ $_{}$

Article

Ranking Enhanced Supervised Contrastive Learning for Regression

Advances in Knowledge Discovery and Data MiningPages 15–27https://doi.org/10.1007/978-981-97-2253-2_2

Abstract

Supervised contrastive learning has shown promising results in image classification tasks where the representations are pulled together if they share same labels or otherwise pushed apart. Such dispersion process in the representation space ...

review-article

GNSS antispoofing method using the intersection angle between two directions of arrival (IA-DOA) for multiantenna receivers

GPS Solutions (SPGPS), Volume 27, Issue 1https://doi.org/10.1007/s10291-022-01345-w

Abstract

Given the increasing number of spoofing attacks, keeping global navigation satellite system transmissions secure has recently become a focus. Many approaches have been proposed to defend against spoofing. Typical antispoofing methods against a ...

research-article

AlphaBlock: An Evaluation Framework for Blockchain Consensus Algorithms

SBC '21: Proceedings of the Ninth International Workshop on Security in Blockchain and Cloud ComputingPages 17–22https://doi.org/10.1145/3457977.3460297

Consensus algorithm is the core of blockchain and it plays a crucial role in the performance of the blockchain. In general, there are two types of blockchain consensus algorithms: the Bitcoin-like Nakamoto consensus (NC) algorithms and the Byzantine ...

research-article

No-reference image quality assessment based on neighborhood co-occurrence matrix

Image Communication (IMAG), Volume 81, Issue Chttps://doi.org/10.1016/j.image.2019.115680

Abstract

No-reference image quality assessment (NR-IQA) aims to develop models that can predict the quality of distorted image automatically and accurately in the absent of reference image. Previous NR-IQA methods based on natural scene ...

Highlights

The significant of spatial correlation of pixels for quality evaluation is analyzed.

research-article

Characterizing Subtle Facial Movements via Riemannian Manifold

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), Volume 15, Issue 3sArticle No.: 94, Pages 1–24https://doi.org/10.1145/3342227

Characterizing subtle facial movements from videos is one of the most intensive topics in computer vision research. It is, however, challenging, since (1) the intensity of subtle facial muscle movement is usually low, (2) the duration may be transient, ...

Article

No-Reference Image Quality Assessment via Multi-order Perception Similarity

Pattern Recognition and Computer VisionPages 607–619https://doi.org/10.1007/978-3-030-31723-2_52

Abstract

No-reference image quality assessment (NR-IQA) aims to develop models that can predict the quality of distorted image automatically and accurately without the reference. Lack of reference makes NR-IQA based on feature learning difficult to avoid ...

research-article

Background Subtraction Using Spatio-Temporal Group Sparsity Recovery

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 28, Issue 8Pages 1737–1751https://doi.org/10.1109/TCSVT.2017.2697972

Background subtraction is a key step in a wide spectrum of video applications, such as object tracking and human behavior analysis. Compressive sensing-based methods, which make little specific assumptions about the background, have recently attracted ...

research-article

Blind Image Quality Assessment Based on Visuo-Spatial Series Statistics

2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Pages 3161–3165https://doi.org/10.1109/ICASSP.2018.8462303

Existing blind image quality assessment (BIQA) methods based on statistics attach limited attention to the relative position of pixels. Features in these BIQA methods are too flimsy to characterize quite a few distortions with strong locality or ...

research-article

Image denoising via group sparsity residual constraint

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Pages 1787–1791https://doi.org/10.1109/ICASSP.2017.7952464

Group sparsity has shown great potential in various low-level vision tasks (e.g, image denoising, deblurring and inpainting). In this paper, we propose a new prior model for image denoising via group sparsity residual constraint (GSRC). To enhance the ...

research-article

Depth estimation for image dehazing of surveillance on education

Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 31, Issue 5Pages 2629–2636https://doi.org/10.3233/JIFS-169103

Foggy weather brings lots of inconvenience for outdoor safety surveillance in the densely populated school education area. Research on image and video dehazing is able to solve this problem. Most existing methods recover the haze-free scenes relying on ...

research-article

3D Visual Speech Animation from Image Sequences

ICVGIP '14: Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image ProcessingArticle No.: 47, Pages 1–7https://doi.org/10.1145/2683483.2683530

In this paper we describe an early version of our system which synthesizes 3D visual speech including tongue and teeth from frontal facial image sequences. This system is developed for 3D Visual Speech Animation (VSA) using images generated by an ...

Article

Facial 3D Shape Estimation from Images for Visual Speech Animation

ICPR '14: Proceedings of the 2014 22nd International Conference on Pattern RecognitionPages 40–45https://doi.org/10.1109/ICPR.2014.17

In this paper we describe the first version of our system for estimating 3D shape sequences from images of the frontal face. This approach is developed with 3D Visual Speech Animation (VSA) as the target application. In particular, the focus is on the ...

article

A Compact Representation of Visual Speech Data Using Latent Variables

IEEE Transactions on Pattern Analysis and Machine Intelligence (ITPM), Volume 36, Issue 1Pages 181–187https://doi.org/10.1109/TPAMI.2013.173

The problem of visual speech recognition involves the decoding of the video dynamics of a talking mouth in a high-dimensional visual space. In this paper, we propose a generative latent variable model to provide a compact representation of visual speech ...

research-article

Video Texture Synthesis With Multi-Frame LBP-TOP and Diffeomorphic Growth Model

IEEE Transactions on Image Processing (TIP), Volume 22, Issue 10Pages 3879–3891https://doi.org/10.1109/TIP.2013.2263148

Video texture synthesis is the process of providing a continuous and infinitely varying stream of frames, which plays an important role in computer vision and graphics. However, it still remains a challenging problem to generate high-quality synthesis ...

research-article

An Image-Based Visual Speech Animation System

IEEE Transactions on Circuits and Systems for Video Technology (IEEETCSVT), Volume 22, Issue 10Pages 1420–1432https://doi.org/10.1109/TCSVT.2012.2199399

An image-based visual speech animation system is presented in this paper. A video model is proposed to preserve the video dynamics of a talking face. The model represents a video sequence by a low-dimensional continuous curve embedded in a path graph ...

Article

Towards a practical lipreading system

CVPR '11: Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern RecognitionPages 137–144https://doi.org/10.1109/CVPR.2011.5995345

A practical lipreading system can be considered either as subject dependent (SD) or subject-independent (SI). An SD system is user-specific, i.e., customized for some particular user while an SI system has to cope with a large number of users. These two ...

research-article

Synthesizing a talking mouth

ICVGIP '10: Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image ProcessingPages 211–218https://doi.org/10.1145/1924559.1924588

This paper presents a visually realistic animation system for synthesizing a talking mouth. Video synthesis is achieved by first learning generative models from the recorded speech videos and then using the learned models to generate videos for novel ...

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

All rivers run into the sea: Unified Modality Brain-Inspired Emotional Central Mechanism

Contrastive Learning Enhanced Diffusion Model for Improving Tropical Cyclone Intensity Estimation with Test-Time Adaptation

$C / N_{0}$ estimation based on acquisition correlation ratio for short GNSS data

Ranking Enhanced Supervised Contrastive Learning for Regression

GNSS antispoofing method using the intersection angle between two directions of arrival (IA-DOA) for multiantenna receivers

AlphaBlock: An Evaluation Framework for Blockchain Consensus Algorithms

No-reference image quality assessment based on neighborhood co-occurrence matrix

Characterizing Subtle Facial Movements via Riemannian Manifold

No-Reference Image Quality Assessment via Multi-order Perception Similarity

Background Subtraction Using Spatio-Temporal Group Sparsity Recovery

Blind Image Quality Assessment Based on Visuo-Spatial Series Statistics

Image denoising via group sparsity residual constraint

Depth estimation for image dehazing of surveillance on education

3D Visual Speech Animation from Image Sequences

Facial 3D Shape Estimation from Images for Visual Speech Animation

A Compact Representation of Visual Speech Data Using Latent Variables

Video Texture Synthesis With Multi-Frame LBP-TOP and Diffeomorphic Growth Model

An Image-Based Visual Speech Animation System

Towards a practical lipreading system

Synthesizing a talking mouth

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

C/N0 estimation based on acquisition correlation ratio for short GNSS data

$C / N_{0}$ estimation based on acquisition correlation ratio for short GNSS data