Procedural modeling of artificially cultivated shrub roses
Decorative plants require skillful pruning by gardeners. However, conventional plant modeling techniques have focused only on the biological growth rules of wild plants and do not yet consider artificial care. In this study, an interactive system ...
GeoSegNet: point cloud semantic segmentation via geometric encoder–decoder modeling
- Chen Chen,
- Yisen Wang,
- Honghua Chen,
- Xuefeng Yan,
- Dayong Ren,
- Yanwen Guo,
- Haoran Xie,
- Fu Lee Wang,
- Mingqiang Wei
Semantic segmentation of point clouds, aiming to assign each point a semantic category, is critical to 3D scene understanding. Although significant advances in recent years, most of the existing methods still suffer from either the object-level ...
Context-aware personality estimation and emotion recognition in social interaction
Personality and emotion as intrinsic factors often have great influences on the cognition of people’s behavior. In computer vision, there is a lot of work done on the recognition of emotions, such as classification of a person’s emotions via ...
Structure-preserving image smoothing via contrastive learning
Image smoothing is an important processing operation that highlights low-frequency structural parts of an image and suppresses the noise and high-frequency textures. In the paper, we post an intriguing question of how to combine the paired ...
MFFNet: multimodal feature fusion network for point cloud semantic segmentation
We introduce a multimodal feature fusion network (MFFNet) for 3D point cloud semantic segmentation. Unlike previous methods that directly learn from colored point clouds (XYZRGB), MFFNet transforms point clouds to 2D RGB image and frequency image ...
Unsupervised contrastive learning with simple transformation for 3D point cloud data
Though a number of point cloud learning methods have been proposed to handle unordered points, most of them are supervised and require labels for training. By contrast, unsupervised learning of point cloud data has received much less attention to ...
RT-less: a multi-scene RGB dataset for 6D pose estimation of reflective texture-less objects
The 6D (6 Degree of freedom) pose estimation (or pose measurement) of machined reflective texture-less objects, which are common in industry, is a significant but challenging technique. It has attracted increasing attention in academia and ...
An adaptive methodology for rock mass fracture image enhancement with generalized gamma correction
Rock fractures are an important indicator for assessing the quality of rock masses. The significant differences in the color and texture distributions of rock mass exposures lead to great challenges in the automated extraction of fractures from ...
AttentionDIP: attention-based deep image prior model to restore satellite and aerial images from gamma distributed speckle interference
Image restoration is an inevitable pre-processing step in most satellite imaging applications. The satellite imaging modality such as Synthetic Aperture Radar (SAR) is prone to speckle distortions due to constructive and destructive interference ...
Partial point cloud registration algorithm based on deep learning and non-corresponding point estimation
For the limitations of global feature-based deep learning point cloud registration algorithms in partial point cloud registration, this paper proposes a partial point cloud registration algorithm NcPE-PNLK combining global features and ...
A powerful method for interactive content-based image retrieval by variable compressed convolutional info neural networks
There is a need for efficient methods to retrieve and obtain the visual data that a client need. New methods for content-based image retrieval (CBIR) have emerged due to recent developments in deep neural networks. However, there are still issues ...
A new pixel-wise data processing method for reflectance transformation imaging
Reflectance transformation imaging (RTI) is one of the most widely used techniques in order to digitize and analyze material appearance of a surface, finding a great level of utility and applicability in the field of cultural heritage as well as ...
Image denoising using difference classifier and trimmed global mean filter adaptive approach
Inexpensive cameras, different acquisition devices and mobile phones have resulted in the production of digital images in all spheres of life. There is common of these images for extracting useful information in many fields to perform image ...
Point-voxel dual stream transformer for 3d point cloud learning
Recently, the success of Transformer in natural language processing and image processing inspires researchers to apply Transformer in point cloud processing. However, existing point cloud Transformer methods have problems with massive parameters, ...
Blind quality assessment of screen content images via edge histogram descriptor and statistical moments
With the growth in utilizing desktop sharing and remote control applications in recent years for many purposes like online education and remote working, quality assessment (QA) of screen images has become a hot topic. It could be used to enhance ...
ZRDNet: zero-reference image defogging by physics-based decomposition–reconstruction mechanism and perception fusion
This paper investigates challenging fully unsupervised defogging problems, i.e., how to remove fog by feeding only foggy images in deep neural networks rather than using paired or unpaired synthetic images, and how to overcome the problems of ...
Soft-edge-guided significant coordinate attention network for scene text image super-resolution
Scene text image super-resolution (STISR) aims to enhance the resolution and visual quality of low-resolution scene text images, thereby improving the performance of some text-related downstream vision tasks. However, many existing STISR methods ...
ADVISE: ADaptive feature relevance and VISual Explanations for convolutional neural networks
To equip convolutional neural networks (CNNs) with explainability, it is essential to interpret how opaque models make specific decisions, understand what causes the errors, improve the architecture design, and identify unethical biases in the ...
6D object pose estimation based on dense convolutional object center voting with improved accuracy and efficiency
6D object pose estimation is an important application of computer vision and a basic module in robotic manipulation, but dealing with occlusion in a cluttered environment, handling symmetries, and textureless surfaces, are real issues. Other ...
DualMLP: a two-stream fusion model for 3D point cloud classification
In this paper, we present DualMLP, a novel 3D model that introduces the idea of a two-stream network for existing 3D models to handle the trade-off between the number of points and the computational overhead. Existing works on point clouds use a ...
Age-invariant face recognition based on identity-age shared features
Decoupling the mixed face features to obtain identity features that are not disturbed by age information is the key to achieving cross-age face recognition. Established mainstream identity feature extraction methods decouple facial representations ...
Underwater image enhancement algorithm based on color correction and contrast enhancement
Due to the complex underwater environment and the selective absorption and scattering effect of water on light waves, underwater images often suffer from issues such as low contrast, color distortion, and blurred details. This paper presents a ...
A dynamic learning framework integrating attention mechanism for point cloud registration
To improve the low accuracy problem of existing point cloud registration algorithms attributed to deficient point cloud geometric features, we proposed a new point cloud registration network inspired by dynamic feature extraction and the graph ...
Making paper labels smart for augmented wine recognition
An invisible layer of knowledge is progressively growing with the emergence of situated visualizations and reality-based information retrieval systems. In essence, digital content will overlap with real-world entities, eventually providing ...
Texture-aware re-parameterization to mitigate accuracy drop after quantization for 4K/8K image super-resolution
In this paper, we aim to improve super-resolution (SR) imaging quality on 4K/8K images with a negligible increase in computational cost and alleviate the accuracy drop after quantization. Experiments have discovered two phenomena: (1) the re-...
Two-stage dual-resolution face network for cross-resolution face recognition in surveillance systems
Face recognition for surveillance remains a complex challenge due to the disparity between low-resolution (LR) face images captured by surveillance cameras and the typically high-resolution (HR) face images in databases. To address this cross-...
Human body construction based on combination of parametric and nonparametric reconstruction methods
Nowadays, 3D human models are widely used in the garment industry where it is important to reconstruct compliant human model from scan of the body under partial dress for privacy reasons. A new 3D human construction method based on the combination ...
A rotation robust shape transformer for cartoon character recognition
Recognizing cartoon characters accurately is important for animators to design and create cartoon scenarios by utilizing existing cartoon materials. Current deep learning approaches are sensitive to image rotation and heavily rely on rich textures ...