No abstract available.
Front Matter
Front Matter
Fused Geometry Augmented Images for Analyzing Textured Mesh
In this paper, we propose a novel multi-modal mesh surface representation fusing texture and geometric data. Our approach defines an inverse mapping between different geometric descriptors computed on the mesh surface or its down-sampled version, ...
Textureless Object Recognition Using an RGB-D Sensor
Object recognition is a significant task in an industrial assembly line, where a robotic arm should pick a small, textureless, and mostly homogeneous object to place it in its designated location. Despite all the recent advancements in object ...
CSIOR: An Ordered Structured Resampling of Mesh Surfaces
Triangular mesh is one of the most popular 3D modalities which usage spans a wide variety of application in computer vision, computer graphics and multimedia. Raw mesh surfaces generated by 3D scanning devices often suffer from mesh irregularity ...
Front Matter
Background Subtraction by Difference Clustering
Previous approaches to background subtraction typically considered the problem as a classification of pixels over time. We frame the problem as clustering the difference vectors between pixels in the current frame and in the background image set, ...
Semantic Learning for Image Compression (SLIC)
Image compression is an ever-evolving problem with different approaches coming into prominence at different times. Data analysts are still contemplating about the best approach with compression ratio, visual quality and complexity of the ...
Background Subtraction Based on Principal Motion for a Freely Moving Camera
As a fundamental research topic of computer vision, background subtraction technology can be implemented in many applications. This problem becomes even more challenging once the videos are obtained with a moving camera. To solve the challenging ...
Front Matter
Ontology Based Framework for Tactile Internet Applications
In the past decade, auditory and visual multimedia have reached an advanced quality level which is characteristically referred to as high definition (HD) and beyond. On the contrary, technical solutions addressing the sense of touch, which are ...
Potential of Deep Features for Opinion-Unaware, Distortion-Unaware, No-Reference Image Quality Assessment
Image Quality Assessment algorithms predict a quality score for a pristine or distorted input image, such that it correlates with human opinion. Traditional methods required a non-distorted “reference” version of the input image to compare with, ...
Non-invasive Lactate Threshold Estimation Using Machine Learning
The Lactate threshold (LT) has gained special attention in the sport world and is considered one of the potential indicators to evaluate individual performance in different sports. Traditionally, measuring LT requires frequent collection of blood ...
Front Matter
An Interdisciplinary Framework for Citizen-Centered Smart Cities and Smart Living
Rapid population growth and urbanization have led to increasing demands for management, healthcare, safety, among many other concerns, resulting in the recent formation and worldwide investment in IoT and ICT-enabled smart cities. Citizen-centric ...
Implementing Robotic Platforms for Therapies Using Qualitative Factors in Mexico
In recent years, robotic platforms (RP) have been implemented to assist human beings during therapeutic treatments. As a result, successful experimental cases have been reported around the world. However, there is not enough information about them ...
Foveated Haptic Gaze
- Bijan Fakhri,
- Troy McDaniel,
- Heni Ben Amor,
- Hemanth Venkateswara,
- Abhik Chowdhury,
- Sethuraman Panchanathan
As digital worlds become ubiquitous via video games, simulations, virtual and augmented reality, people with disabilities who cannot access those worlds are becoming increasingly disenfranchised. More often than not the design of these ...
Front Matter
Accurate Kidney Segmentation in CT Scans Using Deep Transfer Learning
- John Brandon Graham-Knight,
- Kymora Scotland,
- Victor KF. Wong,
- Abtin Djavadifar,
- Dirk Lange,
- Ben Chew,
- Patricia Lasserre,
- Homayoun Najjaran
A competitive model for kidney segmentation in CT scans is trained using the publicly-available KiTS19 dataset. The model performed well against the KiTS19 test dataset, achieving a Sørensen–Dice coefficient of 0.9620 when generating kidney ...
End to End Robust Point-Cloud Alignment Using Unsupervised Deep Learning
The point-cloud alignment methods help robots to map their environment, recognize target objects and estimate rigid-body object poses from the 3D vision sensor data. In this paper, we propose a robust and computationally efficient approach for ...
Homography-Based Vehicle Pose Estimation from a Single Image by Using Machine-Learning for Wheel-Region and Tire-Road Contact Point Detection
Image-based metric measurement and development of traffic surveillance systems have attracted wide interests within academia and industry for the past decade due to recent advancements in computer vision and the processing power required for ...
Front Matter
Using Participatory Design to Create a User Interface for Analyzing Pivotal Response Treatment Video Probes
Training caregivers in pivotal response treatment (PRT) has been shown to help improve communication skills in children with autism. PRT training programs are implemented by clinicians that provide instruction, modelling, and assessment. ...
Robot-Assisted Composite Manufacturing Based on Machine Learning Applied to Multi-view Computer Vision
- Abtin Djavadifar,
- John Brandon Graham-Knight,
- Kashish Gupta,
- Marian Körber,
- Patricia Lasserre,
- Homayoun Najjaran
This paper introduces an automated wrinkle detection method on semi-finished fiber products in the aerospace manufacturing industry. Machine learning, computer vision techniques, and evidential reasoning are combined to detect wrinkles during the ...
Tile Priorities in Adaptive 360-Degree Video Streaming
For video applications, tiled streaming is a popular way to deliver viewport dependent 360-degree video. Unfortunately, dynamic adaptation to network bandwidth fluctuations of such video streams is still a challenge. This paper proposes a method ...
Improving Temporal Stability in Inverse Tone-Mapped High Dynamic Range Video
Inverse tone mapping (ITM) is widely used to convert a standard dynamic range (SDR) image to its high dynamic range (HDR) version. While using frame-specific ITMs for a video sequence, temporal stability needs to be maintained not to cause visual ...
Front Matter
Assessing the Capability of Deep-Learning Models in Parkinson’s Disease Diagnosis
Parkinson’s Disease is one of the leading age-related neurological disorders affecting the general population. Current diagnostic techniques rely on patient symptoms rather than biomarkers. Symptomatic diagnoses are subjective and can vary highly. ...
Remote Photoplethysmography (rPPG) for Contactless Heart Rate Monitoring Using a Single Monochrome and Color Camera
Human vital signs are essential information that are closely related to both physical cardiac assessments and psychological emotion studies. One of the most important data is the heart rate, which is closely connected to the clinical state of the ...
Index Terms
- Smart Multimedia: Second International Conference, ICSM 2019, San Diego, CA, USA, December 16–18, 2019, Revised Selected Papers