- research-article, December 2023
Lite FPN_SSD: A Reconfiguration SSD with Adapting Feature Pyramid Network Scheme for Small Object Detection
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 493–500, https://doi.org/10.1145/3628797.3629020
Detecting small objects poses a significant challenge in computer vision because of the low resolution and fuzzy feature representation. Although one-stage detection techniques alleviate the problem caused by scale difference to some extent, they also ...
- research-article, December 2023
Boosting Facial Landmark Detection via Self-supervised and Semi-supervised Learning
Chau Nguyen Minh, Toan Nguyen Ngoc, Tuyen Le Dinh, Sang Dinh Viet, Pooi-Mun Wong, Chin-Boon Chng, Chee-Kong Chui
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 485–492, https://doi.org/10.1145/3628797.3629017
Keypoint detection is one of the main focused fields in computer vision with various applications. Traditional fully-supervised deep learning methods currently dominate the field with impressive accuracy, but typically require careful, expensive, and ...
- research-article, December 2023
ConvTransNet: Merging Convolution with Transformer to Enhance Polyp Segmentation
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 631–638, https://doi.org/10.1145/3628797.3629014
Colonoscopy is widely acknowledged as the most efficient screening method for detecting colorectal cancer and its early stages, such as polyps. However, the procedure faces challenges with high miss rates due to the heterogeneity of polyps and the ...
- research-article, December 2023
Resnet Video 3D for Gait Retrieval: A Deep Learning Approach to Human Identification
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 886–892, https://doi.org/10.1145/3628797.3629013
Gait, the distinctive way a person walks, is a useful biometric trait for various applications such as crime prevention, forensic identification, and social security. Gait retrieval, which aims to find the person who matches a given gait, is an active ...
- research-article, December 2023
Volumetric CT Segmentation with Mask Propagation Using Segment Anything
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 623–630, https://doi.org/10.1145/3628797.3629012
Medical imaging is both an interesting and challenging field for researchers, mainly due to their lack of labeled data, especially in segmentation tasks. Many interactive system has been introduced to help streamline the annotation workflow, and most ...
- research-article, December 2023
LSegDiff: A Latent Diffusion Model for Medical Image Segmentation
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 456–462, https://doi.org/10.1145/3628797.3629010
Initially designed for image generation, diffusion models can also be effectively applied to various tasks, including semantic segmentation. However, most existing diffusion-based approaches for semantic segmentation operate in high-dimensional pixel ...
- research-article, December 2023
Language Knowledge-Assisted in Topology Construction for Skeleton-Based Action Recognition
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 443–449, https://doi.org/10.1145/3628797.3629008
Skeleton-based action recognition is a challenging problem due to the high dimensionality and noisy nature of skeleton data. Graph convolution networks (GCNs), which use graph topology to extract representative features, have been effective for skeleton-...
- research-article, December 2023
Multiclass Skin Disease Classification within Dermoscopic Images Using Deep Neural Networks
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 607–614, https://doi.org/10.1145/3628797.3629001
Computer-aided skin lesion classification has been gaining attention in dermoscopy and skin cancer diagnosis, as early detection reduces the complexity of the treatment process. Various techniques that utilize the power of deep neural networks have been ...
- research-article, December 2023
Sharpness and Gradient Aware Minimization for Memory-based Continual Learning
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 189–196, https://doi.org/10.1145/3628797.3629000
Memory-based Continual Learning methods (CL) preserve performance on old data by storing a small buffer of seen samples to re-learn with current data. Despite their impressive results, these methods may still obtain sub-optimal solutions as a result of ...
- research-article, December 2023
Automatic Step Recognition with Video and Kinematic Data for Intelligent Operating Room and Beyond
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 599–606, https://doi.org/10.1145/3628797.3628999
With the continuous development of intelligent operating room systems, the segmentation and automatic recognition of surgical workflow have become challenging research fields. In recent years, an increasing number of models have been proposed to address ...
- research-article, December 2023
Efficient Video Retrieval with Advanced Deep Learning Models
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 945–952, https://doi.org/10.1145/3628797.3628995
Video retrieval is the process of finding specific video content in a large database. This is a crucial challenge in the age of digital multimedia. This article proposes a new approach to video retrieval using advanced deep learning models to extract ...
- research-article, December 2023
Sketch2Reality: Immersive 3D Indoor Scene Synthesis via Sketches
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 863–869, https://doi.org/10.1145/3628797.3628991
Sketching indoor scenes is helpful in daily activities as it allows for quick visualization and planning of room layouts, furniture arrangements, design ideas, or scene creation for games and entertainment. This motivates our proposal of Sketch2Reality, ...
- research-article, December 2023
IncepSE: Leveraging InceptionTime's performance with Squeeze and Excitaion mechanism in ECG analysis
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 578–584, https://doi.org/10.1145/3628797.3628987
Our study focuses on the potential for modifications of Inception-like architecture within the electrocardiogram (ECG) domain. To this end, we introduce IncepSE, a novel network characterized by strategic architectural incorporation that leverages the ...
- research-article, December 2023
Zero-shot Video Retrieval using CLIP with Temporally Ordered Multi-query Scoring
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 938–944, https://doi.org/10.1145/3628797.3628984
In this work, we present a new method for video retrieval using OpenAI’s CLIP and Temporally Ordered Multi-query Scoring (TOMS). Our approach extends CLIP with a scoring function for matching multiple ordered queries, which enables fast, accurate video ...
- research-article, December 2023
Combining Deep Learning And Medical Knowledge to Detect Cadiomegaly and Pleural Effusion in Chest X-rays Diagnosis
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 562–569, https://doi.org/10.1145/3628797.3628981
X-ray imaging plays a crucial role in diagnosing various medical conditions, especially those affecting the respiratory and cardiovascular systems. However, interpreting X-ray images can be time-intensive for radiologists. This paper addresses this ...
- research-article, December 2023
Optimizing Results in Aerial Images through Post-Processing Techniques on YOLOv7
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 409–414, https://doi.org/10.1145/3628797.3628980
Object detection in aerial images has garnered significant attention from the research community in recent years. The challenges posed by small objects, diverse orientations, and complex backgrounds have spurred extensive research efforts. In this paper,...
- research-article, December 2023
Impact of the ground truth quality for handwriting recognition
Michael Jungo, Lars Vögtlin, Atefeh Fakhari, Nathan Wegmann, Rolf Ingold, Andreas Fischer, Anna Scius-Bertrand
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 135–140, https://doi.org/10.1145/3628797.3628976
Handwriting recognition is a key technology for accessing the content of old manuscripts, helping to preserve cultural heritage. Deep learning shows an impressive performance in solving this task. However, to achieve its full potential, it requires a ...
- research-article, December 2023
Bayesian method for bee counting with noise-labeled data
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 401–408, https://doi.org/10.1145/3628797.3628969
Bee counting is an essential task for monitoring the health of bee colonies. However, it is challenging, as bees are often small and difficult to see. One approach to bee counting is detecting individuals and using that information to count. However, ...
- research-article, December 2023
Improving Multilingual Neural Machine Translation with Artificial Labels
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 79–84, https://doi.org/10.1145/3628797.3628964
Inspired by the work which uses Artificial Translation Units for generation of synthetic data in low-resource Neural Machine Translation systems [12], we propose using these translation units to enhance ability of sharing information between translation ...
- research-article, December 2023
RoSENet: Rotary Squeeze and Excitation for Vietnamese Food Recognition
SOICT '23: Proceedings of the 12th International Symposium on Information and Communication Technology, Pages 71–78, https://doi.org/10.1145/3628797.3628962
Along with the accelerated impact of social media in Vietnam, food recognition presents unique opportunities for food identification, food sharing, and tourist attractions. However, the literature on Vietnamese food is mostly still unexplored. Moreover, ...