Scene understanding

Applied Filters

People

Publications

Conferences

Publication Date

7 Results for: Book/Issue: MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,766,563 records)|Limit your search to The ACM Full-Text Collection (759,377 records)

Showing 1 - 7of7 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
January 2024
Adapting Object Detection to Fisheye Cameras: A Knowledge Distillation with Semi-Pseudo-Label Approach
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 111, Pages 1–6https://doi.org/10.1145/3595916.3628350

In this paper, we introduce a lightweight object detection system, custom-designed for fisheye cameras and optimized for quick deployment on embedded systems. Given the constraints of training solely on standard images, our methodology centers on the ...
0
76
Metrics
Total Citations0
Total Downloads76
Last 12 Months76
Last 6 weeks3
Get Access
research-article
January 2024
Monocular 3D Pose Estimation of Very Small Airplane in the Air
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 82, Pages 1–7https://doi.org/10.1145/3595916.3626456

In this paper, a novel pose estimation algorithm is proposed specifically for maneuvering airplanes in the air. The algorithm consists of two main stages. The first stage involves semantic segmentation of a monocular input image of a flying airplane, ...
0
73
Metrics
Total Citations0
Total Downloads73
Last 12 Months73
Last 6 weeks5
Get Access
research-article
January 2024
Multi-Scale Superpoint Network for 3D Point Cloud Semantic Segmentation
- Ft Zheng,
- Le Hui,
- Jin Xie,
- Haofeng Zhang
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 75, Pages 1–7https://doi.org/10.1145/3595916.3626449

3D point cloud semantic segmentation is a fundamental task for 3D scene understanding. However, most existing pipelines usually use k-NN or ball query operation to form hard neighborhoods, which may cross different semantic objects, resulting low-...
0
147
Metrics
Total Citations0
Total Downloads147
Last 12 Months147
Last 6 weeks8
Get Access
research-article
January 2024
Image Cropping under Design Constraints
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 40, Pages 1–7https://doi.org/10.1145/3595916.3626412

Image cropping is essential in image editing for obtaining a compositionally enhanced image. In display media, image cropping is a prospective technique for automatically creating media content. However, image cropping for media contents is often ...
0
53
Metrics
Total Citations0
Total Downloads53
Last 12 Months53
Last 6 weeks0
1
Supplementary Material
Appendix
Get Access
research-article
Open Access
January 2024
Adaptive Fusion for Visual Question Answering: Integrating Multi-Label Classification and Similarity Matching
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 12, Pages 1–7https://doi.org/10.1145/3595916.3626381

Visual Question Answering (VQA) is an important multimodal task in which models are required to answer questions based on visual cues. However, most visual question-answering models suffer from the language prior problem, which is caused by data bias. ...
0
191
Metrics
Total Citations0
Total Downloads191
Last 12 Months191
Last 6 weeks30
View online with eReader
View this article in HTML format
PDF
research-article
January 2024
From Pixels to Explanations: Uncovering the Reasoning Process in Visual Question Answering
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 7, Pages 1–9https://doi.org/10.1145/3595916.3626376

Visual reasoning requires models to construct a reasoning process towards the final decision. Previous studies have used attention maps or textual explanations to illustrate the reasoning process, but both have their limitations. Attention maps can be ...
0
128
Metrics
Total Citations0
Total Downloads128
Last 12 Months128
Last 6 weeks2
1
Supplementary Material
Appendix
Get Access
demonstration
January 2024
A consulting system for guiding various image recognitions
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in AsiaArticle No.: 108, Pages 1–3https://doi.org/10.1145/3595916.3626356

In recent years, various image recognition tasks have been used in many real-world applications thanks to the development and open sources of computer vision technologies. However, the expertise of users is often required for selecting appropriate ...
0
29
Metrics
Total Citations0
Total Downloads29
Last 12 Months29
Last 6 weeks0
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Adapting Object Detection to Fisheye Cameras: A Knowledge Distillation with Semi-Pseudo-Label Approach

Monocular 3D Pose Estimation of Very Small Airplane in the Air

Multi-Scale Superpoint Network for 3D Point Cloud Semantic Segmentation

Image Cropping under Design Constraints

Adaptive Fusion for Visual Question Answering: Integrating Multi-Label Classification and Similarity Matching

From Pixels to Explanations: Uncovering the Reasoning Process in Visual Question Answering

A consulting system for guiding various image recognitions