16th VISIGRAPP 2021 - Volume 5: VISAPP
- Giovanni Maria Farinella, Petia Radeva, José Braz, Kadi Bouatouch:
Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2021, Volume 5: VISAPP, Online Streaming, February 8-10, 2021. SCITEPRESS 2021, ISBN 978-989-758-488-6
Invited Speakers
- Federico Tombari:
3D Indoor Scene Understanding with Scene Graphs and Self-supervision. VISIGRAPP 2021: 5
- Dieter Schmalstieg:
Visualization in the Real World: Confluence of Visualization and Augmented Reality. VISIGRAPP 2021: 7
- Nathalie Henry Riche:
Keynote Lecture. VISIGRAPP 2021: 9
Image and Video Understanding
- Gabriela Vozáriková, Richard Stana, Gabriel Semanisin:
Clothing Parsing using Extended U-Net. 15-24
- Shi Guo, Yang Liu, Yong Ni, Wei Ni:
Lightweight SSD: Real-time Lightweight Single Shot Detector for Mobile Devices. 25-35
- Shogo Fukuda, Masashi Nishiyama, Yoshio Iwai:
Reduction in Communication via Image Selection for Homomorphic Encryption-based Privacy-protected Person Re-identification. 36-47
- João Silva Ferreira, André Restivo, Hugo Sereno Ferreira:
Automatically Generating Websites from Hand-drawn Mockups. 48-58
- Matthia Sabatelli, Mike Kestemont, Pierre Geurts:
On the Transferability of Winning Tickets in Non-natural Image Datasets. 59-69
- Rajat Sharma, Tobias Schwandt, Christian Kunert, Steffen Urban, Wolfgang Broll:
Point Cloud Upsampling and Normal Estimation using Deep Learning for Robust Surface Reconstruction. 70-79
- Nerea Aranjuelo, Jorge García, Luis Unzueta, Sara García, Unai Elordi, Oihana Otaegui:
Building Synthetic Simulated Environments for Configuring and Training Multi-camera Systems for Surveillance Applications. 80-91
- Khadija Khaldi, Shishir K. Shah:
CUPR: Contrastive Unsupervised Learning for Person Re-identification. 92-100
- Ruslan Rakhimov, Denis Volkhonskiy, Alexey Artemov, Denis Zorin, Evgeny Burnaev:
Latent Video Transformer. 101-112
- Dinesh Kumar, Dharmendra Sharma:
Feature Map Upscaling to Improve Scale Invariance in Convolutional Neural Networks. 113-122
- Najda Vidimlic, Alexandra Levin, Mohammad Loni, Masoud Daneshtalab:
Image Synthesisation and Data Augmentation for Safe Object Detection in Aircraft Auto-landing System. 123-135
- Hao Sun, Nick E. Pears, Hang Dai:
A Human Ear Reconstruction Autoencoder. 136-145
- Daniel Koudouna, Kasim Terzic:
Few-shot Linguistic Grounding of Visual Attributes and Relations using Gaussian Kernels. 146-156
- Amr M. Nagy, László Czúni:
Detecting Object Defects with Fusioning Convolutional Siamese Neural Networks. 157-163
- Marlon Marcon, Olga Regina Pereira Bellon, Luciano Silva:
Towards Real-time Object Recognition and Pose Estimation in Point Clouds. 164-174
- Simon Evain, Christine Guillemot:
A Neural Network with Adversarial Loss for Light Field Synthesis from a Single Image. 175-184
- Luca Ciampi, Carlos Santiago, João Paulo Costeira, Claudio Gennaro, Giuseppe Amato:
Domain Adaptation for Traffic Density Estimation. 185-195
- Zainy M. Malakan, Nayyer Aafaq, Ghulam Mubashar Hassan, Ajmal Mian:
Contextualise, Attend, Modulate and Tell: Visual Storytelling. 196-205
- Kaiqiang Huang, Sarah Jane Delany, Susan McKeever:
Fairer Evaluation of Zero Shot Action Recognition in Videos. 206-215
- Jacek Komorowski, Grzegorz Kurzejamski, Monika Wysoczanska, Tomasz Trzcinski:
Global Point Cloud Descriptor for Place Recognition in Indoor Environments. 216-224
- Alexander Gillert, Uwe Freiherr von Lukas:
Towards Combined Open Set Recognition and Out-of-Distribution Detection for Fine-grained Classification. 225-233
- Haruya Ishikawa, Masaki Hayashi, Trong Huy Phan, Kazuma Yamamoto, Makoto Masuda, Yoshimitsu Aoki:
Analysis of Recent Re-Identification Architectures for Tracking-by-Detection Paradigm in Multi-Object Tracking. 234-244
- Mathias Gudiksen, Sebastian Falk, Lasse Nymark Hansen, Frederik Brønnum Jensen, Andreas Møgelmose:
Facial Exposure Quality Estimation for Aesthetic Evaluation. 247-255
- Rajiv Kumar, Rishabh Dabral, G. Sivakumar:
Learning Unsupervised Cross-domain Image-to-Image Translation using a Shared Discriminator. 256-264
- Calin Timbus, Vlad Miclea, Camelia Lemnaru:
Approaching the Semantic Segmentation in Medical Problems: A Solution for Pneumothorax Detection. 265-272
- Fatma Bouhlel, Hazar Mliki, Mohamed Hammami:
Crowd Behavior Analysis based on Convolutional Neural Network: Social Distancing Control COVID-19. 273-280
- Gerald A. Zwettler, Christoph Praschl, David Baumgartner, Tobias Zucali, Dora Turk, Martin Hanreich, Andreas Schuler:
Three-step Alignment Approach for Fitting a Normalized Mask of a Person Rotating in A-Pose or T-Pose Essential for 3D Reconstruction based on 2D Images and CGI Derived Reference Target Pose. 281-292
- Masaki Sugimoto, Ryosuke Furuta, Yukinobu Taniguchi:
Weakly-supervised Human-object Interaction Detection. 293-300
- Yacine Yaddaden, Sylvie Daniel, Denis Laurendeau:
Online Point Cloud Object Recognition System using Local Descriptors for Real-time Applications. 301-308
- Mandhatya Singh, Puneet Goyal:
ChartSight: An Automated Scheme for Assisting Visually Impaired in Understanding Scientific Charts. 309-318
- Tobias Scheck, Ana Pérez Grassi, Gangolf Hirtz:
Unsupervised Domain Adaptation from Synthetic to Real Images for Anchorless Object Detection. 319-327
- Alessandro Masullo, Toby Perrett, Dima Damen, Tilo Burghardt, Majid Mirmehdi:
No Need for a Lab: Towards Multi-sensory Fusion for Ambient Assisted Living in Real-world Living Homes. 328-337
- Fred N. Kiwanuka, Omar Eltaher Abuelmaatti, Anang Hudaya Muhamad Amin, Brian J. Mukwaya:
Tropical Skin Disease Classification using Connected Attribute Filters. 338-345
- Daniel Lehmann, Marc Ebner:
Are Image Patches Beneficial for Initializing Convolutional Neural Network Models? 346-353
- Alessandro Simoni, Andrea D'Eusanio, Stefano Pini, Guido Borghi, Roberto Vezzani:
Improving Car Model Classification through Vehicle Keypoint Localization. 354-361
- Luca Ballan, Ombretta Strafforello, Klamer Schutte:
Long-term Behaviour Recognition in Videos with Actor-focused Region Attention. 362-369
- Nikolas Gomes de Sá, Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette:
A Multi-level Rank Correlation Measure for Image Retrieval. 370-378
- Chihiro Nakatsuka, Jianfeng Xu, Kazuyuki Tasaka:
Learning Joint Twist Rotation for 3D Human Pose Estimation from a Single Image. 379-386
- Quentin Portes, José Mendès Carvalho, Julien Pinquier, Frédéric Lerasle:
Multimodal Neural Network for Sentiment Analysis in Embedded Systems. 387-398
- Xingye Li, Zhigang Zhu:
A Snapshot-based Approach for Self-supervised Feature Learning and Weakly-supervised Classification on Point Cloud Data. 399-408
- Satyajit Tourani, Dhagash Desai, Udit Singh Parihar, Sourav Garg, Ravi Kiran Sarvadevabhatla, Michael Milford, K. Madhava Krishna:
Early Bird: Loop Closures from Opposing Viewpoints for Perceptually-aliased Indoor Environments. 409-416
- Florian Teich, Timo Lüddecke, Florentin Wörgötter:
3D Object Classification via Part Graphs. 417-426
- Mickael Delamare, Cyril Laville, Adnane Cabani, Houcine Chafouk:
Graph Convolutional Networks Skeleton-based Action Recognition for Continuous Data Stream: A Sliding Window Approach. 427-435
- David Montero, Luis Unzueta, Jon Goenetxea, Nerea Aranjuelo, Estíbaliz Loyo, Oihana Otaegui, Marcos Nieto:
Multi-Stage Dynamic Batching and On-Demand I-Vector Clustering for Cost-effective Video Surveillance. 436-443
- Yoshiaki Homma, Toshiki Kikuchi, Yuko Ozasa:
Non-Maximum Suppression for Unknown Class Objects using Image Similarity. 444-449
- Nima Khairdoost, Steven S. Beauchemin, Michael A. Bauer:
Road Lane Detection and Classification in Urban and Suburban Areas based on CNNs. 450-457
- Paola Cañas, Juan Diego Ortega, Marcos Nieto, Oihana Otaegui:
Detection of Distraction-related Actions on DMD: An Image and a Video-based Approach Comparison. 458-465
- Julia Böhlke, Dimitri Korsch, Paul Bodesheim, Joachim Denzler:
Lightweight Filtering of Noisy Web Data: Augmenting Fine-grained Datasets with Selected Internet Images. 466-477
- Sergey Pavlov, Yoshihiro Kanamori, Yuki Endo:
Line2depth: Indoor Depth Estimation from Line Drawings. 478-483
- Hussein Chaaban, Michèle Gouiffès, Annelies Braffort:
Automatic Annotation and Segmentation of Sign Language Videos: Base-level Features and Lexical Signs Classification. 484-491
- Takuya Tsukahara, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
Collaborative Learning of Generative Adversarial Networks. 492-499
- Arturo Fuentes, Francisco Javier Sánchez, Thomas Voncina, Jorge Bernal:
LAMV: Learning to Predict Where Spectators Look in Live Music Performances. 500-507
- Anca Ignat, Ioan Pavaloi:
Occluded Iris Recognition using SURF Features. 508-515
- Tyler C. Folsom:
Convolutional Neural Networks with Fixed Weights. 516-523
- Luca Bergamini, Stefano Pini, Alessandro Simoni, Roberto Vezzani, Simone Calderara, Rick B. D'Eath, Robert B. Fisher:
Extracting Accurate Long-term Behavior Changes from a Large Pig Dataset. 524-533
- Mikaël Jacquemont, Thomas Vuillaume, Alexandre Benoît, Gilles Maurin, Patrick Lambert:
Multi-Task Architecture with Attention for Imaging Atmospheric Cherenkov Telescope Data Analysis. 534-544
- Jun Yang, Zhaogong Zhang, Xuexia Wang:
GAPF: Curve Text Detection based on Generative Adversarial Networks and Pixel Fluctuations. 545-552
- Cheng Li, Arash Pourtaherian, Lonneke van Onzenoort, Peter H. N. de With:
Automated Infant Monitoring based on R-CNN and HMM. 553-560
- David Duque-Arias, Santiago Velasco-Forero, Jean-Emmanuel Deschaud, François Goulette, Andrés Serna, Etienne Decencière, Beatriz Marcotegui:
On Power Jaccard Losses for Semantic Segmentation. 561-568
- Jishu Miao, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
3D Object Detection with Normal-map on Point Clouds. 569-576
- Ryota Ikedo, Kazuhiro Hotta:
Feature Sharing Cooperative Network for Semantic Segmentation. 577-584
- Jason Jung, Naveed Akhtar, Ghulam Mubashar Hassan:
Analysing Adversarial Examples for Deep Learning. 585-592
- Florian Kälber, Okan Köpüklü, Nicolas H. Lehment, Gerhard Rigoll:
U-Net based Zero-hour Defect Inspection of Electronic Components and Semiconductors. 593-601
- Heba Hassan, Marwan Torki, Mohamed E. Hussein:
SCAN: Sequence-character Aware Network for Text Recognition. 602-609
- Jinsong Liu, Mark P. Philipsen, Thomas B. Moeslund:
Supervised versus Self-supervised Assistant for Surveillance of Harbor Fronts. 610-617
- Robin Deléarde, Camille Kurtz, Philippe Dejean, Laurent Wendling:
Segment My Object: A Pipeline to Extract Segmented Objects in Images based on Labels or Bounding Boxes. 618-625
- Masahiro Mitsuhara, Hiroshi Fukui, Yusuke Sakashita, Takanori Ogata, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
Embedding Human Knowledge into Deep Neural Network via Attention Map. 626-636
- Feiyan Hu, Eva Mohedano, Noel E. O'Connor, Kevin McGuinness:
Temporal Bilinear Encoding Network of Audio-visual Features at Low Sampling Rates. 637-644
- Mina Basirat, Peter M. Roth:
S*ReLU: Learning Piecewise Linear Activation Functions via Particle Swarm Optimization. 645-652
- Ahmed Nady, Elsayed E. Hemayed:
Player Identification in Different Sports. 653-660
- Amir Ismail, Maroua Mehri, Anis Sahbani, Najoua Essoukri Ben Amara:
Performance Benchmarking of YOLO Architectures for Vehicle License Plate Detection from Real-time Videos Captured by a Mobile Robot. 661-668
- Chaudhary Muhammad Aqdus Ilyas, Rita Nunes, Kamal Nasrollahi, Matthias Rehm, Thomas B. Moeslund:
Deep Emotion Recognition through Upper Body Movements and Facial Expression. 669-679
- Jon Goenetxea, Luis Unzueta, Unai Elordi, Oihana Otaegui, Fadi Dornaika:
Efficient Multi-task based Facial Landmark and Gesture Detection in Monocular Images. 680-687
- Kazimierz Choros:
Audience Shot Detection for Automatic Analysis of Soccer Sports Videos. 688-695
Motion, Tracking and Stereo Vision
- Yong Deng, Jimin Xiao, Steven Zhiying Zhou:
A Lightweight Real-time Stereo Depth Estimation Network with Dynamic Upsampling Modules. 701-710
- Swapnil Daga, Gokul B. Nair, Anirudha Ramesh, Rahul Sajnani, Junaid Ahmed Ansari, K. Madhava Krishna:
BirdSLAM: Monocular Multibody SLAM in Bird's-eye View. 711-721
- Mikael Persson, Per-Erik Forssén:
Independently Moving Object Trajectories from Sequential Hierarchical Ransac. 722-731
- Susana Ruano, Aljosa Smolic:
A Benchmark for 3D Reconstruction from Aerial Imagery in an Urban Environment. 732-741
- Abdelrahman Eldesokey, Michael Felsberg:
Normalized Convolution Upsampling for Refined Optical Flow Estimation. 742-752
- Tiago S. Nazaré, Rodrigo Fernandes de Mello, Moacir A. Ponti:
Investigating 3D Convolutional Layers as Feature Extractors for Anomaly Detection Systems Applied to Surveillance Videos. 753-762
- Lucas Valença, Luca Silva, Thiago Chaves, Arlindo Gómes, Lucas Silva Figueiredo, Lucio Cossio, Sébastien Tandel, João Paulo Lima, Francisco Simões, Veronica Teichrieb:
Real-time Monocular 6DoF Tracking of Textureless Objects using Photometrically-enhanced Edges. 763-773
- Martin Ahrnbom, Mikael G. Nilsson, Håkan Ardö:
Real-time and Online Segmentation Multi-target Tracking with Track Revival Re-identification. 777-784
- Rhoda Gbadeyan, Chris Joslin:
Object based Hybrid Video Compression. 785-792
- Zachary Mueller, Sotirios Diamantas:
Modeling a priori Unknown Environments: Place Recognition with Optical Flow Fingerprints. 793-800
- Ivan A. Nikolov, Claus B. Madsen:
Quantifying Wind Turbine Blade Surface Roughness using Sandpaper Grit Sizes: An Initial Exploration. 801-808
- Julian Seuffert, Ana Pérez Grassi, Tobias Scheck, Gangolf Hirtz:
A Study on the Influence of Omnidirectional Distortion on CNN-based Stereo Vision. 809-816
- Ghani O. Lawal, Michael A. Greenspan:
Procam Calibration from a Single Pose of a Planar Target. 817-827
- Shuhei Tarashima:
Object Hypotheses as Points for Efficient Multi-Object Tracking. 828-835
Mobile and Egocentric Vision for Humans and Robots
- Pourya Hoseini, Shuvo Kumar Paul, Mircea Nicolescu, Monica N. Nicolescu:
A Surface and Appearance-based Next Best View System for Active Object Recognition. 841-851
- Ayyappa Swamy Thatavarthy, Tanu Sharma, Harshit Sankhla, Mukul Khanna, K. Madhava Krishna:
Multi-view Planarity Constraints for Skyline Estimation from UAV Images in City Scale Urban Environments. 852-860
- Koji Takeda, Kanji Tanaka:
Boosting Self-localization with Graph Convolutional Neural Networks. 861-868
- Hemang Chawla, Matti Jukola, Shabbir Marzban, Elahe Arani, Bahram Zonooz:
Practical Auto-calibration for Spatial Scene-understanding from Crowdsourced Dashcamera Videos. 869-880
- Cheng Li, Genyu Song, Arash Pourtaherian, Peter H. N. de With:
Dual CNN-based Face Tracking Algorithm for an Automated Infant Monitoring System. 881-887
- Joakim Bruslund Haurum, Moaaz M. J. Allahham, Mathias S. Lynge, Kasper Schøn Henriksen, Ivan A. Nikolov, Thomas B. Moeslund:
Sewer Defect Classification using Synthetic Point Clouds. 891-900
- Stefan Saftescu, Paul Newman:
Learning to Correct Reconstructions from Multiple Views. 901-909
- Yasuyo Kita, Ichiro Matsuda, Nobuyuki Kita:
Integration of Multiple RGB-D Data of a Deformed Clothing Item into Its Canonical Shape. 910-918
- Mateusz Majcher, Bogdan Kwolek:
Fiducial Points-supported Object Pose Tracking on RGB Images via Particle Filtering with Heuristic Optimization. 919-926
- Philip Scales, Mykhailo Rimel, Olivier Aycard:
Visual-based Global Localization from Ceiling Images using Convolutional Neural Networks. 927-934
- Sathyanarayanan N. Aakur, Arunkumar Bagavathi:
Unsupervised Gaze Prediction in Egocentric Videos by Energy-based Surprise Modeling. 935-942
- Muhammad Fikko Fadjrimiratno, Yusuke Hatae, Tetsu Matsukawa, Einoshin Suzuki:
Detecting Anomalies from Human Activities by an Autonomous Mobile Robot based on "Fast and Slow" Thinking. 943-953
- Daniele Di Mauro, Antonino Furnari, Giovanni Signorello, Giovanni Maria Farinella:
Unsupervised Domain Adaptation for 6DOF Indoor Localization. 954-961