Project Details
Contemporary Methods of Processing, Analysis and Display of Multimedia and 3D Data (Soudobé metody zpracování, analýzy a zobrazování multimediálních a 3D dat)
Project Period: 1 March 2023 - 31 December 2025
Project Type: grant
Code: FIT-S-23-8278
Agency: Brno University of Technology
Program: Internal BUT Projects (Vnitřní projekty VUT)
Keywords: multimedia data, 3D data, data processing, data analysis, data display
Multimedia and 3D data are essential to a growing number of applications of modern computer systems, where they cannot be replaced by other kinds of data. Processing, displaying, and analyzing such data is known to be difficult and computationally demanding, which makes research in this area both challenging and important. The project continues the earlier project "Modern methods of processing, analysis and display of multimedia and 3D data".
Bambušek Daniel, Ing. (UPGM FIT VUT)
Bartl Vojtěch, Ing., Ph.D. (UPGM FIT VUT)
Bažout David, Ing. (UPGM FIT VUT)
Beneš Karel, Ing. (UPGM FIT VUT)
Beran Vítězslav, doc. Ing., Ph.D. (UPGM FIT VUT)
Bobák Petr, Ing. (UPGM FIT VUT)
Brukner Jan, Ing. (UPGM FIT VUT)
Burget Lukáš, doc. Ing., Ph.D. (UPGM FIT VUT)
Čadík Martin, doc. Ing., Ph.D. (UPGM FIT VUT)
Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT)
Dobeš Petr, Ing. (UPGM FIT VUT)
Dočekal Martin, Ing. (UPGM FIT VUT)
Dubovec Pavol, Ing. (FIT VUT)
Fajčík Martin, Ing., Ph.D. (UPGM FIT VUT)
Hanák Jiří, Ing. (UPGM FIT VUT)
Herout Adam, prof. Ing., Ph.D. (UPGM FIT VUT)
Hříbek David, Ing. (UPGM FIT VUT)
Chlubna Tomáš, Ing. (UPGM FIT VUT)
Chudý Peter, doc. Ing., Ph.D. MBA (UPGM FIT VUT)
Kapinus Michal, Ing. (UPGM FIT VUT)
Karas Matej, Ing. (UPGM FIT VUT)
Kišš Martin, Ing. (UPGM FIT VUT)
Klem Richard, Ing. (FIT VUT)
Klepárník Petr, Ing., Ph.D. (UPGM FIT VUT)
Kocour Martin, Ing. (UPGM FIT VUT)
Kohút Jan, Ing. (UPGM FIT VUT)
Landini Federico Nicolás (UPGM FIT VUT)
Liška Jakub, Ing. (FIT VUT)
Maršík Lukáš, Ing. (UPGM FIT VUT)
Mošner Ladislav, Ing. (UPGM FIT VUT)
Munzar Milan, Ing. (UPGM FIT VUT)
Nguyen Son Hai, Ing. (UPGM FIT VUT)
Nosko Svetozár, Ing. (UPGM FIT VUT)
Novák Jiří, Ing., Ph.D. (UPGM FIT VUT)
Ondřej Karel, Ing. (UPGM FIT VUT)
Pavlus Ján, Ing. (UPGM FIT VUT)
Peng Junyi, Msc. Eng. (UPGM FIT VUT)
Polášek Tomáš, Ing. (UPGM FIT VUT)
Reich Bořek, Ing. (UPGM FIT VUT)
Sedláček Šimon, Ing. (FIT VUT)
Smrž Pavel, doc. RNDr., Ph.D. (UPGM FIT VUT)
Strýček Šimon, Ing. (FIT VUT)
Španěl Michal, doc. Ing., Ph.D. (UPGM FIT VUT)
Špaňhel Jakub, Ing., Ph.D. (UPGM FIT VUT)
Šůstek Martin, Ing. (FIT VUT)
Švec Ján, Ing. (UPGM FIT VUT)
Švec Tomáš, Ing. (UPGM FIT VUT)
Tesařová Alena, Ing. (UPGM FIT VUT)
Vendrame Katia, Ing. (FIT VUT)
Vlnas Michal, Ing. (UPGM FIT VUT)
2024
- BHATTACHARJEE Mrinmoy, NIGMATULINA Iuliia, PRASAD Amrutha, RANGAPPA Pradeep, MADIKERI Srikanth, MOTLÍČEK Petr, HELMKE Hartmut and KLEINERT Matthias. Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 12652-12656. ISBN 979-8-3503-4485-1.
- NOVÁK Jiří and CHUDÝ Peter. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In: Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024, pp. 104-115. ISBN 978-3-031-53968-8. ISSN 0302-9743.
- CHLUBNA Tomáš, ZEMČÍK Pavel and MILET Tomáš. Efficient Random-Access GPU Video Decoding for Light-Field Rendering. Journal of Visual Communication and Image Representation, vol. 2024, no. 102, pp. 1-14. ISSN 1047-3203.
- MACIEJEWSKI Matthew, KLEMENT Dominik, HUANG Ruizhe, WIESNER Matthew and KHUDANPUR Sanjeev. Evaluating the Santa Barbara Corpus: Challenges of the Breadth of Conversational Spoken Language. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 2155-2160. ISSN 1990-9772.
- PRASAD Amrutha, CAROFILIS Andrés, VANDERREYDT Geoffroy, KHALIL Driss, MADIKERI Srikanth, MOTLÍČEK Petr and SCHUEPBACH Christof. Fine-Tuning Self-Supervised Models for Language Identification Using Orthonormal Constraint. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11921-11925. ISBN 979-8-3503-4485-1.
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. Multimedia Tools and Applications, vol. 2024, no. 83, pp. 20265-20287. ISSN 1573-7721.
- BENEŠ Karel, KOCOUR Martin and BURGET Lukáš. Hystoc: Obtaining Word Confidences for Fusion of End-To-End ASR Systems. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Seoul: IEEE Signal Processing Society, 2024, pp. 11276-11280. ISBN 979-8-3503-4485-1.
- CHLUBNA Tomáš, MILET Tomáš and ZEMČÍK Pavel. Lightweight All-Focused Light Field Rendering. Computer Vision and Image Understanding, vol. 244, no. 7, 2024, pp. 7-8. ISSN 1077-3142.
- KUBÍK Tibor and ŠPANĚL Michal. LMVSegRNN and Poseidon3D: Addressing Challenging Teeth Segmentation Cases in 3D Dental Surface Orthodontic Scans. Bioengineering, vol. 11, no. 10, 2024, pp. 1-18. ISSN 2306-5354.
- ESPUNA Fontcuberta Aleix, PRASAD Amrutha, MOTLÍČEK Petr, MADIKERI Srikanth and SCHUEPBACH Christof. Normalising Flows for Speaker and Language Recognition Backend. In: Proceedings of Odyssey 2024: The Speaker and Language Recognition Workshop. Quebec: International Speech Communication Association, 2024, pp. 74-80.
- BOBÁK Petr, ČMOLÍK Ladislav and ČADÍK Martin. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 9, 2024, pp. 5908-5922. ISSN 1077-2626.
- NOVÁK Jiří, HANÁK Jiří and CHUDÝ Peter. Reliability-Based Control System Optimization in Uncertain Conditions. In: AIAA Aviation Forum and ASCEND, 2024. Las Vegas: American Institute of Aeronautics and Astronautics, 2024, pp. 1-15. ISBN 978-1-62410-716-0.
- YUSUF Bolaji, BASKAR Karthick Murali, ROSENBERG Andrew and RAMABHADRAN Bhuvana. Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models. In: Proceedings of Interspeech 2024. Kos: International Speech Communication Association, 2024, pp. 792-796. ISSN 1990-9772.
- PRASAD Amrutha, MADIKERI Srikanth, KHALIL Driss, MOTLÍČEK Petr and SCHUEPBACH Christof. Speech and Language Recognition with Low-rank Adaptation of Pretrained Models. In: Proceedings of Interspeech. Kos Island: International Speech Communication Association, 2024, pp. 2825-2829. ISSN 1990-9772.
2023
- ZULUAGA-GOMEZ Juan, PRASAD Amrutha, NIGMATULINA Iuliia, MOTLÍČEK Petr and KLEINERT Matthias. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, vol. 10, no. 5, 2023, pp. 1-25. ISSN 2226-4310.
- KHALIL Driss, PRASAD Amrutha, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, NIGMATULINA Iuliia, MADIKERI Srikanth and SCHUEPBACH Christof. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, vol. 10, no. 10, 2023, pp. 1-14. ISSN 2226-4310.
- MOTLÍČEK Petr, PRASAD Amrutha, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver and KLEINERT Matthias. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. In: Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, pp. 1-9.
- HELMKE Hartmut, KLEINERT Matthias, AHRENHOLD Nils, EHR Heiko, MÜHLHAUSEN Thorsten, PINSKA Chauvin Ella, OHNEISER Oliver, KLAMERT Lucas, MOTLÍČEK Petr, PRASAD Amrutha, ZULUAGA-GOMEZ Juan and DOKIC Jelena. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. In: Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023, pp. 1-11.
- HANÁK Jiří, CHUDÝ Peter and VLK Jan. Collaborative Agents for Synthetic Tactical Training. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023, pp. 1-9. ISBN 979-8-3503-3357-2. ISSN 2155-7195.
- BHATTACHARJEE Mrinmoy, MOTLÍČEK Petr, NIGMATULINA Iuliia, HELMKE Hartmut, OHNEISER Oliver, KLEINERT Matthias and EHR Heiko. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. In: Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023, pp. 1-8.
- SKOWRON Marcin, BACKFRIED Gerhard, NAVAS Eva, BERZINŠ Aivars, VAN Den Bogaert Joachim, DE Jong Franciska, DEMARCO Andrea, POLÁK Peter, KOVÁČ Marek, POLÁK Peter, ROHDIN Johan A., ROSNER Michael, SANCHEZ Jon, SARATXAGA Ibon and SCHWARZ Petr. Deep Dive Speech Technology. European Language Equality. Cham: Springer Nature Switzerland AG, 2023, pp. 289-312. ISBN 978-3-031-28819-7.
- VILLATORO-TELLO Esaú, MADIKERI Srikanth, ZULUAGA-GOMEZ Juan, SHARMA Bidisha, SARFJOO Seyyed Saeed, NIGMATULINA Iuliia, MOTLÍČEK Petr, IVANOV Alexei V. and GANAPATHIRAJU Aravind. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
- BAŘINA David. Experimental lossless data compressor. Microprocessors and Microsystems, vol. 98, no. 4, 2023, pp. 104803-104803. ISSN 0141-9331.
- APAROVICH Maksim, KESIRAJU Santosh, DUFKOVÁ Aneta and SMRŽ Pavel. FIT BUT at SemEval-2023 Task 12: Sentiment Without Borders - Multilingual Domain Adaptation for Low-Resource Sentiment Classification. In: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023). Toronto (online): Association for Computational Linguistics, 2023, pp. 1518-1524. ISBN 978-1-959429-99-9.
- BAMBUŠEK Daniel, MATERNA Zdeněk, KAPINUS Michal, BERAN Vítězslav and SMRŽ Pavel. How Do I Get There? Overcoming Reachability Limitations of Constrained Industrial Environments in Augmented Reality Applications. In: 2023 IEEE Conference on Virtual Reality and 3D User Interfaces (VR). Shanghai: Institute of Electrical and Electronics Engineers, 2023, pp. 115-122. ISBN 979-8-3503-4815-6.
- TESAŘOVÁ Alena, HEROUT Adam, BAMBUŠEK Daniel and JUŘÍK Vojtěch. How to shoot yourself right with a smartphone? Virtual Reality, vol. 2023, no. 1, pp. 1-13. ISSN 1434-9957.
- MAI Florian, ZULUAGA-GOMEZ Juan, PARCOLLET Titouan and MOTLÍČEK Petr. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 2213-2217. ISSN 1990-9772.
- NIGMATULINA Iuliia, MADIKERI Srikanth, VILLATORO-TELLO Esaú, MOTLÍČEK Petr, ZULUAGA-GOMEZ Juan, PANDIA Karthick and GANAPATHIRAJU Aravind. Implementing contextual biasing in GPU decoder for online ASR. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 4494-4498. ISSN 1990-9772.
- GAVRIELIDES Andreas, SOPHOCLEOUS Marios, AGAPIOU George, LESSI Christina, ŠPAŇHEL Jakub, LENDINEZ Adrian, QIU Renxi and LI Dayou. Implementing Network Applications for 5G-Enabled Robots Through the 5G-ERA Platform. In: IFIP Advances in Information and Communication Technology. Artificial Intelligence Applications and Innovations, vol. 677. Cham: Springer Nature Switzerland AG, 2023, pp. 55-65. ISBN 978-3-031-34170-0. ISSN 1868-422X.
- BURDISSO Sergio, VILLATORO-TELLO Esaú, MADIKERI Srikanth and MOTLÍČEK Petr. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In: Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Dublin: International Speech Communication Association, 2023, pp. 3617-3621. ISSN 1990-9772.
- YUSUF Bolaji, GOURAV Aditya, GANDHE Ankur and BULYKO Ivan. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In: Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023, pp. 1-5. ISBN 978-1-7281-6327-7.
- VANDERREYDT Geoffroy, PRASAD Amrutha, KHALIL Driss, MADIKERI Srikanth, DEMUYNCK Kris and MOTLÍČEK Petr. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023, pp. 1-7. ISBN 979-8-3503-0689-7.
- POLÁŠEK Tomáš and ČADÍK Martin. Predicting Photovoltaic Power Production using High-Uncertainty Weather Forecasts. Applied Energy, vol. 2023, no. 339, pp. 120989-121004. ISSN 0306-2619.
- CHLUBNA Tomáš, MILET Tomáš, ZEMČÍK Pavel and KULA Michal. Real-Time Light Field Video Focusing and GPU Accelerated Streaming. Journal of Signal Processing Systems, vol. 95, no. 6, 2023, pp. 703-719. ISSN 1939-8115.
- KIŠŠ Martin, HRADIŠ Michal, BENEŠ Karel, BUCHAL Petr and KULA Michal. SoftCTC-semi-supervised learning for text recognition using soft pseudo-labels. International Journal on Document Analysis and Recognition (IJDAR), vol. 2024, no. 27, 2023, pp. 177-193. ISSN 1433-2825.
- NOVÁK Jiří and CHUDÝ Peter. Surrogate Modeling of Optimal Control Based Collision Avoidance System for Multirotor Unmanned Aerial Vehicles. In: AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023, pp. 1-7. ISBN 979-8-3503-3357-2. ISSN 2155-7195.
- POLÁŠEK Tomáš, ČADÍK Martin, KELLER Yosi and BENEŠ Bedřich. Vision UFormer: Long-Range Monocular Absolute Depth Estimation. Computers and Graphics, vol. 111, no. 4, 2023, pp. 180-189. ISSN 0097-8493.
2022
- BOITO Marcely Z., YUSUF Bolaji, ONDEL Yang Lucas Antoine Francois, VILLAVICENCIO Aline and BESACIER Laurent. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In: Proceedings of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages. Marseille: European Language Resources Association, 2022, pp. 1-9. ISBN 979-10-95546-91-7.
2024
- Convergence verification of the Collatz problem, software, 2024
  Authors: Bařina David
- Minimalist JPEG decoder & encoder, software, 2024
  Authors: Bařina David
- x3: Experimental Data Compressor, software, 2024
  Authors: Bařina David
2023
- KSPredict: Software for predicting the development of emergency events and crisis situations, software, 2023
  Authors: Klíma Ondřej, Neubauer Jiří, Polcerová Lenka, Králík Miroslav, Zeman Tomáš