Project detail

Soudobé metody zpracování, analýzy a zobrazování multimediálních a 3D dat

Duration: 01.03.2023 — 28.02.2026

Funding resources

Brno University of Technology - Vnitřní projekty VUT

- whole funder (2023-01-01 - 2024-12-31)

On the project

Multimediální a 3D data jsou důležitými a potřebnými daty pro vzrůstající počet aplikací moderních počítačových systémů, v nichž je jejich využití nenahraditelné. Současně je známo, že zpracování takových dat je obtížné a výpočetně náročné a to platí i o jejich zobrazování a analýze. Proto je výzkum v této oblasti jedním z obtížnějších a důležitých. Projekt navazuje na dřívější projekt "Moderní metody zpracování, analýzy a zobrazování multimediálních a 3D dat".

Mark

FIT-S-23-8278

Default language

Czech

People responsible

Bambušek Daniel, Ing. - fellow researcher
Bartl Vojtěch, Ing. - fellow researcher
Bažout David, Ing. - fellow researcher
Beneš Karel, Ing. - fellow researcher
Beran Vítězslav, doc. Ing., Ph.D. - fellow researcher
Bobák Petr, Ing. - fellow researcher
Brukner Jan, Ing. - fellow researcher
Burget Lukáš, doc. Ing., Ph.D. - fellow researcher
Čadík Martin, doc. Ing., Ph.D. - fellow researcher
Černocký Jan, prof. Dr. Ing. - fellow researcher
Dobeš Petr, Ing. - fellow researcher
Dočekal Martin, Ing. - fellow researcher
Fajčík Martin, Ing., Ph.D. - fellow researcher
Hanák Jiří, Ing. - fellow researcher
Herout Adam, prof. Ing., Ph.D. - fellow researcher
Hříbek David, Ing. - fellow researcher
Chlubna Tomáš, Ing. - fellow researcher
Chudý Peter, doc. Ing., Ph.D., MBA - fellow researcher
Kapinus Michal, Ing., Ph.D. - fellow researcher
Karas Matej, Ing. - fellow researcher
Kišš Martin, Ing. - fellow researcher
Klepárník Petr, Ing., Ph.D. - fellow researcher
Kocour Martin, Ing. - fellow researcher
Kohút Jan, Ing. - fellow researcher
Landini Federico Nicolás - fellow researcher
Maršík Lukáš, Ing. - fellow researcher
Mošner Ladislav, Ing. - fellow researcher
Munzar Milan, Ing. - fellow researcher
Nguyen Son Hai, Ing. - fellow researcher
Nosko Svetozár, Ing. - fellow researcher
Novák Jiří, Ing. - fellow researcher
Ondřej Karel, Ing. - fellow researcher
Pavlus Ján, Ing. - fellow researcher
Peng Junyi, Master of Technology, MSc, Eng. - fellow researcher
Polášek Tomáš, Ing. - fellow researcher
Reich Bořek, Ing. - fellow researcher
Smrž Pavel, doc. RNDr., Ph.D. - fellow researcher
Španěl Michal, Ing., Ph.D. - fellow researcher
Špaňhel Jakub, Ing. - fellow researcher
Švec Ján, Ing. - fellow researcher
Švec Tomáš, Ing. - fellow researcher
Tesařová Alena, Ing. - fellow researcher
Vlnas Michal, Ing. - fellow researcher
Zemčík Pavel, prof. Dr. Ing., dr. h. c. - principal person responsible

Units

Department of Computer Graphics and Multimedia
- (2023-01-01 - 2025-12-31)
Faculty of Information Technology
- (2023-01-01 - 2025-12-31)

Results

POLÁŠEK, T.; ČADÍK, M.; KELLER, Y.; BENEŠ, B. Vision UFormer: Long-Range Monocular Absolute Depth Estimation. COMPUTERS & GRAPHICS-UK, 2023, vol. 111, no. 4, p. 180-189. ISSN: 0097-8493.
Detail

KIŠŠ, M.; HRADIŠ, M.; BENEŠ, K.; BUCHAL, P.; KULA, M. SoftCTC-semi-supervised learning for text recognition using soft pseudo-labels. International Journal on Document Analysis and Recognition, 2023, vol. 2024, no. 99, p. 1-17. ISSN: 1433-2825.
Detail

BHATTACHARJEE, M.; MOTLÍČEK, P.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M.; EHR, H. Customization of Automatic Speech Recognition Engines for Rare Word Detection Without Costly Model Re-Training. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023. p. 1-8.
Detail

APAROVICH, M.; KESIRAJU, S.; DUFKOVÁ, A.; SMRŽ, P. FIT BUT at SemEval-2023 Task 12: Sentiment Without Borders - Multilingual Domain Adaptation for Low-Resource Sentiment Classification. In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023). Toronto (online): Association for Computational Linguistics, 2023. p. 1518-1524. ISBN: 978-1-959429-99-9.
Detail

HELMKE, H.; KLEINERT, M.; AHRENHOLD, N.; EHR, H.; MÜHLHAUSEN, T.; PINSKA, E.; OHNEISER, O.; KLAMERT, L.; MOTLÍČEK, P.; PRASAD, A.; ZULUAGA-GOMEZ, J.; DOKIC, J. Automatic Speech Recognition and Understanding for Radar Label Maintenance Support Increases Safety and Reduces Air Traffic Controllers' Workload. Proceedings of ATM Seminar. Savannah, Georgia: EUROPEAN ORGANISATION FOR THE SAFETY OF AIR NAVIGATION, 2023. p. 1-11.
Detail

BAŘINA, D. Experimental lossless data compressor. Microprocessors and Microsystems, 2023, vol. 98, no. 4, p. 104803-104803. ISSN: 0141-9331.
Detail

CHLUBNA, T.; MILET, T.; ZEMČÍK, P.; KULA, M. Real-Time Light Field Video Focusing and GPU Accelerated Streaming. Journal of Signal Processing Systems for Signal Image and Video Technology, 2023, vol. 95, no. 6, p. 703-719. ISSN: 1939-8115.
Detail

TESAŘOVÁ, A.; HEROUT, A.; BAMBUŠEK, D.; JUŘÍK, V. How to shoot yourself right with a smartphone?. VIRTUAL REALITY, 2023, vol. 2023, no. 1, p. 1-13. ISSN: 1434-9957.
Detail

NOVÁK, J.; CHUDÝ, P. Surrogate Modeling of Optimal Control Based Collision Avoidance System for Multirotor Unmanned Aerial Vehicles. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023. p. 1-7. ISBN: 979-8-3503-3357-2. ISSN: 2155-7195.
Detail

HANÁK, J.; CHUDÝ, P.; VLK, J. Collaborative Agents for Synthetic Tactical Training. In AIAA/IEEE Digital Avionics Systems Conference - Proceedings. Barcelona: Institute of Electrical and Electronics Engineers, 2023. p. 1-9. ISBN: 979-8-3503-3357-2. ISSN: 2155-7195.
Detail

NOVÁK, J.; CHUDÝ, P. Dynamic Soaring in Uncertain Wind Conditions: Polynomial Chaos Expansion Approach. In Machine Learning, Optimization, and Data Science. Lecture Notes in Computer Science. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Grasmere: Springer Nature Switzerland AG, 2024. p. 104-115. ISBN: 978-3-031-53968-8. ISSN: 0302-9743.
Detail

CHLUBNA, T.; MILET, T.; ZEMČÍK, P. How Capturing Camera Trajectory Distortion Affects User Experience on Looking Glass 3D Display. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, vol. 2024, no. 83, p. 20265-20287. ISSN: 1573-7721.
Detail

YUSUF, B.; GOURAV, A.; GANDHE, A.; BULYKO, I. On-the-Fly Text Retrieval for end-to-end ASR Adaptation. In Proceedings of ICASSP 2023. Rhodes Island: IEEE Signal Processing Society, 2023. p. 1-5. ISBN: 978-1-7281-6327-7.
Detail

SKOWRON, M.; BACKFRIED, G.; NAVAS, E.; BERZINŠ, A.; VAN, J.; DE, F.; DEMARCO, A.; POLÁK, P.; KOVÁČ, M.; POLÁK, P.; ROHDIN, J.; ROSNER, M.; SANCHEZ, J.; SARATXAGA, I.; SCHWARZ, P. Deep Dive Speech Technology. In European Language Equality. Cham: Springer Nature Switzerland AG, 2023. p. 289-312. ISBN: 978-3-031-28819-7.
Detail

BOBÁK, P.; ČMOLÍK, L.; ČADÍK, M. Reinforced Labels: Multi-Agent Deep Reinforcement Learning for Point-Feature Label Placement. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, p. 1-14. ISSN: 1077-2626.
Detail

KIEFER, B.; BARTL, V.; ŠPAŇHEL, J.; HEROUT, A.; et al. 1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results. In IEEE Winter Applications and Computer Vision Workshops (WACVW). Winter Applications and Computer Vision Workshops. LOS ALAMITOS: IEEE COMPUTER SOC, 2023. p. 265-302. ISBN: 979-8-3503-2056-5. ISSN: 2690-621X.
Detail

ZULUAGA-GOMEZ, J.; PRASAD, A.; NIGMATULINA, I.; MOTLÍČEK, P.; KLEINERT, M.;. A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers. Aerospace, 2023, vol. 10, no. 5, p. 1-25. ISSN: 2226-4310.
Detail

GAVRIELIDES, A.; SOPHOCLEOUS, M.; AGAPIOU, G.; LESSI, C.; ŠPAŇHEL, J.; LENDINEZ, A.; QIU, R.; LI, D. Implementing Network Applications for 5G-Enabled Robots Through the 5G-ERA Platform. In IFIP Advances in Information and Communication Technology. IFIP Advances in Information and Communication Technology. Artificial Intelligence Applications and Innovations. Cham: Springer Nature Switzerland AG, 2023. p. 55-65. ISBN: 978-3-031-34170-0. ISSN: 1868-422X.
Detail

BOITO, M.; YUSUF, B.; ONDEL YANG, L.; VILLAVICENCIO, A.; BESACIER, L. Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings. In Proceedings of the the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages. Marseile: European Language Resources Association, 2022. p. 1-9. ISBN: 979-10-95546-91-7.
Detail

KHALIL, D.; PRASAD, A.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; NIGMATULINA, I.; MADIKERI, S.; SCHUEPBACH, C. An Automatic Speaker Clustering Pipeline for the Air Traffic Communication Domain. Aerospace, 2023, vol. 10, no. 10, p. 1-14. ISSN: 2226-4310.
Detail

NIGMATULINA, I.; MADIKERI, S.; VILLATORO-TELLO, E.; MOTLÍČEK, P.; ZULUAGA-GOMEZ, J.; PANDIA, K.; GANAPATHIRAJU, A. Implementing contextual biasing in GPU decoder for online ASR. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023. p. 4494-4498. ISSN: 1990-9772.
Detail

BURDISSO, S.; VILLATORO-TELLO, E.; MADIKERI, S.; MOTLÍČEK, P. Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023. p. 3617-3621. ISSN: 1990-9772.
Detail

MAI, F.; ZULUAGA-GOMEZ, J.; PARCOLLET, T.; MOTLÍČEK, P. HyperConformer: Multi-head HyperMixer for Efficient Speech Recognition. In Proceedings of the Annual Conference of International Speech Communication Association, INTERSPEECH. Proceedings of Interspeech. Dublin: International Speech Communication Association, 2023. p. 2213-2217. ISSN: 1990-9772.
Detail

VILLATORO-TELLO, E.; MADIKERI, S.; ZULUAGA-GOMEZ, J.; SHARMA, B.; SARFJOO, S.; NIGMATULINA, I.; MOTLÍČEK, P.; IVANOV, V.; GANAPATHIRAJU, A. Effectiveness of Text, Acoustic, and Lattice-Based Representations in Spoken Language Understanding Tasks. In ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Rhodes Island: IEEE Signal Processing Society, 2023. p. 1-5. ISBN: 978-1-7281-6327-7.
Detail

VANDERREYDT, G.; PRASAD, A.; KHALIL, D.; MADIKERI, S.; DEMUYNCK, K.; MOTLÍČEK, P. Parameter-Efficient Tuning With Adaptive Bottlenecks For Automatic Speech Recognition. Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). Taipei: IEEE Signal Processing Society, 2023. p. 1-7. ISBN: 979-8-3503-0689-7.
Detail

MOTLÍČEK, P.; PRASAD, A.; NIGMATULINA, I.; HELMKE, H.; OHNEISER, O.; KLEINERT, M. Automatic Speech Analysis Framework for ATC Communication in HAAWAII. Proceedings of the 13th SESAR Innovation Days. Seville: SESAR Joint Undertaking, 2023. p. 1-9.
Detail

KLÍMA, O.; NEUBAUER, J.; POLCEROVÁ, L.; KRÁLÍK, M.; ZEMAN, T.: KSPredict; KSPredict: Software pro predikci vývoje krizových situací a mimořádných událostí. https://github.com/ondrej-klima/shinyfireweather. URL: https://github.com/ondrej-klima/shinyfireweather. (software)
Detail

BAŘINA, D.: JPEG; Minimalist JPEG decoder & encoder. http://www.fit.vutbr.cz/research/prod/?id=814. URL: http://www.fit.vutbr.cz/research/prod/?id=814. (software)
Detail

BAŘINA, D.: collatz; Convergence verification of the Collatz problem. http://www.fit.vutbr.cz/research/prod/?id=828. URL: http://www.fit.vutbr.cz/research/prod/?id=828. (software)
Detail

BAŘINA, D.: x3; x3: Experimental Data Compressor. http://www.fit.vutbr.cz/research/prod/?id=827. URL: http://www.fit.vutbr.cz/research/prod/?id=827. (software)
Detail