Publications
Publications
2024
Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation
27th International Conference on Text, Speech, and Dialogue, TSD 2024 (Brno, CZE, 9. September 2024 - 13. September 2024)
In: Elmar Nöth, Aleš Horák, Petr Sojka (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2024
DOI: 10.1007/978-3-031-70566-3_14 , , , , , , :
Indoor Synthetic Data Generation: A Systematic Review
In: Computer Vision and Image Understanding 240 (2024), Article No.: 103907
ISSN: 1077-3142
DOI: 10.1016/j.cviu.2023.103907 , , , , :
Addressing challenges in speaker anonymization to maintain utility while ensuring privacy of pathological speech
In: Communications Medicine 4 (2024), Article No.: 182
ISSN: 2730-664X
DOI: 10.1038/s43856-024-00609-5 , , , , , , , , :
2023
Deep Learning in Surgical Workflow Analysis: A Review of Phase and Step Recognition
In: IEEE Journal of Biomedical and Health Informatics (2023), p. 1-14
ISSN: 2168-2194
DOI: 10.1109/JBHI.2023.3311628 , , , , , :
ADABase: A Multimodal Dataset for Cognitive Load Estimation
In: Sensors 23 (2023)
ISSN: 1424-8220
DOI: 10.3390/s23010340 , , , , , , :
Federated learning for secure development of AI models for Parkinson’s disease detection using speech from different languages
Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023)
In: Proceedings of INTERSPEECH 2023, Dublin, Ireland: 2023
DOI: 10.21437/Interspeech.2023-2108 , , , , , , :
The effect of speech pathology on automatic speaker verification: a large-scale study
In: Scientific Reports 13 (2023), p. 20476
ISSN: 2045-2322
DOI: 10.1038/s41598-023-47711-7
URL: https://www.nature.com/articles/s41598-023-47711-7 , , , , , :
Impact of Including Pathological Speech in Pre-training on Pathology Detection
TSD 2023: Text, Speech, and Dialogue (Pilsen, 4. September 2023 - 6. September 2023)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
DOI: 10.1007/978-3-031-40498-6_13 , , , , , , , , :
Impact of Including Pathological Speech in Pre-training on Pathology Detection
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031404979
DOI: 10.1007/978-3-031-40498-6_13 , , , , , , , , :
Multi-modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031476785
DOI: 10.1007/978-3-031-47679-2_3 , , , , , , , , , :
PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images
International Conference Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023) , , , , , :
Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video
International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare) (Hawaii Convention Center, 1801 Kalākaua Ave, Honolulu, HI 96815, United States, 29. July 2023 - 29. July 2023)
In: International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare) 2023 , , , , , , , :
Impact of Including Pathological Speech in Pre-Training on Pathology Detection
Text, Speech, and Dialogue. Satellite event of Interspeech 22023 (Pilsen, 4. September 2023 - 6. July 2023) , , :
2022
PoCaP Corpus: A Multimodal Dataset for Smart Operating Room Speech Assistant Using Interventional Radiology Workflow Analysis
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, 6. September 2022 - 9. September 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_38 , , , , , , , :
Autoblog 2021: The Importance of Language Models for Spontaneous Lecture Speech
25th International Conference on Text, Speech and Dialogue (Brno, Czech Republic, 6. September 2022 - 9. September 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Text, Speech, and Dialogue 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings, Springer Nature Switzerland AG: 2022
DOI: 10.1007/978-3-031-16270-1_24
URL: https://link.springer.com/chapter/10.1007/978-3-031-16270-1_24 , , , , :
Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Interspeech (Seoul, 18. September 2022 - 22. September 2022)
In: Proceedings of Interspeech 2022 2022
DOI: 10.21437/Interspeech.2022-10674
URL: https://www.isca-speech.org/archive/interspeech_2022/hernandez22_interspeech.html , , , , , :
Known operator learning and hybrid machine learning in medical imaging - A review of the past, the present, and the future
In: Progress in Biomedical Engineering 4 (2022), Article No.: 022002
ISSN: 2516-1091
DOI: 10.1088/2516-1091/ac5b13 , , , , :
Offer Proprietary Algorithms Still Protection of Intellectual Property in the Age of Machine Learning?: A Case Study Using Dual Energy CT Data
German Workshop on Medical Image Computing, 2022 (Heidelberg, DEU, 26. June 2022 - 28. June 2022)
In: Klaus Maier-Hein, Thomas M. Deserno, Heinz Handels, Andreas Maier, Christoph Palm, Thomas Tolxdorff (ed.): Informatik aktuell 2022
DOI: 10.1007/978-3-658-36932-3_70 , , , , :
SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks
OAGM Workshop 2021 (, 24. November 2021 - 25. November 2021)
In: Proceedings of the OAGM Workshop 2021. Computer Vision and Pattern Analysis Across Domains 2022
DOI: 10.3217/978-3-85125-869-1-10
URL: https://openlib.tugraz.at/download.php?id=621f329186973&location=browse , , , , :
Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
Proceedings of INTERSPEECH 2022 (Songdo) , , , , , :
2021
Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning
International Conference on Speech and Computer (Online, 27. September 2021 - 30. September 2021)
In: Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning 2021
DOI: 10.1007/978-3-030-87802-3_24
URL: https://link.springer.com/chapter/10.1007/978-3-030-87802-3_24 , :