Publications

2024

Hernandez A., Perez Toro PA., Arias-Vergara T., Vasquez-Correa JC., Yang SH., Orozco-Arroyave JR., Maier A.:
Anonymizing Dysarthric Speech: Investigating the Effects of Voice Conversion on Pathological Information Preservation
27th International Conference on Text, Speech, and Dialogue, TSD 2024 (Brno, CZE, 9. September 2024 - 13. September 2024)
In: Elmar Nöth, Aleš Horák, Petr Sojka (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2024
DOI: 10.1007/978-3-031-70566-3_14
Schieber H., Demir K., Kleinbeck C., Yang SH., Roth D.:
Indoor Synthetic Data Generation: A Systematic Review
In: Computer Vision and Image Understanding 240 (2024), Article No.: 103907
ISSN: 1077-3142
DOI: 10.1016/j.cviu.2023.103907
Tayebi Arasteh S., Arias-Vergara T., Perez Toro PA., Weise T., Packhäuser K., Schuster M., Nöth E., Maier A., Yang SH.:
Addressing challenges in speaker anonymization to maintain utility while ensuring privacy of pathological speech
In: Communications Medicine 4 (2024), Article No.: 182
ISSN: 2730-664X
DOI: 10.1038/s43856-024-00609-5

2023

Demir K., Schieber H., Weise T., May M., Maier A., Yang SH.:
Deep Learning in Surgical Workflow Analysis: A Review of Phase and Step Recognition
In: IEEE Journal of Biomedical and Health Informatics (2023), p. 1-14
ISSN: 2168-2194
DOI: 10.1109/JBHI.2023.3311628
Oppelt MP., Foltyn A., Deuschel J., Lang NR., Holzer N., Eskofier B., Yang SH.:
ADABase: A Multimodal Dataset for Cognitive Load Estimation
In: Sensors 23 (2023)
ISSN: 1424-8220
DOI: 10.3390/s23010340
Tayebi Arasteh S., Rios-Urrego CD., Nöth E., Maier A., Yang SH., Rusz J., Orozco-Arroyave JR.:
Federated learning for secure development of AI models for Parkinson’s disease detection using speech from different languages
Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023)
In: Proceedings of INTERSPEECH 2023, Dublin, Ireland: 2023
DOI: 10.21437/Interspeech.2023-2108
Tayebi Arasteh S., Weise T., Schuster M., Nöth E., Maier A., Yang SH.:
The effect of speech pathology on automatic speaker verification: a large-scale study
In: Scientific Reports 13 (2023), p. 20476
ISSN: 2045-2322
DOI: 10.1038/s41598-023-47711-7
URL: https://www.nature.com/articles/s41598-023-47711-7
Utz J., Weise T., Schlereth M., Wagner F., Thies M., Gu M., Uderhardt S., Breininger K.:
Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN
2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 (Paris, 2. October 2023 - 6. October 2023)
In: Proceedings - 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 2023
DOI: 10.1109/ICCVW60793.2023.00417
Weise T., Maier A., Demir K., Perez Toro PA., Arias-Vergara T., Heismann B., Nöth E., Schuster M., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
TSD 2023: Text, Speech, and Dialogue (Pilsen, 4. September 2023 - 6. September 2023)
In: Kamil Ekštein, František Pártl, Miloslav Konopík (ed.): Text, Speech, and Dialogue, Cham: 2023
DOI: 10.1007/978-3-031-40498-6_13
Weise T., Maier A., Demir K., Perez Toro PA., Arias-Vergara T., Heismann B., Nöth E., Schuster ME., Yang SH.:
Impact of Including Pathological Speech in Pre-training on Pathology Detection
Springer Science and Business Media Deutschland GmbH, 2023
ISBN: 9783031404979
DOI: 10.1007/978-3-031-40498-6_13
Weise T., Perez Toro PA., Deitermann A., Hoffmann B., Demir K., Straetz T., Nöth E., Maier A., Kallert T., Yang SH.:
Multi-Modal Biomarker Extraction Framework for Therapy Monitoring of Social Anxiety and Depression Using Audio and Video
International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare) (Hawaii Convention Center, 1801 Kalākaua Ave, Honolulu, HI 96815, United States, 29. July 2023 - 29. July 2023)
In: Andreas K. Maier, Julia A. Schnabel, Pallavi Tiwari, Oliver Stegle (ed.): International Conference on Machine Learning (Workshop on Machine Learning for Multimodal Healthcare), Cham: 2023
DOI: 10.1007/978-3-031-47679-2_3
Yang SH., Demir K., Weise T., Schmid A., May M., Maier A.:
PoCaPNet: A Novel Approach for Surgical Phase Recognition Using Speech and X-Ray Images
International Conference Interspeech 2023 (Dublin, 21. August 2023 - 24. August 2023)
Yang SH., Weise T., Demir K.:
Impact of Including Pathological Speech in Pre-Training on Pathology Detection
Text, Speech, and Dialogue. Satellite event of Interspeech 2023 (Pilsen, 4. September 2023 - 6. September 2023)

2022

Demir K., May M., Schmid A., Uder M., Breininger K., Weise T., Maier A., Yang SH.:
PoCaP Corpus: A Multimodal Dataset for Smart Operating Room Speech Assistant Using Interventional Radiology Workflow Analysis
25th International Conference on Text, Speech, and Dialogue, TSD 2022 (Brno, 6. September 2022 - 9. September 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 2022
DOI: 10.1007/978-3-031-16270-1_38
Hernandez A., Klumpp P., Das BK., Maier A., Yang SH.:
Autoblog 2021: The Importance of Language Models for Spontaneous Lecture Speech
25th International Conference on Text, Speech and Dialogue (Brno, Czech Republic, 6. September 2022 - 9. September 2022)
In: Petr Sojka, Aleš Horák, Ivan Kopeček, Karel Pala (ed.): Text, Speech, and Dialogue 25th International Conference, TSD 2022, Brno, Czech Republic, September 6–9, 2022, Proceedings, Springer Nature Switzerland AG: 2022
DOI: 10.1007/978-3-031-16270-1_24
URL: https://link.springer.com/chapter/10.1007/978-3-031-16270-1_24
Hernandez A., Perez Toro PA., Nöth E., Orozco-Arroyave JR., Maier A., Yang SH.:
Cross-lingual Self-Supervised Speech Representations for Improved Dysarthric Speech Recognition
Interspeech (Seoul, 18. September 2022 - 22. September 2022)
In: Proceedings of Interspeech 2022 2022
DOI: 10.21437/Interspeech.2022-10674
URL: https://www.isca-speech.org/archive/interspeech_2022/hernandez22_interspeech.html
Maier A., Köstler H., Heisig M., Krauß P., Yang SH.:
Known operator learning and hybrid machine learning in medical imaging - A review of the past, the present, and the future
In: Progress in Biomedical Engineering 4 (2022), Article No.: 022002
ISSN: 2516-1091
DOI: 10.1088/2516-1091/ac5b13
Maier A., Yang SH., Maleki F., Muthukrishnan N., Forghani R.:
Offer Proprietary Algorithms Still Protection of Intellectual Property in the Age of Machine Learning?: A Case Study Using Dual Energy CT Data
German Workshop on Medical Image Computing, 2022 (Heidelberg, DEU, 26. June 2022 - 28. June 2022)
In: Klaus Maier-Hein, Thomas M. Deserno, Heinz Handels, Andreas Maier, Christoph Palm, Thomas Tolxdorff (ed.): Informatik aktuell 2022
DOI: 10.1007/978-3-658-36932-3_70
Sindel A., Hernandez A., Yang SH., Christlein V., Maier A.:
SliTraNet: Automatic Detection of Slide Transitions in Lecture Videos using Convolutional Neural Networks
OAGM Workshop 2021 (24. November 2021 - 25. November 2021)
In: Proceedings of the OAGM Workshop 2021. Computer Vision and Pattern Analysis Across Domains 2022
DOI: 10.3217/978-3-85125-869-1-10
URL: https://openlib.tugraz.at/download.php?id=621f329186973&location=browse
Weise T., Maier A., Nöth E., Heismann B., Schuster M., Yang SH.:
Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment
Proceedings of INTERSPEECH 2022 (Songdo)

2021

Hernandez A., Yang SH.:
Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning
International Conference on Speech and Computer (Online, 27. September 2021 - 30. September 2021)
In: Speech and Computer 2021
DOI: 10.1007/978-3-030-87802-3_24
URL: https://link.springer.com/chapter/10.1007/978-3-030-87802-3_24