Publications AITRICS' innovative research takes the lead in advancements in medical artificial intelligence. All AAAI ACL ACS Acute and Critical Care AISTATS arXiv BMJ Health & Care Informatics CHIL Computer Vision&Image Understanding Critical Care CVPR ECCV EMNLP ICASSP ICCV ICLR ICML IEEE IJCAI INTERSPEECH JCDD JMIR Journal Clinical Medicine MLHC NAACL NeurIPS SaTML Scientific Reports Sensors Title Content Search 6 ICASSP Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting ICASSP 2025 Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting Wooseok Han, Minki Kang, Changhun Kim, Eunho Yang Speaker-adaptive Text-to-Speech (TTS) synthesis has attracted considerable attention due to its broad range of applications, such as p... 5 ICASSP Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping ICASSP 2025 Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping Minki Kang, Wooseok Han, Eunho Yang Generating speech from a face image is crucial for developing virtual humans capable of interacting using their uniq... 4 ICASSP COMPACT AND DE-BIASED NEGATIVE INSTANCE EMBEDDING FOR MULTI-INSTANCE LEARNING ON WHOLE-SLIDE IMAGE CLASSIFICATION ICASSP 2024 COMPACT AND DE-BIASED NEGATIVE INSTANCE EMBEDDING FOR MULTI-INSTANCELEARNING ON WHOLE-SLIDE IMAGE CLASSIFICATION Joohyung Lee, Heejeong Nam, Kwanhyung Lee, Sangchul Hahn Whole-slide image (WSI) classification is a challenging task because 1) patches ... 3 ICASSP WeavSpeech: Data Augmentation Strategy For Automatic Speech Recognition Via Semantic-Aware Weaving ICASSP 2023 WeavSpeech: Data Augmentation Strategy For Automatic Speech Recognition Via Semantic-Aware Weaving Kyusung Seo, Joonhyung Park, Jaeyun Song and Eunho Yang A cut-and-paste type of data augmentation strategy has attracted considerable attention in the vision... 2 ICASSP Grad-StyleSpeech: Any-Speaker Adaptive Text-to-Speech Synthesis with Diffusion Models ICASSP 2023 Grad-StyleSpeech: Any-Speaker Adaptive Text-to-Speech Synthesis with Diffusion Models Minki Kang, Dongchan Min, Sung Ju Hwang There has been a significant progress in Text-To-Speech (TTS) synthesis technology in recent years, thanks to the advancement in ne... 1 ICASSP Mutually-Constrained Monotonic Multihead Attention for Online ASR ICASSP 2021 Mutually-Constrained Monotonic Multihead Attention for Online ASR Jaeyun Song, Hajin Shim, Eunho Yang Despite the feature of real-time decoding, Monotonic Multihead Attention (MMA) shows comparable performance to the state-of-the-art offline methods ... 1