publications

publications in reversed chronological order.

2024

  1. Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning
    Chung-Ming Chien, Andros Tjandra, Apoorv Vyas, and 3 more authors
    In Interspeech 2024
  2. On the Evaluation of Speech Foundation Models for Spoken Language Understanding
    Siddhant Arora, Ankita Pasad, Chung-Ming Chien, and 9 more authors
    In Findings of the Association for Computational Linguistics ACL 2024
  3. AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement
    Ju-Chieh Chou, Chung-Ming Chien, and Karen Livescu
    In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  4. What Do Self-Supervised Speech Models Know about Words?
    Ankita Pasad, Chung-Ming Chien, Shane Settle, and 1 more author
    Transactions of the Association for Computational Linguistics

2023

  1. Few-Shot Spoken Language Understanding via Joint Speech-Text Models
    Chung-Ming Chien, Mingjiamei Zhang, Ju-Chieh Chou, and 1 more author
    Best Student Paper Award
    In 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
  2. Toward Joint Language Modeling for Speech Units and Text
    Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, and 5 more authors
    In Findings of the Association for Computational Linguistics: EMNLP 2023

2022

  1. Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module
    Adam Gabryś, Goeric Huybrechts, Manuel Sam Ribeiro, and 6 more authors
    In ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

2021

  1. S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations
    Jheng-hao Lin, Yist Y. Lin, Chung-Ming Chien, and 1 more author
    In Proc. Interspeech 2021
  2. Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech
    Chung-Ming Chien, Jheng-Hao Lin, Chien-yu Huang, and 2 more authors
    In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  3. FragmentVC: Any-To-Any Voice Conversion by End-To-End Extracting and Fusing Fine-Grained Voice Fragments with Attention
    Chung-Ming Chien*, Yist Y. Lin*, Jheng-Hao Lin, and 2 more authors
    *equal contribution
    In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  4. Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis
    Chung-Ming Chien, and Hung-yi Lee
    In 2021 IEEE Spoken Language Technology Workshop (SLT)