General

Jiangyan Yi is a Master and Ph.D. Tutor. 

She is currently an Associate Professor in State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS), Institute of Automation, Chinese Academy of Sciences. 

Email: jiangyan.yi@nlpr.ia.ac.cn

Address: 95 Zhongguancun East Road, 100190, BEIJING, CHINA


Research Areas

Speech signal processing, speech recognition and synthesis, fake audio detection and transfer learning


Education

· University of Chinese Academy of Sciences, Pattern Recognition and Intelligent System, Ph.D. , 2015-2018

· Graduate School of Chinese Academy of Social Sciences, Computational Linguistics, Master, 2007-2010


Experience



Work Experience

· Institute of Automation, Chinese Academy of Sciences, NLPR, AssociateProfessor, 2020-

· Institute of Automation, Chinese Academy of Sciences, NLPR, Assistant Professor, 2018-2020

· Alibaba Group, Cloud Computing Department, Senior R&D Engineer, 2011-2014


Teaching Experience

· Speech Signal Processing, School of Artificial Intelligence, University of Chinese Academy of Sciences, 2021-2022

· Speech Interaction, School of Computer Science and TechnologyUniversity of Chinese Academy of Sciences, 2018-2019

Publications

   
Papers

Selected Jounal papers:

[1] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai: Language-Adversarial Transfer Learning for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 621-630 (2019)

[2] Jiangyan Yi, Zhengqi Wen, Jianhua Tao, Hao Ni, Bin Liu: CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition. J. Signal Process. Syst. 90(7): 985-997 (2018)

[3] Tao Wang, Jiangyan Yi*, Ruibo Fu, Jianhua Tao, Zhengqi Wen. CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing,IEEE ACM Trans. Audio Speech Lang. Process (2022)

[4] Cunhang Fan, Jiangyan Yi*, Jianhua Tao, Zhengkun Tian, Bin Liu, Zhengqi Wen: Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 198-209 (2021)

[5] Ye Bai, Jiangyan Yi*, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Shuai Zhang: Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1340-1351 (2021)

[6] Ye Bai, Jiangyan Yi*, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang: Fast End-to-End Speech Recognition Via Non-Autoregressive Models and Cross-Modal Knowledge Transferring From BERT. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1897-1911 (2021)

[7] Tao Wang, Ruibo Fu, Jiangyan Yi*, Jianhua Tao, Zhengqi Wen: NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 865-878 (2022)

[8] Cunhang Fan, Jianhua Tao, Bin Liu, Jiangyan Yi, Zhengqi Wen, Xuefei Liu: End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1303-1314 (2020)

[9] Zhengkun Tian, Jiangyan Yi*, Jianhua Tao, Shuai Zhang, Zhengqi Wen. Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition. IEEE Signal Processing Letters.27 (2022)

[10] Yibin Zheng, Jianhua Tao, Zhengqi Wen, Jiangyan Yi: Forward-Backward Decoding Sequence for Regularizing End-to-End TTS. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2067-2079 (2019)

[11] Ye Bai, Jiangyan Yi*, Jianhua Tao, Zhengqi Wen, Cunhang Fan: A Public Chinese Dataset for Language Model Adaptation. J. Signal Process. Syst. 92(8): 839-851 (2020)


  Selected Conference papers:

[1] Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li: ADD 2022: the First Audio Deep Synthesis Detection Challenge. ICASSP 2022

[2] Jiangyan Yi, Ye Bai, Jianhua Tao, Haoxin Ma, Zhengkun Tian, Chenglong Wang , Tao Wang, Ruibo Fu: Half-Truth: A Partially Fake Audio Detection Dataset. INTERSPEECH 2021

[3] Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Ye Bai, Cunhang Fan: Focal Loss for Punctuation Prediction. INTERSPEECH 2020: 721-725

[4] Jiangyan Yi, Jianhua Tao, Ye Bai: Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition. ICASSP 2019: 6071-6075

[5] Jiangyan Yi, Jianhua Tao: Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings. ICASSP 2019: 7270-7274

[6] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ye Bai: Adversarial Multilingual Training for Low-Resource Speech Recognition. ICASSP 2018: 4899-4903

[7] Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Ya Li: Distilling Knowledge from an Ensemble of Models for Punctuation Prediction. INTERSPEECH 2017: 2779-2783

[8] Tao Wang, Jiangyan Yi, Liqun Deng, Ruibo Fu,Jianhua Tao, Zhengqi Wen: Context-Aware Mask Prediction Network for End-to-End Text-Based Speech Editing. ICASSP 2022

[9] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen: Decoupling Pronunciation and Language for End-to-End Code-Switching Automatic Speech Recognition. ICASSP 2021: 6249-6253

[10] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen: FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. INTERSPEECH 2021

[11] Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Xuefei Liu, Zhengqi Wen: End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-switching Speech Recognition. INTERSPEECH 2021

[12] Haoxin Ma, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Chenglong Wang: Continual Learning for Fake Audio Detection. INTERSPEECH 2021

[13] Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen: Synchronous Transformers for end-to-end Speech Recognition. ICASSP 2020: 7884-7888

[14] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang: Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition. INTERSPEECH 2020: 3381-3385

[15] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Shuai Zhang, Zhengqi Wen: Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. INTERSPEECH 2020: 5026-5030

[16] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Zhengkun Tian, Chenghao Zhao, Cunhang Fan: A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. INTERSPEECH 2019: 2190-2194

[17] Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen: Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. INTERSPEECH 2019: 3795-3799

[18] Zhengkun Tian, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengqi Wen: Self-Attention Transducers for End-to-End Speech Recognition. INTERSPEECH 2019: 4395-4399

[19] Tao Wang, Ruibo Fu, Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Chunyu Qiang, Shiming Wang: Prosody and Voice Factorization for Few-Shot Speaker Adaptation in the Challenge M2voc 2021. ICASSP 2021: 8603-8607



Students

现指导学生

顾浩  硕士研究生  081104-模式识别与智能系统  

周俊佐  硕士研究生  085410-人工智能  

张一诺  硕士研究生  085400-电子信息  

马一鸣  硕士研究生  085400-电子信息  

廖琳  硕士研究生  085400-电子信息  

程振华  硕士研究生  085400-电子信息  

曾思丁  硕士研究生  085400-电子信息