基本信息

程高峰 男 中国科学院声学研究所
电子邮件: chenggaofeng@hccl.ioa.ac.cn
通信地址: No. 21 North 4th Ring Road, Haidian Dist
邮政编码: 100190
电子邮件: chenggaofeng@hccl.ioa.ac.cn
通信地址: No. 21 North 4th Ring Road, Haidian Dist
邮政编码: 100190
招生信息
招生专业
081002-信号与信息处理
招生方向
语音信号处理,语音识别
教育背景
2014-09--2019-06 中国科学院大学 工学博士2010-09--2014-06 北京邮电大学 理学学士
工作经历
工作简历
2023-05~现在, 中国科学院声学研究所, 副研究员2021-11~2023-05,中国科学院声学研究所, 助理研究员2019-07~2021-11,中国科学院声学研究所, 特别研究助理
社会兼职
2023-10-19-2025-10-19,《声学学报》青年编委, 青年编委
专利与奖励
奖励信息
(1) 中国电子学会科学技术奖, 二等奖, 省级, 2023(2) 北京市科学技术奖, 二等奖, 省级, 2019
专利成果
( 1 ) 一种基于私有参数的语音识别联邦学习方法和系统, 发明专利, 2022, 第 1 作者, 专利号: CN114783425A( 2 ) 一种语音识别模型的个性化联邦学习方法和系统, 发明专利, 2022, 第 2 作者, 专利号: CN114783443A( 3 ) 一种多领域自适应的端到端语音识别方法、系统及电子装置, 发明专利, 2021, 第 1 作者, 专利号: CN113436616A( 4 ) 一种语音识别解码的方法及装置, 发明专利, 2021, 第 1 作者, 专利号: CN113436619A( 5 ) 一种语音关键词检索方法、系统和电子装置, 发明专利, 2021, 第 1 作者, 专利号: CN113192535A( 6 ) 联结主义时间分类和截断式注意力联合在线语音识别技术, 发明专利, 2020, 第 3 作者, 专利号: CN111179918A( 7 ) 一种在线端对端语音转写方法及系统, 发明专利, 2020, 第 3 作者, 专利号: CN111128191A( 8 ) 一种基于窗口输入的双向回馈神经网络的语音识别方法, 发明专利, 2020, 第 2 作者, 专利号: CN111091817A( 9 ) 一种基于混合声学模型的语音识别系统及方法, 专利授权, 2019, 第 2 作者, 专利号: CN109754790A( 10 ) 一种基于无网格最大互信息准则的神经网络训练加速方法, 发明专利, 2018, 第 3 作者, 专利号: CN108629412A
出版信息
发表论文
[1] 高长丰, 程高峰, 张鹏远. 面向鲁棒自动语音识别的一致性自监督学习方法. 声学学报[J]. 2023, 第 2 作者48(3): 578-587, http://lib.cqvip.com/Qikan/Article/Detail?id=7109696359.[2] Deng, Keqi, 程高峰, Yang, Runyan, Yan, Yonghong. Alleviating ASR Long-Tailed Problem by Decoupling the Learning of Representation and Classification. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING[J]. 2022, 第 2 作者 通讯作者 30: 340-354, http://dx.doi.org/10.1109/TASLP.2021.3138707.[3] 程高峰, 颜永红. 多语言语音识别声学模型建模方法最新进展. 计算机科学[J]. 2022, 第 1 作者49(1): 47-52, http://lib.cqvip.com/Qikan/Article/Detail?id=7106430421.[4] 杨润延, 程高峰, 刘建. 基于端到端语音识别的关键词检索技术研究. 计算机科学[J]. 2022, 第 2 作者49(1): 53-58, http://lib.cqvip.com/Qikan/Article/Detail?id=7106430422.[5] 程高峰, Miao, Haoran, Yang, Runyan, Deng, Keqi, Yan, Yonghong. ETEH: Unified Attention-Based End-to-End ASR and KWS Architecture. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING[J]. 2022, 第 1 作者 通讯作者 30: 1360-1373, http://dx.doi.org/10.1109/TASLP.2022.3161159.[6] 颜永红, 程高峰. 语言声学智能化的思考与探索. 中国科学:物理学、力学、天文学[J]. 2022, 第 2 作者52(4): 58-67, http://lib.cqvip.com/Qikan/Article/Detail?id=7107037464.[7] Yang, Runyan, 程高峰, Zhang, Pengyuan, Yan, Yonghong. An E2E-ASR-Based Iteratively-Trained Timestamp Estimator. IEEE SIGNAL PROCESSING LETTERS[J]. 2022, 第 2 作者29: 1654-1658, http://dx.doi.org/10.1109/LSP.2022.3190793.[8] Gao, Changfeng, 程高峰, Li, Ta, Zhang, Pengyuan, Yan, Yonghong. Self-Supervised Pre-Training for Attention-Based Encoder-Decoder ASR Model. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING[J]. 2022, 第 2 作者 通讯作者 30: 1763-1774, http://dx.doi.org/10.1109/TASLP.2022.3171967.[9] IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING. 2022, 第 1 作者[10] IEEE SIGNAL PROCESSING LETTERS. 2022, 第 2 作者[11] Deng, Keqi, 程高峰, Miao, Haoran, Zhang, Pengyuan, Yan, Yonghong, IEEE. History Utterance Embedding Transformer LM for Speech Recognition. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021)[J]. 2021, 第 2 作者 通讯作者 5914-5918, [12] 杨润延, 程高峰, 缪浩然, 黎塔, 张鹏远, 颜永红. Keyword search using attention-based end-to-end ASR and framesynchronous phoneme alignments. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP)[J]. 2021, 第 2 作者 通讯作者 [13] Gao, Changfeng, 程高峰, Yang, Runyan, Zhu, Han, Zhang, Pengyuan, Yan, Yonghong, IEEE. Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Text Data. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2021, 第 2 作者 通讯作者 6543-6547, [14] IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP). 2021, 通讯作者 [15] 缪浩然, 程高峰, 张鹏远. Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. PROC. INTERSPEECH 2019. 2019, 第 2 作者1(1): [16] 程高峰, 李鑫, 颜永红. 利用高速通道连接的长短时记忆循环神经网络语音识别. CHINESE JOURNAL OF ELECTRONICS[J]. 2019, 第 1 作者28(1): 107-112, [17] Cheng, Gaofeng, Zhang, Pengyuan, Xu, Ji. Automatic Speech Recognition System with Output-Gate Projected Gated Recurrent Unit. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS[J]. 2019, E102D(2): 355-363, http://dx.doi.org/10.1587/transinf.2018EDP7155.[18] Cheng Gaofeng, Li Xin, Yan Yonghong. Using Highway Connections to Enable Deep Small-footprint LSTM-RNNs for Speech Recognition. CHINESE JOURNAL OF ELECTRONICS[J]. 2019, 28(1): 107-112, http://sciencechina.cn/gw.jsp?action=detail.jsp&internal_id=6459030&detailType=1.[19] CHENG Gaofeng, LI Xin, YAN Yonghong. Using Highway Connections to Enable Deep Small-footprint LSTM-RNNs for Speech Recognition. 电子学报:英文版[J]. 2019, 28(1): 107-112, http://lib.cqvip.com/Qikan/Article/Detail?id=6100201213.[20] Huang, Lu, Cheng, Gaofeng, Zhang, Pengyuan, Yang, Yi, Xu, Shumin, Sun, Jiasong, IEEE. Utterance-level Permutation Invariant Training with Latency-controlled BLSTM for Single-channel Multi-talker Speech Separation. 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC). 2019, 1256-1261, http://dx.doi.org/10.1109/apsipaasc47483.2019.9023163.[21] Povey, Daniel, Cheng, Gaofeng, Wang, Yiming, Li, Ke, Xu, Hainan, Yarmohamadi, Mahsa, Khudanpur, Sanjeev, Int Speech Commun Assoc. Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6. 2018, 3743-3747, [22] Li Wenjie, Cheng Gaofeng, Ge Fengpei, Zhang Pengyuan, Yan Yonghong, Int Speech Commun Assoc. Investigation on the combination of batch normalization and dropout in BLSTM-based acoustic modeling for ASR. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6. 2018, 2888-2892, [23] Cheng Gaofeng, Huang Lu, Sun Jiasong, Yan Yonghong, IEEE. Bidirectional LSTM with Extended Input Context. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP). 2018, 364-368, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000469313700074.[24] Cheng Gaofeng, Povey Daniel, Huang Lu, Xu Ji, Khudanpur Sanjeev, Yan Yonghong, Int Speech Commun Assoc. Output-Gate Projected Gated Recurrent Unit for Speech Recognition. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6. 2018, 1793-1797, [25] Cheng Gaofeng, Peddinti Vijayaditya, Povey Daniel, Manohar Vimal, Khudanpur Sanjeev, Yan Yonghong, Int Speech Commun Assoc. An exploration of dropout with LSTMs. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6. 2017, 1586-1590,
科研活动
科研项目
( 1 ) 基于XXXX的智能水下XX识别技术研究, 负责人, 国家任务, 2023-09--2025-12( 2 ) 可持续无监督学习的音频目标分类方法探索, 负责人, 研究所自主部署, 2021-01--2023-12( 3 ) 智能语音演示系统, 负责人, 境内委托项目, 2019-12--2025-01
参与会议
(1)基于基础大模型的水声成像目标检测与分割技术研究 2024-04-14