基本信息

窦文生,研究员,中国科学院软件研究所
电子邮件: wensheng AT iscas DOT ac DOT cn
通信地址: 北京市中关村南四街4号5号楼6层
邮政编码: 100190

个人主页https://wsdou.github.io/

研究领域

我对基础软件的质量保障技术有着广泛的兴趣,基于软件分析、软件测试、形式化方法、大模型等的技术,构建面向基础软件的质量保障平台,检测、诊断基础软件中存在的多类质量问题,为构建高可靠、高安全的基础软件提供有效支撑。我的研究对象包括数据库系统(关系型数据库系统、图数据库系统、向量数据库系统等)、分布式系统(分布式共识协议、分布式数据库系统等)、电子表格系统等

招生信息

   
招生专业
083500-软件工程
081202-计算机软件与理论
招生方向
网络分布式计算理论与技术
智能软件工程
人工智能

教育背景

2010-09--2015-06   中国科学院软件研究所   博士
2006-09--2009-06   中国科学院软件研究所   硕士
2002-09--2006-06   武汉大学   本科

工作经历

   
工作简历
2022-10~现在, 中国科学院软件研究所, 研究员
2017-10~2018-04,微软亚洲研究院, 铸星计划访问学者
2016-08~2022-10,中国科学院软件研究所, 副研究员
2015-07~2016-07,中国科学院软件研究所, 助理研究员
2013-12~2014-08,美国俄亥俄州立大学, 访问学者
2013-02~2013-07,香港科技大学, 访问学者
2009-07~2015-06,中国科学院软件研究所, 助理研究员

专利与奖励

   
奖励信息
(1) 中创软件人才奖, , 国家级, 2024
(2) 《软件学报》2022年高影响力论文, , 其他, 2023
(3) 中国计算机学会NASAC青年软件创新奖, , 专项, 2023
(4) 中国科学院青年创新促进会优秀会员, , 院级, 2022
(5) 《软件学报》2021年高影响力论文, , 其他, 2022
(6) Microsoft Research Asia Increasing Productivity Award, 其他, 2019
(7) 中国科学院青年创新促进会会员, , 院级, 2018
(8) ESEC/FSE 2018杰出论文奖, 其他, 2018
(9) 中科院软件所杰出青年, 研究所(学校), 2018
(10) 微软亚洲研究院铸星计划学者, , 其他, 2017
(11) 中国计算机学会优秀博士学位论文奖, , 专项, 2016
(12) 中国科学院大学优秀毕业生, 研究所(学校), 2015

发表论文

  • Proving Cypher Query Equivalence
    Lei Tang, Wensheng Dou, Yingying Zheng, Lijie Xu, Wei Wang, Jun Wei, Tao Huang
    41st International Conference on Data Engineering (ICDE 2025, CCF A)
  • Detecting Schema-Related Logic Bugs in Relational DBMSs via Equivalent Database Construction
    Jiansen Song, Wensheng Dou, Yingying Zheng, Yu Gao, Ziyu Cui, Wei Wang, Jun Wei
    51st International Conference on Very Large Data Bases (VLDB 2025, CCF A)
  • Evaluating Garbage Collection Performance Across Managed Language Runtimes
    Yicheng Wang, Wensheng Dou, Yu Liang, Yi Wang, Wei Wang, Jun Wei, Tao Huang
    47th IEEE/ACM International Conference on Software Engineering (ICSE 2025, CCF A)
  • BridgeGC: An Efficient Cross-Level Garbage Collector for Big Data Frameworks
    Yicheng Wang, Lijie Xu, Tian Guo, Wensheng Dou, Hongbin Zeng, Wei Wang, Jun Wei, Tao Huang
    ACM Transactions on Architecture and Code Optimization (TACO 2025, CCF A)
  • PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing
    Yuwei Zhang, Zhi Jin, Ying Xing, Ge Li, Fang Liu, Jiaxin Zhu, Wensheng Dou, Jun Wei
    ACM Transactions on Software Engineering and Methodology (TOSEM 2025, CCF A)
  • Training Deep Neural Networks with Virtual Smoothing Classes
    Zhiyang Zhou, Siwei Wei, Xudong Zhang, Wensheng Dou, Muzi Qu, Yan Cai
    39th AAAI Conference on Artificial Intelligence (AAAI 2025, CCF A)
  • Testing Gremlin-Based Graph Database Systems via Query Disassembling
    Yingying Zheng, Wensheng Dou, Lei Tang, Ziyu Cui, Yu Gao, Jiansen Song, Liang Xu, Jiaxin Zhu, Wei Wang, Jun Wei, Hua Zhong, Tao Huang
    33rd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2024, CCF A)
  • Detecting Metadata-Related Logic Bugs in Database Systems via Raw Database Construction
    Jiansen Song, Wensheng Dou, Yu Gao, Ziyu Cui, Yingying Zheng, Dong Wang, Wei Wang, Jun Wei, Tao Huang
    50th International Conference on Very Large Data Bases (VLDB 2024, CCF A)
  • Understanding Transaction Bugs in Database Systems
    Ziyu Cui, Wensheng Dou, Yu Gao, Dong Wang, Jiansen Song, Yingying Zheng, Tao Wang, Rui Yang, Kang Xu, Yixin Hu, Jun Wei, Tao Huang
    46th IEEE/ACM International Conference on Software Engineering (ICSE 2024, CCF A)
  • FaultFuzz: A Coverage Guided Fault Injection Tool for Distributed Systems
    Wenhan Feng, Qiugen Pei, Yu Gao, Dong, Wang, Wensheng Dou, Jun Wei, Zheheng Liang, Zhenyue Long
    46th IEEE/ACM International Conference on Software Engineering (ICSE Demo 2024)
  • Differential Optimization Testing of Gremlin-Based Graph Database Systems
    Yingying Zheng, Wensheng Dou, Lei Tang, Ziyu Cui, Jiansen Song, Ziyue Cheng, Wei Wang, Jun Wei, Hua Zhong, Tao Huang
    17th IEEE International Conference on Software Testing, Verification and Validation (ICST 2024)
  • DiTing: A Semi-Supervised Adversarial Training Framework for Robust Out-of-Distribution Detection (谛听:一种面向鲁棒分布外样本检测的半监督对抗训练方法)
    Zhiyang Zhou, Wensheng Dou, Shuo Li, Liangyi Kang, Shuai Wang, Jie Liu, Dan Ye
    Journal of Software (软件学报), 2024
  • Testing Raft Systems Based on TLA+ Formal Specification (基于TLA+形式化规约的Raft系统测试)
    Dong Wang, Wensheng Dou, Yu Gao, Chenao Wu, Jun Wei, Tao Huang
    Journal of Software (软件学报), 2024
  • Model Checking Guided Testing for Distributed Systems
    Dong Wang, Wensheng Dou, Yu Gao, Chenao Wu, Jun Wei, Tao Huang
    18th European Conference on Computer Systems (EuroSys 2023, CCF A)
  • Coverage Guided Fault Injection for Cloud Systems
    Yu Gao, Wensheng Dou, Dong Wang, Wenhan Feng, Jun Wei, Hua Zhong, Tao Huang
    45th IEEE/ACM International Conference on Software Engineering (ICSE 2023, CCF A)
  • Detecting Isolation Bugs via Transaction Oracle Construction
    Wensheng Dou, Ziyu Cui, Qianwang Dai, Jiansen Song, Dong Wang, Yu Gao, Wei Wang, Jun Wei, Lei Chen, Hanmo Wang, Hua Zhong, Tao Huang
    45th IEEE/ACM International Conference on Software Engineering (ICSE 2023, CCF A)
  • Testing Database Systems via Differential Query Execution
    Jiansen Song, Wensheng Dou, Ziyu Cui, Qianwang Dai, Wei Wang, Jun Wei, Hua Zhong, Tao Huang
    45th IEEE/ACM International Conference on Software Engineering (ICSE 2023, CCF A)
  • Randomized Differential Testing of RDF Stores
    Rui Yang, Yingying Zheng, Lei Tang, Wensheng Dou, Wei Wang, Jun Wei
    45th IEEE/ACM International Conference on Software Engineering (ICSE Demo 2023)
  • Fixing Robust Out-of-Distribution Detection for Deep Neural Networks
    Zhiyang Zhou, Jie Liu, Wensheng Dou, Shuo Li, Liangyi Kang, Muzi Qu, Dan Ye
    34th IEEE International Symposium on Software Reliability Engineering (ISSRE 2023)
  • Detecting Flash Loan Based Price Manipulation Attacks in Ethereum
    Qing Xia, Zhirong Huang, Wensheng Dou, Yafeng Zhang, Fengjun Zhang, Geng Liang, Chun Zuo
    43rd IEEE International Conference on Distributed Computing Systems (ICDCS 2023)
  • Self-Supervised Log Parsing Using Semantic Contribution Difference
    Siyu Yu, Ningjiang Chen, Yifan Wu, Wensheng Dou
    Journal of Systems and Software (JSS 2023)
  • Detecting Crash Recovery Bugs in Cloud Systems via Common Data Analysis (共用数据导向的分布式系统失效恢复缺陷检测) 
    Yu Gao, Dong Wang, Qianwang Dai, Wensheng Dou, Jun Wei
    Journal of Software (软件学报), 2023
  • Differentially Testing Database Transactions for Fun and Profit
    Ziyu Cui, Wensheng Dou, Qianwang Dai, Jiansen Song, Wei Wang, Jun Wei, Dan Ye
    37th IEEE/ACM International Conference on Automated Software Engineering (ASE 2022, CCF A)
  • Finding Bugs in Gremlin-Based Graph Database Systems via Randomized Differential Testing
    Yingying Zheng, Wensheng Dou, Yicheng Wang, Zheng Qin, Lei Tang, Yu Gao, Dong Wang, Wei Wang, Jun Wei
    31st ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2022, CCF A
  • Understanding Device Integration Bugs in Smart Home System
    Tao Wang, Kangkang Zhang, Wei Chen, Wensheng Dou, Jiaxin Zhu, Jun Wei, Tao Huang
    31st ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2022, CCF A)
  • DisTA: Generic Dynamic Taint Tracking for Java-Based Distributed Systems 
    Dong Wang, Yu Gao, Wensheng Dou, Jun Wei
    52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2022, CCF B)
  • Common Data Guided Crash Injection for Cloud Systems
    Yu Gao, Dong Wang, Qianwang Dai, Wensheng Dou, Jun Wei
    44th ACM/IEEE International Conference on Software Engineering (ICSE Demo 2022)
  • Characterizing and Detecting Bugs in WeChat Mini-Programs
    Tao Wang, Qingxin Xu, Xiaoning Chang, Wensheng Dou, Jiaxin Zhu, Jinhui Xie, Yuetang Deng, Jianbo Yang, Jiaheng Yang, Jun Wei, Tao Huang
    44th ACM/IEEE International Conference on Software Engineering (ICSE 2022, CCF A).
  • Knowledge-Based Environment Dependency Inference for Python Programs
    Hongjie Ye, Wei Chen, Wensheng Dou, Guoquan Wu, Jun Wei
    44th ACM/IEEE International Conference on Software Engineering (ICSE 2022, CCF A).
  • Race Detection for Event-Driven Node.js Applications
    Xiaocning Chang, Wensheng Dou, Jun Wei, Tao Huang, Jinhui Xie, Yuetang Deng, Jianbo Yang, Jiaheng Yang
    36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021, CCF A).
  • Systemizing Interprocedural Static Analysis of Large-Scale Systems Code with Graspan
    Zhiqiang Zuo, Kai Wang, Aftab Hussain, Ardalan Amiri Sani, Yiyu Zhang, Shenming Lu, Wensheng Dou, Linzhang Wang, Xuandong Li, Chenxi Wang, Guoqing Harry Xu
    ACM Transactions on Computer Systems (TOCS 2021, CCF A).
  • Semantic Table Structure Identification in Spreadsheets
    Yakun Zhang, Xiao Lv, Haoyu Dong, Wensheng Dou, Shi Han, Dongmei Zhang, Jun Wei, Dan Ye
    30th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2021, CCF A).
  • The Performance of Selfish Mining in GHOST
    Qing Xia, Wensheng Dou, Fengjun Zhang, Geng Liang
    20th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2021).
  • The Impact Analysis of Multiple Miners and Propagation Delay on Selfish Mining
    Qing Xia, Wensheng Dou, Tong Xi, Jing Zeng, Fengjun Zhang, Jun Wei, Geng Liang
    45th IEEE Computers, Software, and Applications Conference (COMPSAC 2021).
  • DeepCon: Contribution Coverage Testing for Deep Learning Systems
    Zhiyang Zhou, Wensheng Dou, Jie Liu, Chenxin Zhang, Jun Wei, Dan Ye
    28th International Conference on Software Analysis, Evolution, and Reengineering (SANER 2021, CCF B).
  • Survey on Blockchain Consensus Protocol (区块链共识协议综述)
    Qing Xia, Wensheng Dou, Kaiwen Guo, Geng Liang, Chun Zuo, Fengjun Zhang
    Journal of Software (软件学报), 2021.
    《软件学报》高影响力论文奖
  • CoFI: Consistency-Guided Fault Injection for Cloud Systems
    Haicheng Chen, Wensheng Dou, Dong Wang, Feng Qin
    35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020, CCF A).
  • Learning to Detect Table Clones in Spreadsheets
    Yakun Zhang, Wensheng Dou, Jiaxin Zhu, Liang Xu, Zhiyong Zhou, Jun Wei, Dan Ye, Bo Yang
    29th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2020, CCF A).
  • Detecting Cache-Related Bugs in Spark Applications
    Hui Li, Dong Wang, Tianze Huang, Yu Gao, Wensheng Dou, Lijie Xu, Wei Wang, Jun Wei, Hua Zhong
    29th ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA 2020, CCF A)
  • DistStream: An Order-Aware Distributed Framework for Online-Offline Stream Clustering Algorithms
    Lijie Xu, Xingtong Ye, Kai Kang, Tian Guo, Wensheng Dou, Wei Wang, Jun Wei
    40th IEEE International Conference on Distributed Computing Systems (ICDCS 2020, CCF B).
  • Understanding Exception-Related Bugs in Large-Scale Cloud Systems 
    Haicheng Chen, Wensheng Dou, Yanyan Jiang, Feng Qin 
    34th IEEE/ACM International Conference on Automated Software Engineering (ASE 2019, CCF A).
  • An Experimental Evaluation of Garbage Collectors on Big Data Applications 
    Lijie Xu, Tian Guo, Wensheng Dou, Wei Wang, Jun Wei 
    45th International Conference on Very Large Data Bases (VLDB 2019, CCF A).
  • Detecting Atomicity Violations for Event-Driven Node.js Applications 
    Xiaoning Chang, Wensheng Dou, Yu Gao, Jie Wang, Jun Wei, Tao Huang 
    41st ACM/IEEE International Conference on Software Engineering (ICSE 2019, CCF A).
  • Recent Progress in Program Analysis (程序分析研究进展)
    Jian Zhang, Chao Zhang, Jifeng Xuan, Yingfei Xiong, Qianxiang Wang, Bin Liang, Lian Li, Wensheng Dou, Zhenbang Chen, Liqian Chen, Yan Cai
    Journal of Software (软件学报), 2019.
    《软件学报》高影响力论文奖
  • An Empirical Study on Crash Recovery Bugs in Large-Scale Distributed Systems
    Yu Gao, Wensheng Dou, Feng Qin, Chushu Gao, Dong Wang, Jun Wei, Ruirui Huang, Li Zhou, Yongming Wu 
    26th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018, CCF A). 

    ACM SIGSOFT Distinguished Paper Award
  • How Are Spreadsheet Templates Used in Practice: A Case Study on Enron
    Liang Xu, Wensheng Dou, Jiaxin Zhu, Chushu Gao, Jun Wei, Tao Huang 
    26th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018 NIER).
  • Expandable Group Identification in Spreadsheets
    Wensheng Dou, Shi Han, Liang Xu, Dongmei Zhang, Jun Wei 
    33rd IEEE/ACM International Conference on Automated Software Engineering (ASE 2018, CCF A).
  • Context-Based Event Trace Reduction in Client-Side JavaScript Applications 
    Jie Wang, Wensheng Dou, Chushu Gao, Yu Gao, Jun Wei 
    11th IEEE Conference on Software Testing, Validation and Verification (ICST 2018).
  • Detecting Faulty Empty Cells in Spreadsheets
    Liang Xu, Shuo Wang, Wensheng Dou, Bo Yang, Chushu Gao, Jun Wei, Tao Huang 
    25th International Conference on Software Analysis, Evolution, and Reengineering (SANER 2018, CCF B).
  • Rewriting High-Level Spreadsheet Structures into Higher-Order Functional Programs
    Florian Biermann, Wensheng Dou, Peter Sestoft 
    20th International Symposium on Practical Aspects of Declarative Languages (PADL 2018).
  • JSTrace: Fast Reproducing Web Application Errors
    Jie Wang, Wensheng Dou, Chushu Gao, Jun Wei 
    Journal of Systems and Software (JSS 2018, CCF B).
  • Characterizing and Diagnosing Out of Memory Errors in MapReduce Applications
    Lijie Xu, Wensheng Dou, Feng Zhu, Chushu Gao, Jie Liu, Jun Wei 
    Journal of Systems and Software (JSS 2018, CCF B).
  • A Comprehensive Study on Real World Concurrency Bugs in Node.js
    Jie Wang, Wensheng Dou, Yu Gao, Chushu Gao, Feng Qin, Kang Yin, Jun Wei 
    32nd International Conference on Automated Software Engineering (ASE 2017, CCF A).
  • Mining API Type Specifications for JavaScript
    Shuai Wang, Wensheng Dou, Chushu Gao, Jun Wei, Tao Huang 
    24th Asia-Pacific Software Engineering Conference (APSEC 2017).
  • SpreadCluster: Recovering Versioned Spreadsheets through Similarity-Based Clustering
    Liang Xu, Wensheng Dou, Chushu Gao, Jie Wang, Jun Wei, Hua Zhong, Tao Huang 
    14th International Conference on Mining Software Repositories (MSR 2017).
  • CACheck: Detecting and Repairing Cell Arrays in Spreadsheets
    Wensheng Dou, Chang Xu, Shing-Chi Cheung, Jun Wei 
    IEEE Transactions on Software Engineering (TSE 2017, CCF A).
  • Detecting Table Clones and Smells in Spreadsheets
    Wensheng Dou, Shing-Chi Cheung, Chushu Gao, Chang Xu, Liang Xu, Jun Wei 
    24th International Symposium on the Foundations of Software Engineering (FSE 2016, CCF A).
  • VEnron: A Versioned Spreadsheet Corpus and Related Evolution Analysis
    Wensheng Dou, Liang Xu, Shing-Chi Cheung, Chushu Gao, Jun Wei, Tao Huang 
    38th International Conference on Software Engineering (ICSE SEIP 2016).
  • Fast Reproducing Web Application Errors
    Jie Wang, Wensheng Dou, Chushu Gao, Jun Wei 
    26th IEEE International Symposium on Software Reliability Engineering (ISSRE 2015, CCF B).
  • Experience Report: A Characteristic Study on Out of Memory Errors in Distributed Data-Parallel Applications
    Lijie Xu, Wensheng Dou, Feng Zhu, Chushu Gao, Jie Liu, Hua Zhong, Jun Wei 
    26th IEEE International Symposium on Software Reliability Engineering (ISSRE 2015, CCF B).
  • Discovering User-Defined Event Handlers in Presence of JavaScript Libraries
    Shuai Wang, Wensheng Dou, Chushu Gao, Jun Wei, Tao Huang 
    22nd Asia Pacific Software Engineering Conference (APSEC 2015).
  • Towards Web Application Mobilization via Efficient Web Control Extraction
    Shuai Wang, Wensheng Dou, Guoquan Wu, Jie Wang, Chushu Gao, Jun Wei, Tao Huang 
    7th Asia-Pacific Symposium on Internetware (Internetware 2015).

  • Is Spreadsheet Ambiguity Harmful? Detecting and Repairing Spreadsheet Smells due to Ambiguous Computation
    Wensheng Dou, Shing-Chi Cheung, Jun Wei 
    36th International Conference on Software Engineering (ICSE 2014, CCF A).