Bowen Xu

News

Learning From the Best: What Makes Popular Hugging Face Models? A Registered Report

Quantization Is Not a Dealbreaker: Empirical Insights from Large Code Models

APIDocBooster: An Extract-Then-Abstract Framework Leveraging Large Language Models for Augmenting API Documentation

How Program Analysis and AI Can Better Support Domain-Specific Software Development and Maintenance?

A Comprehensive Study of OOP-Related Bugs in C++ Compilers

Empirical Software Engineering Journal (aka. EMSE)

Safe Use of AI for Coding

Finding Safety Violations of AI-Enabled Control Systems through the Lens of Synthesized Proxy Programs

PTM4Tag+: Tag recommendation of stack overflow posts with pre-trained models

Prioritizing Speech Test Cases

Benchmarking Track

An empirical study on the effectiveness of large language models for SATD identification and classification

Systematic Code Migration

Leveraging Large Language Model for Automatic Patch Correctness Assessment

BAFFLE: Backdoor Attack in Offline Reinforcement Learning

Stealthy Backdoor Attack for Code Models

Out of Sight, Out of Mind: Better Automatic Vulnerability Repair by Broadening Input Ranges and Sources

Curiosity-Driven Testing for Sequential Decision-Making Process

Greening Large Language Models of Code

Research Track

Forge 2024

Representation Learning for Stack Overflow Posts: How Far are We?

Technical Track

The Devil is in the Tails: An Exploratory Study on Long-tailed

Are We Ready to Embrace Generative AI for Software Q&A?

Self-Supervised Code Change Representation Learning

Supporting Collateral Evolution in Software Ecosystems

Software Ecosystems: Tooling and Analytics

Multi-Granularity Detector for Vulnerability Fixes

NIER Track

Technical Track

Demonstrations Track

Generation-based Code Review Automation: How Far Are We?

TECHSUMBOT: A Stack Overflow Answer Summarization Tool for Technical Query

Curiosity-Driven and Victim-Aware Adversarial Policies

Duplicate Bug Report Detection: How Far Are We?

Compressing Pre-trained Models of Code into 3 MB

Answer Summarization for Technical Queries: Benchmark and New Approach

How to Better Utilize Code Graphs in Semantic Code Search?

[Call For Paper]

the 6th edition of the MaLTeSQuE

PTM4Tag: Sharpening Tag Recommendation of Stack Overflow with Pre-trained Models

Aspect-Based API Review Classification: How Far Can Pre-Trained Transformer Model Go?

Can Identifier Splitting Improve Open-Vocabulary Language Model of Code?

Post2Vec: Learning Distributed Representations of Stack Overflow Posts

Bio

Dr. Bowen Xu is a (tenure-track) Assistant Professor in the Department of Computer Science at North Carolina State University (NC State). Prior he joining NC State, he was a post-doctoral researcher in the School of Computing and Information Systems (SCIS) at Singapore Management University (SMU). He received his PhD degree from SCIS at SMU. His research interests lie primarily in the fields of machine learning and software engineering. Recently, he has focused on securing AI models and building high-quality data for various coding tasks. His works won several research paper awards, such as the Highly Commended Full Paper Award at ESEM 2018, the Honorable Mention Award at ACSAC 2022, nominated for ACM SIGSOFT Distinguished Paper Award at ASE 2022. He co-organized two FSE workshops SEA4DQ in 2024 and MaLTeS in 2022. He served as Paper Review co-chair for ICSE 2025 and FSE 2025 and guest editor for EMSE. He is also invited to be a PC and referee for many top-tier conferences and Journals, such as ICSE, FSE, ASE, TSE, TOSEM, EMSE, etc. Contact him at: bxu22@ncsu.edu. More info at: https://www.bowenxu.me.

Bowen Xu

Research Focus

News

Bio