📝 Selected Publications
(* indicates equal contribution, full publication list)
AIOps
-
No More Labelled Examples? An Unsupervised Log Parser with LLMs (FSE 2025)
Junjie Huang, Zhihan Jiang, Zhuangbin Chen, Michael R. Lyu
[code] [paper] -
L4: Diagnosing Large-scale LLM Training Failures via Automated Log Analysis (FSE 2025)
Zhihan Jiang, Junjie Huang, Guangba Yu, Zhuangbin Chen, Yichen Li, Renyi Zhong, Cong Feng, Yongqiang Yang, Zengyin Yang, Michael R. Lyu
[paper] -
Demystifying and Extracting Fault-indicating Information from Logs for Failure Diagnosis (ISSRE 2024)
Junjie Huang, Zhihan Jiang, Jinyang Liu, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Cong Feng, Hui Dong, Zengyin Yang, Michael R. Lyu
[code] [paper] -
FaultProfIT: Hierarchical Fault Profiling of Incident Tickets in Large-scale Cloud Systems (ICSE-SEIP 2024)
Junjie Huang, Jinyang Liu, Zhuangbin Chen, Zhihan Jiang, Yichen Li, Jiazhen Gu, Cong Feng, Zengyin Yang, Yongqiang Yang, Michael R. Lyu
[paper] -
Knowledge-aware Alert Aggregation in Large-scale Cloud Systems: a Hybrid Approach (ICSE-SEIP 2024)
Jinxi Kuang, Jinyang Liu, Junjie Huang, Renyi Zhong, Jiazhen Gu, Lan Yu, Rui Tan, Zengyin Yang, Michael R. Lyu
[paper] -
A Large-Scale Evaluation for Log Parsing Techniques: How Far Are We? (ISSTA 2024)
Zhihan Jiang, Jinyang Liu, Junjie Huang, Yichen Li, Yintong Huo, Jiazhen Gu, Zhuangbin Chen, Jieming Zhu, Michael R. Lyu
[code] [paper]
Code Intelligence
-
Contextualized Data-Wrangling Code Generation in Computational Notebooks (ASE 2024)
Junjie Huang, Daya Guo, Chenglong Wang, Jiazhen Gu, Shuai Lu, Jeevana Priya Inala, Cong Yan, Jianfeng Gao, Nan Duan, Michael R. Lyu
[code] [paper] -
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation (NeurIPS 2021)
Shuai Lu*, Daya Guo*, Shuo Ren*, Junjie Huang*, Alexey Svyatkovskiy, Ambrosio Blanco, Colin Clement, Dawn Drain, Daxin Jiang, Duyu Tang, Ge Li, Lidong Zhou, Linjun Shou, Long Zhou, Michele Tufano, Ming Gong, Ming Zhou, Nan Duan, Neel Sundaresan, Shao Kun Deng, Shengyu Fu, Shujie Liu
[code] [paper] -
CoSQA: 20,000+ Web Queries for Code Search and Question Answering (ACL 2021)
Junjie Huang, Duyu Tang, Linjun Shou, Ming Gong, Ke Xu, Daxin Jiang, Ming Zhou, Nan Duan
[data] [code] [paper] -
Execution-based Evaluation for Data Science Code Generation Models (EMNLP 2022 DaSH)
Junjie Huang, Chenglong Wang, Jipeng Zhang, Cong Yan, Haotian Cui, Jeevana Priya Inala, Colin Clement, Nan Duan, Jianfeng Gao
[code] [paper] -
CodeExp: Explanatory Code Document Generation (EMNLP 2022)
Haotian Cui, Chenglong Wang, Junjie Huang, Jeevana Priya Inala, Todd Mytkowicz, Bo Wang, Jianfeng Gao, Nan Duan
[code] [paper]