Jiawei Liu

I’m a Ph.D. candidate at UIUC PL/FM/SE (2021-Present), advised by Lingming Zhang.

I am generally interested in Programming Systems and Machine Learning.

📬 Shortest path to find me: jiawei6@illinois.edu

📰 Some recent blog & hacking: R1 for Code Generation and Speculative Code Editing.

Papers Show More

  1. Pre-print
    Learning Code Preference via Synthetic Evolution
    arXiv preprint arXiv:2410.03837. 2024
  2. ICML’24
    Magicoder: Empowering Code Generation with OSS-Instruct
    Forty-first International Conference on Machine Learning. 2024
     Adopted by Meta Llama 3.1, Google CodeGemma, and IBM Granite
  3. Pre-print
    StarCoder 2 and The Stack v2: The Next Generation
    Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar,  Jiawei LiuYuxiang Wei and 56 more authors
    arXiv preprint arXiv:2402.19173. 2024
  4. NeurIPS’23
    Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation
    Thirty-seventh Conference on Neural Information Processing Systems. 2023
     Over 700k HuggingFace downloads; integrated by various industries
  5. FSE’23
    Atifact AvailableAtifact Reusable
    NeuRI: Diversifying DNN Generation via Inductive Rule Inference
    Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2023
     🏆 ACM SIGSOFT Distinguished Paper Award
  6. ASPLOS’23
    Atifact AvailableAtifact FunctionalResults Reproduced
    NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers
    Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2. 2023
     🏆 Distinguished Artifact Award
  7. OOPSLA’22
    Atifact AvailableAtifact Reusable
    Coverage-guided tensor compiler fuzzing with joint IR-pass mutation
    Proceedings of the ACM on Programming Languages 6 (OOPSLA1). Apr 2022

Invited Talk

NLP+SE Seminar, UT Austin: Smelling the Quality of LLM-generated Code Mar 2025

Programming Systems, Uber: Evaluating LLMs for Correct & Efficient Code Generation Sept 2024

ARiSE Lab, Columbia University: Simplify the Making of Great Software in the ML Era April 2024

Snowflake GenAI: Rigorous Evaluation of LLMs for Code (Slides) Feb 2024

AST Lab, ETH Zürich: Generating Test-Cases for ML Compilers (Slides) Jan 2024

GAI4SE, NC State University: LLMs for Software Testing (Guest Lecture) Nov 2023

Apache TVM Conference: Automating DL Compiler Bug Finding with NNSmith Mar 2023

SAMPL, University of Washington: Coverage-Guided Tensor Compiler Fuzzing (Slides) May 2022

Service

Organizing: LLM4Code@ICSE'{24,25} (Publicity Chair)

Program Committee/Reviewer: ASE'24, TSE, TOSEM, NeurIPS'24, ICLR'25

Artifact Evaluation Committee: PLDI'23, OSDI'22, ATC'22