Jiawei Liu

Third-year PhD candidate in CS at UIUC (since Fall 2021)


My research goal is to simplify the making of great software with and for machine learning and its systems. Currently, at UIUC, I work on Programming Languages, Formal Methods, and Software Engineering with Lingming Zhang. I have been working on the following main directions:

🛡️ Synthesizing test programs for automatic bug finding in ML systems:

🤖 Teaching and evaluating large language models for code:

🤗 Feel free to drop me an email if you are interested in my research.

Papers

  1. ACL’24
      To Appear  
    XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
    Yifeng Ding,  Jiawei Liu, Yuxiang Wei, Terry Yue Zhuo,  and Lingming Zhang
    arXiv preprint arXiv:2404.15247. 2024
  2. Pre-print
    Emerging Platforms Meet Emerging LLMs: A Year-Long Journey of Top-Down Development
    Siyuan Feng,  Jiawei Liu, Ruihang Lai, Charlie F. Ruan, Yong Yu, Lingming Zhang,  and Tianqi Chen
    arXiv preprint arXiv:2404.09151. 2024
  3. Pre-print
    StarCoder 2 and The Stack v2: The Next Generation
    Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar,  Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul,  Zhuang Li and 46 more authors
    arXiv preprint arXiv:2402.19173. 2024
  4. ICML’24
      To Appear  
    Magicoder: Empowering Code Generation with OSS-Instruct
    Yuxiang Wei, Zhe Wang,  Jiawei Liu, Yifeng Ding,  and Lingming Zhang
    Forty-first International Conference on Machine Learning. 2024
  5. NeurIPS’23
    Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation
    Jiawei Liu, Chunqiu Steven Xia, Yuyao Wang,  and Lingming Zhang
    Thirty-seventh Conference on Neural Information Processing Systems. 2023
  6. ESEC/FSE’23
    Artifact Available, Artifact Reusable
    NeuRI: Diversifying DNN Generation via Inductive Rule Inference
    Jiawei Liu, Jinjun Peng, Yuyao Wang,  and Lingming Zhang
    Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2023
    🏆 ACM SIGSOFT Distinguished Paper Award
  7. ASPLOS’23
    Artifact Available, Artifact Functional, Results Reproduced
    NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers
    Jiawei Liu, Jinkun Lin, Fabian Ruffy, Cheng Tan, Jinyang Li, Aurojit Panda,  and Lingming Zhang
    Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2. 2023
    🏆 Distinguished Artifact Award
  8. OOPSLA’22
    Artifact Available, Artifact Reusable
    Coverage-guided tensor compiler fuzzing with joint IR-pass mutation
    Jiawei Liu, Yuxiang Wei, Sen Yang, Yinlin Deng,  and Lingming Zhang
    Proceedings of the ACM on Programming Languages 6 (OOPSLA1). Apr 2022
*PLSE conferences such as OOPSLA and ESEC/FSE do not award a reproducibility badge at artifact evaluation, as it requires third-party re-implementation. Nonetheless, we got all the badges we could get. :D

Service

Organizing: LLM4Code@ICSE'24

Reviewer: NeurIPS'24, TSE, TOSEM, DCAA@AAAI'23, R2FM@ICLR'24

Artifact Evaluation Committee: PLDI'23, OSDI'22, ATC'22

Invited Talks

ARiSE Lab, Columbia University: Simplify the Making of Great Software in the ML Era April 2024

Snowflake GenAI: Rigorous Evaluation of LLMs for Code (Slides) Feb 2024

AST Lab, ETH Zürich: Generating Test-Cases for ML Compilers (Slides) Jan 2024

GAI4SE, NC State University: LLMs for Software Testing (Guest Lecture) Nov 2023

Apache TVM Conference: Automating DL Compiler Bug Finding with NNSmith Mar 2023

SAMPL, University of Washington: Coverage-Guided Tensor Compiler Fuzzing (Slides) May 2022

Experience

UIUC, 2021~TBD   CS PhD @ PL/FM/SE

BigCode   Open Code LLMs

Google TPU, Smr+Fall. 23   ML SDC

OctoML, Smr. 22   Pattern Language

Tongji University, 20{17~21}   B.Eng. in CS

Alibaba DAMO, Smr. 21   GNN4Assembly

NYU Systems Group, Smr. 20   Video Analytics

ByteDance AI Lab, Spr. 20   Model Serving