Mufei Li (李牧非)'s Homepage
PhD Student @ Georgia Institute of Technology
  I’m a third-year PhD student in machine learning at Georgia Institute of Technology, advised by Prof. Pan Li. My current research interest is memory mechanisms of foundation models, e.g., retrieval-augmented generation, long-context reasoning, long-term memory, etc.
Prior to that, I was a software development engineer (SDE) at Amazon Web Services (AWS) Shanghai AI Lab. I received my bachelor’s degree in Honors Math from NYU Shanghai.
News
| Sep 27, 2025 | Check EAPrivacy, a benchmark to quantify the physical-world privacy awareness of LLMs. This work was led by Xinjie, who’s looking for internship opportunities in summer 2026. | 
|---|---|
| Sep 21, 2025 | HaystackCraft, our new benchmark on LLM long-context and agentic reasoning, has been accepted by NeurIPS 2025 workshop on Evaluating the Evolving LLM Lifecycle. Congratulations to all the co-authors! | 
| Sep 18, 2025 | Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models has been accepted by NeurIPS 2025! Congratulations to Haoyu and all the co-authors! | 
| Jun 12, 2025 | Check our new paper on better leveraging the LLM KV cache for modeling multi-doc interdependencies, Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models, led by Haoyu! | 
| May 27, 2025 | I’m starting a new position as Research Scientist Intern at Meta Ranking and Foundational AI! I’m based in Sunnyvale, California. | 
Selected Publications
- 
      ICLRSimple is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented GenerationIn International Conference on Learning Representations, 2025Mufei Li and Siqi Miao contributed equally to this work. This work was selected for oral presentation at [ICLR 2025 workshop on Foundation Models in the Wild](https://fm-wild-community.github.io/)
 - 
      ICLR SpotlightLayerDAG: A Layerwise Autoregressive Diffusion Model for Directed Acyclic Graph GenerationIn International Conference on Learning Representations, 2025Selected for spotlight presentation at the ICLR 2025 workshop ["Will Synthetic Data Finally Solve the Data Access Problem?"](https://synthetic-data-iclr.github.io/)