I am currently a fourth-year PhD student in the Department of Electronic Engineering at Tsinghua University, supervised by Professor Wu Ji. I also received my Bachelor degree from the Department of Electronic Engineering at Tsinghua University. I am currently an intern researcher at iFLYTEK.

My research interest includes evaluation and knowledge injection for large language models. I have published 5 papers at the top international AI conferences with total google scholar citations .

🔥 News

  • 2025.01: Our paper Reliable and diverse evaluation of LLM medical knowledge mastery has been accepted by The Thirteenth International Conference on Learning Representations (ICLR-25). Codes and Datasets will be out soon!
  • 2024.04: Our paper MultifacetEval: Multifaceted Evaluation to Probe LLMs in Mastering Medical Knowledge has been accepted by the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI-24). Codes and Datasets are available here.

📝 Publications

🏆 Academic Competitions

  • 1st place on the Task 7 (Multi-evidence Natural Language Inference for Clinical Trial Data) of 17th International Workshop on Semantic Evaluation (SemEval 2023)
  • 2nd place on the Task 9 (Fact Verification and Evidence Finding for Tabular Data in Scientific Documents) of 15th International Workshop on Semantic Evaluation (SemEval 2021)

🎖 Honors and Awards

  • 2022-2023 Tsinghua Alumni - Pinghu Talents Scholarship (Second Class).
  • 2023-2024 Tsinghua Alumni - Quanzhou Talents Scholarship (Second Class).
  • 2024-2025 Tsinghua Alumni - Jining Talents Scholarship (Second Class).

📖 Educations

  • 2021.08 - Present, PhD Student, Department of Electronic Engineering, Tsinghua University (GPA: 3.99/4.00).
  • 2017.08 - 2021.07, Undergraduate Student, Department of Electronic Engineering, Tsinghua University (GPA: 3.85/4.00).

💬 Invited Talks

  • 2024.12 I gave a talk about MultifacetEval on the NLP Academic Exchange Platform. You can find replay here

💻 Internships

  • 2022.05 - Present, iFlyTek, Beijing, China.