Jianli Zhao
zhao78740 AT gmail.com
Hi, there.
I am Jianli Zhao, who earned a B.S. in Computer Science from Shandong University, where I was mentored by Prof. Bin Jiang. Now I am spending a serene gap year, during which I have the privilege of being mentored by Prof. Fazl Barez from Oxford. Earlier, in the summer of 2024, I also spent a golden summer at UCSD, working under the guidance of Prof. Julian Mcauley.
My prior research primarily concentrated on Information Extraction using a diffusion model method [PDF]. Currently I am keen on Trustworthy LLMs and AI safety. My interests span from exploring the vulnerabilities of LLMs, such as red-teaming, to investigating interpretability via circuit-level analysis, with the goal of ensuring a more secure deployment. [PDF]
Research Interest
- Natural Language Processing
- AI Safety
- Trustworthy LLMs
- Interpretability and Alignment
Selected Publications
-
Chain-of-Thought Hijacking
Jianli Zhao, Tingchen Fu, Rylan Schaeffer, Mrinank Sharma, Fazl Barez
Under Review 2025 [paper] [code] [website] -
IPED: An Implicit Perspective for Relational Triple Extraction based on Diffusion Model
Jianli Zhao, Changhao Xu, Bin Jiang
NAACL 2024 [paper] [code]
Education
- B.S. in Shandong University, 2021-2025