cv

You can download a PDF version, although the CV on this website is more up to date. I intend to document my whole journey here, however winding and strange it may look. I want to build, one honest work at a time, and then one item on this page. In the end it's a life worth living.

Basics

Name Zhonghao He (何忠豪)
Label AI Alignment and Human-AI Interaction Researcher
Affiliation Leverhulme Center for Future Intelligence, University of Cambridge
Summary I am creating AI assistants for human moral progress and for preventing LLM-induced lock-in. My work is the only necessary form of my existence. I build, therefore I exist.

Education

  • 2022.09 - PRESENT

    Cambridge, UK

    Master's
    University of Cambridge
    AI Ethics and Society
    • Machine Learning Alignment
    • AI Ethics
    • AI Governance
    • CS230 Deep Learning
    • ML Safety
    • Discrete Mathematics
    • CS234 Reinforcement Learning
    • CS109 Probability for Computer Scientists
    • Mechanistic Interpretability
    • Algorithms and Data Structures
  • 2019.06 - 2019.09

    Palo Alto, USA

    Summer Student
    Stanford University
    Cognitive Science & Philosophy
    • Mathematics Foundation of Computing
    • Minds and Machines
    • Introduction to Neuroscience
  • 2014.08 - 2019.06

    Shantou, China

    Bachelor of Arts
    Shantou University
    English & Global Studies
    • Machine Learning and related mathematics
    • Research Methodology
    • Linguistics

Projects

  • 2025.01 - Present
    Algorithms that Assist Truth Seeking and Prevent Lock-in
    We propose an alternative to RLHF that uses "helping humans to seek truth" as the training objective and human opinion-change data as ground truth. By doing so we aim to remove feedback-loop-incurred lock-in at its root and center alignment evaluation around "LLM-assisted human performance".
    • Position: Project Co-founder
    • Collaborators: Prof Andreea Bobu (MIT), Tianyi (Alex) Qiu (CHAI, Berkeley & PKU)
  • 2024.10 - 2025.01
    AI Systematically Rewires the Flow of Ideas
    We propose "AI influence" as a field of study on AI's impact on epistemics and morality. Early algorithmic interventions and data-curation practices can remove mechanisms (such as dual influence) that lead to systematic negative impacts on human knowledge and values.
    • Position: Project Co-founder
    • Collaborators: Prof Max Kleiman-Weiner (UW), Tianyi (Alex) Qiu (CHAI, Berkeley & PKU), Prof Atoosa Kasirzadeh, Prof John P Wihbey, Dr Moshe Glickman, Tao Lin.
  • 2024.10 - 2025.01
    The Lock-in Hypothesis: Stagnation by Algorithms
    We are concerned with the problems of LLM-incurred value lock-in and knowledge collapse (as probable as model collapse, since our discourse is increasingly mediated by AI systems and iterative training becomes more prevalent), with potentially more destructive consequences. Our team is working on toy-model demonstrations of the mechanisms of LLM-incurred value lock-in, as well as establishing real-world evidence of lock-in from human-AI interactions.
    • Position: Project Co-founder
    • Collaborators: Prof Max Kleiman-Weiner (UW), Tianyi (Alex) Qiu (CHAI, Berkeley & PKU)
  • 2023.12 - 2025.02
    Multilevel analytical framework for interpretability
    Drawing on cognitive science and neuroscience to address interpretability challenges in ML.
    • Position: Project Lead
    • Goal: Publication in Transactions on Machine Learning Research
    • Senior authors: Prof Grace W. Lindsay (NYU), Prof Anna Ivanova (Georgia Tech)
  • 2023.07 - 2023.10
    Comprehensive Survey on AI Alignment
    A survey paper on alignment research, written for newcomers to the field.
    • Focus: Interpretability challenges in ML
    • Collaborators: Yaodong Yang, Jiaming Ji, Tianyi Qiu
  • 2022.12 - 2023.03
    Harms from agentic algorithmic systems
    Research on safety and harms from agentic systems in AI.
    • Highlight: Published a paper cited by GPT-4 and by high-profile AI safety reports
  • 2021.06 - 2022.02
    Stanford Existential Risks Initiative (SERI)
    Research on China's AI governance approach.
    • Position: Research Fellow

Awards

Skills

Mathematics
Calculus
Information Theory
Linear Algebra
Formal Methods
ML Engineering
Machine Learning
Deep Learning
Data Analysis
ML Safety
Git
Experimental
Data Visualization
Mechanistic Interpretability
Simulation
Programming
Python (advanced)
PyTorch
R (intermediate)
Web development (intermediate)
MATLAB (basic)
C/C++ (basic)

Languages

English
Close to Native
Chinese
Native
French
Basic

Interests

Physical Activities
Rowing
Hiking
Other Interests
Debate
Greek Literature