cv
You can download a PDF version, although the CV on this website is more up to date. I intend to disclose my whole journey here, however winding and strange it may look. I want to build, one honest piece of work at a time, and then one item on this page. In the end, it is a life worth living.
Basics
Name | Zhonghao He (何忠豪) |
Label | AI Alignment and Human-AI Interaction Researcher |
Affiliation | Leverhulme Centre for the Future of Intelligence, University of Cambridge |
Summary | I am creating AI assistants for human moral progress and for preventing LLM-induced lock-in. My work is the only necessary form of my existence. I build, therefore I exist. |
Education
-
2022.09 - PRESENT Cambridge, UK
Master's
University of Cambridge
AI Ethics and Society
- Machine Learning Alignment
- AI Ethics
- AI Governance
- CS230 Deep Learning
- ML Safety
- Discrete Mathematics
- CS234 Reinforcement Learning
- CS109 Probability for Computer Scientists
- Mechanistic Interpretability
- Algorithms and Data Structures
-
2019.06 - 2019.09 Palo Alto, USA
Summer Student
Stanford University
Cognitive Science & Philosophy
- Mathematical Foundations of Computing
- Minds and Machines
- Introduction to Neuroscience
-
2014.08 - 2019.06 Shantou, China
Bachelor of Arts
Shantou University
English & Global Studies
- Machine Learning and related mathematics
- Research Methodology
- Linguistics
Projects
- 2025.05 - 2025.06
Position: AI Presents Catastrophic Epistemic Risks
We argue that AI presents catastrophic epistemic risks, which deserve a closer look and more research attention.
- Position: Co-author
- Collaborators: Kellin Pelrine, Dan Zhao, Tianyi (Alex) Qiu
- 2025.03 - Present
Stay True to the Evidence: Measuring Belief Entrenchment in LLM Reasoning via the Martingale Score
We propose an alternative to RLHF that uses "helping humans seek truth" as the training objective and human opinion-change data as ground truth. By doing so we aim to remove feedback-loop-incurred lock-in at its root and revolve alignment evaluation around "LLM-assisted human performance".
- Position: Project Co-founder
- Collaborators: Prof Maarten Sap (CMU), Prof Hirokazu Shirado, Tianyi (Alex) Qiu (CHAI, Berkeley & PKU)
- 2024.10 - 2025.01
Open Problems in AI Influence
We propose "AI influence " as a field of studies on AI's impact on epistemics and morality. In this paper we collect open problems in AI influence, as long as methodologies from ML, computational social science, cognitive science to study AI influence.
- Position: Project Co-founder
- Collaborators: Prof Max Kleiman-Weiner (UW), Tianyi (Alex) Qiu (CHAI, Berkeley & PKU), Prof Atoosa Kasirzadeh, Prof John P Wihbey, Dr Moshe Glickman, Tao Lin.
- 2024.10 - 2025.05
The Lock-in Hypothesis: Stagnation by Algorithms
We are concerned with LLM-incurred value lock-in and knowledge collapse: as our discourse is increasingly mediated by AI systems and iterative training becomes more prevalent, these are as probable as model collapse, but with more destructive consequences. Our team establishes real-world evidence of lock-in from WildChat data and builds simulations and formal models of the mechanisms of LLM-incurred value lock-in.
- Position: Project Co-founder
- Collaborators: Prof Max Kleiman-Weiner (UW), Tianyi (Alex) Qiu (CHAI, Berkeley & PKU)
- 2023.12 - 2025.02
Multilevel analytical framework for interpretability
Research drawing on cognitive science and neuroscience to address interpretability challenges in ML.
- Position: Project Lead
- Goal: Publication in Transactions on Machine Learning Research
- Senior authors: Grace W. Lindsay (NYU), Prof Anna Ivanova (Georgia Tech)
- 2023.07 - 2023.10
Comprehensive Survey on AI Alignment
Survey paper on alignment research for newcomers.
- Focus: Interpretability challenges in ML
- Collaborators: Yaodong Yang, Jiaming Ji, Tianyi Qiu
- 2022.12 - 2023.03
Harms from Increasingly Agentic Algorithmic Systems
Research on safety and harms arising from increasingly agentic algorithmic systems.
- Highlight: Published paper cited by the GPT-4 technical report and high-profile AI safety reports
- 2021.06 - 2022.02
Stanford Existential Risks Initiative (SERI)
Research on China's AI governance approach.
- Position: Research Fellow
Awards
- 2022.08
Open Philanthropy Graduate Scholarship
Open Philanthropy
A full scholarship for the master's degree in AI Ethics and Society at the University of Cambridge.
- 2023.01
Manifund AI Research Scholarship
Manifund
A research scholarship to support work on neuroscience and mechanistic interpretability.
- 2017.02
Hong Kong Cyberport Creative Micro Funding
Hong Kong Cyberport Management Company Limited, wholly owned by the Hong Kong SAR Government
Early-stage startup funding to support work on Homeal, a sharing-economy app for food and culture.
Skills
Mathematics | Calculus, Information Theory, Linear Algebra, Formal Methods |
ML Engineering | Machine Learning, Deep Learning, Data Analysis, ML Safety, Git |
Experimental | Data Visualization, Mechanistic Interpretability, Simulation |
Programming | Python (advanced), PyTorch, R (intermediate), web development (intermediate), Matlab (basic), C/C++ (basic) |
Languages
English | Close to Native |
Chinese | Native |
French | Basic |
Interests
Physical Activities | Rowing, Hiking |
Other Interests | Debate, Greek Literature |