Kicking Off

Friday, May 1, 2026

Kicking Off

For this blog, I will regularly read AI research papers, summarize them, and discuss my findings. The point of this is to rapidly upskill in the area, and develop a taste for judging research directions. I'll start with a deep dive into understanding individual neurons, and how an auto-interp pipeline could be useful for automated interp/AI safety work.

AIS - Research Papers

Friday, May 1, 2026

Kicking Off

No comments:

Post a Comment

Language models can explain neurons in language models:

Report Abuse