Friday, May 1, 2026

Kicking Off

     For this blog, I will regularly read AI research papers, summarize them, and discuss my findings. The point of this is to rapidly upskill in the area, and develop a taste for judging research directions. I'll start with a deep dive into understanding individual neurons, and how an auto-interp pipeline could be useful for automated interp/AI safety work.

No comments:

Post a Comment

Language models can explain neurons in language models:

Language models can explain neurons in language models: Link: https://openaipublic.blob.core.windows.net/neuron-explainer/paper/index.html  ...