On this blog, I will regularly read AI research papers, summarize them, and discuss my findings. The point is to upskill rapidly in the area and to develop a taste for judging research directions. I'll start with a deep dive into understanding individual neurons, and into how an auto-interp (automated interpretability) pipeline could be useful for automated interpretability and AI safety work.