Sidharth Baskaran

emailgithublinkedin饾晱scholar

Computer Science at Georgia Tech. Working on language model interpretability to build intelligent systems. Manual transmission & caffeine enthusiast.

Research

HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks
Jiuding Sun, Jing Huang, Sidharth Baskaran, Karel D'Oosterlinck, Christopher Potts, Michael Sklar*, Atticus Geiger*
ICLR 2025pdf
Brief Investigations of a Multi-layer Sparse Auto-Encoder
Sidharth Baskaran*, Michael Sklar*
LessWrong (in development)
Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing
Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli
EMNLP 2024pdfarxiv

* denotes equal contribution

Software