Sidharth Baskaran
email 路 github 路 linkedin 路 饾晱 路 scholar
Computer Science at Georgia Tech. Working on language model interpretability to build intelligent systems. Manual transmission & caffeine enthusiast.
Research
HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks
Jiuding Sun, Jing Huang, Sidharth Baskaran, Karel D'Oosterlinck, Christopher Potts, Michael Sklar*, Atticus Geiger*
Brief Investigations of a Multi-layer Sparse Auto-Encoder
Sidharth Baskaran*, Michael Sklar*
LessWrong (in development)
Rebuilding ROME: Resolving Model Collapse during Sequential Model Editing
Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli
* denotes equal contribution
Software
Ray Job Submission launcher plugin for the Hydra library.
Simple orchestration of containers and remote workflows for research.