Sriram Balasubramanian

PhD candidate at the University of Maryland, working on mechanistic interpretability and the safety of advanced AI systems.

About

I'm a PhD candidate in Computer Science at the University of Maryland, College Park, advised by Prof. Soheil Feizi. I work on uncovering the mechanisms that drive the success of modern neural networks — I think a principled understanding of these mechanisms is essential for safely developing and reliably controlling advanced AI.

More broadly, I'm concerned about the impact of advanced AI on human systems and the role of humanity in an era of widespread, superhuman AI. Previously, I was a research fellow at Microsoft Research India, and before that I did my BTech (Hons.) in CS at IIT Bombay.

Portrait of Sriram Balasubramanian

Research

Interpretability

Decomposing vision and language models into interpretable parts — heads, MLPs, circuits — and tying them to human-readable concepts.

Robustness

The geometry of model decisions — blind spots, masking, and the gap between what models claim and what they actually compute.

AI & Society

Where generative models meet the world: artistic style copyright, the (im)possibility of reliable AI-text detection, and other safety-adjacent questions.

Selected publications

2026
2025
2024
2023
2020

Full list on Google Scholar.

Experience