Publications

Publications and Preprints

My research focuses on interpretable AI systems: understanding and shaping the mechanisms behind LLM reasoning, agent memory, and human-centered recommendation. I am particularly interested in methods with clear mathematical or geometric structure that explain empirical phenomena, support verification, and make AI behavior more reliable and safe.

Work in Progress

Selected Projects