xzAscC xzAscC

Hi, I'm Xudong Zhu 👋

PhD student at The Ohio State University working on understanding and controlling large language models. I also vibe code research prototypes, developer tools, and AI systems.

Recent Projects

Understanding Linear Steering (ongoing)
Investigating the geometry, linearity, and causal structure of steering directions in LLM representation space.
AbsTopK: Rethinking Sparse Autoencoders For Bidirectional Features ArXiv, OpenReview
Developed a principled proximal-gradient framework that unifies SAE variants (ReLU, JumpReLU, TopK) and reveals that non-negativity constraints prevent bidirectional feature representation. Proposed AbsTopK, a magnitude-based sparse operator that recovers complete semantic axes and improves interpretability and steering in LLMs.
From Emergence to Control: Probing and Modulating Self-Reflection in Language Models Arxiv
Showed that linear directions in representation space can enable and control self-reflection behavior in pretrained LLMs without finetuning.

GitHub Stats

If you are interested in collaboration, feel free to open an issue or connect with me.

Acknowledgments

GitHub stats cards are powered by github-readme-stats. Many thanks to the authors for building and maintaining it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xzAscC xzAscC

Block or report xzAscC

Hi, I'm Xudong Zhu 👋

Recent Projects

GitHub Stats

Acknowledgments

Pinned Loading

Uh oh!