Yuxin Xiong

M.S. Student at UC San Diego

profile.jpg

Hi 👋 I’m Yuxin Xiong, a Master’s student in Computer Science at UC San Diego, advised by Prof. Julian McAuley. Before that, I received my Bachelor’s degree from Shanghai Jiao Tong University, where I was advised by Prof. Siheng Chen.

My research focuses on Reasoning, Reinforcement Learning and Large Language Model (LLM) Agents 🤖 — aiming to make language models more intelligent, interpretable, and aligned with human reasoning.

Previously, I collaborated with Prof. Zhiting Hu on large-scale reasoning model training at UC San Diego. I also interned at AWS AI Lab, working with Minjie Wang on Chain-of-Thought (CoT) agents and tool-augmented LLM reasoning.

I’m currently open to Ph.D. positions (Fall 2026) 🎓 and new grad opportunities in AI/ML research and development 🚀.

News 🔥

Jan 22, 2026 Our paper CTRLS was accepted to AISTATS 2026! 🌟
Sep 09, 2025 We contributed to the release of K2-Think — a general reasoning model! 🎉
Aug 20, 2025 Two of our papers were accepted to EMNLP 2025! 🎊
Jan 22, 2025 Our work OCEAN was accepted to ICLR 2025! 🌊
Jun 30, 2024 I graduated from Shanghai Jiao Tong University (SJTU)! 🎓
Jun 12, 2024 Our work MATRIX was selected as an ICML 2024 Spotlight! 🌟

Selected Publications

  1. AISTATS
    Ctrls: Chain-of-thought reasoning via latent state-transition
    Junda Wu*, Yuxin Xiong*, Xintong Li, Zhengmian Hu, Tong Yu, Rui Wang, Xiang Chen, Jingbo Shang, and Julian McAuley
    arXiv preprint arXiv:2507.08182, 2025
  2. EMNLP
    Explainable Chain-of-Thought Reasoning: An Empirical Analysis on State-Aware Reasoning Dynamics
    Sheldon Yu*, Yuxin Xiong*, Junda Wu, Xintong Li, Tong Yu, Xiang Chen, Ritwik Sinha, Jingbo Shang, and Julian McAuley
    In Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
  3. EMNLP
    Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent
    Junda Wu, Yuxin Xiong, Xintong Li, Yu Xia, Ruoyu Wang, Yu Wang, Tong Yu, Sungchul Kim, Ryan A Rossi, Lina Yao, Jingbo Shang, and Julian McAuley
    In Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
  4. Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation
    Xianghe Pang, Shuo Tang, Rui Ye, Yuxin Xiong, Bolun Zhang, Yanfeng Wang, and Siheng Chen
    In Forty-first International Conference on Machine Learning, 2024
    Spotlight
  5. OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models
    Junda Wu, Xintong Li, Ruoyu Wang, Yu Xia, Yuxin Xiong, Jianing Wang, Tong Yu, Xiang Chen, Branislav Kveton, Lina Yao, Jingbo Shang, and Julian McAuley
    In The Thirteenth International Conference on Learning Representations, 2025
  6. Emergent Communication in Interactive Sketch Question Answering
    Zixing Lei, Yiming Zhang, Yuxin Xiong, and Siheng Chen
    In Thirty-seventh Conference on Neural Information Processing Systems, 2023

Services

Review for:ICLR 2026, NeurIPS 2023/2025, ICML 2024/2025, AAAI 2025/2026