I build self-improving agents that do real work.

I recently finished my Ph.D. in Computer Science at Yale University, advised by Mark Gerstein.

I work with Arman Cohan as my advisor of record, and closely with Smita Krishnaswamy. I earned my master's from Yale CS with the late Dragomir Radev, and have been a graduate affiliate at Grace Hopper College since 2021.

My research focuses on three threads:

  • Harnessing self-improving reasoning: memory [R.3], self-updating [S.14], and hierarchical refinement [R.5] that enable LLMs to sustain expert-level multi-step reasoning at the frontier of human knowledge.
  • Harnessing collaborative agents: specialized agents that coordinate, debate, and converge on reliable decisions across professional services, medicine, engineering, and consumer activities. [S.7,R.2].
  • Training AI co-scientists: representations for protein design [R.1], molecular reasoning [S.9], and large-scale homology search [S.13], driving discoveries that hold up in the lab [S.15].
My research is supported by Schmidt Futures.

I've also been a Research Scientist Intern at Google (contributing across DeepMind, Search, and Cloud); with earlier stints at Eigen AI, Microsoft, and Sinovation Ventures.

Now building something new.



Discover my full publication on google scholar | semantic scholar

Recent Preprints [ + Show 4 papers ]

Selected Publications [ − Hide ]

  • [R.6] EvoClaw: Evaluating AI Agents on Continuous Software Evolution
    Gangda Deng, Zhaoling Chen, Zhongming Yu, Haoyang Fan, Yuhong Liu, Yuxin Yang, Dhruv Parikh, Rajgopal Kannan, Le Cong, Mengdi Wang, Qian Zhang, Viktor Prasanna, Xiangru Tang*, Xingyao Wang*
    ICML 2026
    [PDF] [Abstract] [Bib]
    EvoClaw
  • [S.19] LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
    Xinwu Ye, Yicheng Mao, Jia Zhang, Yimeng Liu, Li Hao, Fang Wu, Zhiwei Li, Yuxuan Liao, Zehong Wang, Yingcheng Wu, Zhiyuan Liu, Zhenfei Yin, Li Yuan, Philip Torr, Huan Sun, Xiangxiang Zeng, Mengdi Wang, Le Cong, Shenghua Gao, Xiangru Tang
    ICML 2026
    [PDF] [Abstract] [Bib]
    LatentChem
  • [S.18] MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
    Xiangru Tang, Daniel Shao, Jiwoong Sohn, Jiapeng Chen, Jiayi Zhang, Jinyu Xiang, Fang Wu, Yilun Zhao, Chenglin Wu, Wenqi Shi, Arman Cohan, Mark Gerstein.
    Patterns
    "Thinking models (DeepSeek R1 and OpenAI o3) show exceptional performance on medical QA tasks."
    [PDF] [Abstract] [Bib]
    MedagentsBench
  • [S.17] Eigen-Agent: Adaptive Multi-Agent Scientific Reasoning with Monitor-Based RAG
    Xiangru Tang, Wanghan Xu, Yujie Wang, Zijie Guo, Daniel Shao, Jiapeng Chen, Cixuan Zhang, Ziyi Wang, Lixin Zhang, Guancheng Wan, Wenlong Zhang, Lei Bai, Zhenfei Yin, Philip Torr, Hanrui Wang, Di Jin.
    ICLR 2026
    [PDF] [Abstract] [Bib]
    Eigen-1
  • [S.16] LocAgent: Graph-Guided LLM Agents for Code Localization
    Zhaoling Chen*, Xiangru Tang*, Gangda Deng*, Fang Wu, Jialong Wu, Zhiwei Jiang, Viktor Prasanna, Arman Cohan, Xingyao Wang.
    ACL 2025
    "No need to embed the entire repo, agent + graph-based indexing is all you need!"
    [PDF] [Abstract] [Bib]
    LocAgent
  • [S.15] Risks of AI Scientists: Prioritizing Safeguarding Over Autonomy
    Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein.
    Nature Communications, 2025 (IF 14.7)
    ICLR 2024 Workshop on LLM Agents
    [PDF] [Abstract] [Bib]
  • [S.14] ChemAgent: Self-updating Memories in Large Language Models Improves Chemical Reasoning
    Xiangru Tang*, Tianyu Hu*, Muyang Ye*, Yanjun Shao*, Xunjian Yin, Siru Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan, Mark Gerstein.
    ICLR 2025
    "Enable LLMs to continuously improve through experience."
    [PDF] [Abstract] [Bib]
    ChemAgent
  • [S.13] Fast, Sensitive Detection of Protein Homologs Using Deep Dense Retrieval
    Liang Hong*, Zhihang Hu*, Siqi Sun*, Xiangru Tang*, Jiuming Wang, Qingxiong Tan, Liangzhen Zheng, Sheng Wang, Sheng Xu, Irwin King, Mark Gerstein, Yu Li.
    Nature Biotechnology, 2024 (IF 33.1)
    "Up to 28,700 times faster than HMMER!"
    [PDF] [Abstract] [Bib]
    DPR
  • [S.12] MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications
    Xiangru Tang*, Chunyuan Deng*, Hanmin Wang*, Haoran Wang*, Yilun Zhao, Wenqi Shi, Yi Fung, Wangchunshu Zhou, Jiannan Cao, Heng Ji, Arman Cohan, Mark Gerstein.
    EMNLP 2024
    [PDF] [Abstract] [Bib]
    MIMIR
  • [S.11] Step-Back Profiling: Distilling User History for Personalized Scientific Writing
    Xiangru Tang, Xingyao Zhang, Yanjun Shao, Jie Wu, Yilun Zhao, Arman Cohan, Ming Gong, Dongmei Zhang, Mark Gerstein.
    IJCAI 2024 Workshop on AI4Research (Best Paper Award)
    [PDF] [Abstract] [Bib]
    Step-Back Profiling
  • [S.10] A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation
    Xiangru Tang*, Howard Dai*, Elizabeth Knight*, Fang Wu, Yunyang Li, Tianxiao Li, Mark Gerstein.
    Briefings in Bioinformatics, 2024 (IF 13.99, JCR "Q1")
    "An introductory overview with a clear breakdown of datasets, benchmarks, & models."
    [PDF] [Abstract] [Bib]
    GenAI4Drug
  • [S.9] MolLM: A Unified Language Model for Integrating Biomedical Text with 2D and 3D Molecular Representations
    Xiangru Tang, Andrew Tran, Jeffrey Tan, Mark Gerstein.
    ISMB 2024, Proceedings in Bioinformatics (IF 6.93, JCR "Q1")
    [PDF] [Abstract] [Bib]
    MolLM
  • [S.8] BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models
    Xiangru Tang, Bill Qian, Rick Gao, Jiakang Chen, Xinyun Chen, Mark Gerstein.
    ISMB 2024, Proceedings in Bioinformatics (IF 6.93, JCR "Q1")
    "BioCoder input covers repository-level potential package dependencies, class declarations, & global variables."
    [PDF] [Abstract] [Bib]
    BioCoder
  • [S.7] MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
    Xiangru Tang*, Anni Zou*, Zhuosheng Zhang, Yilun Zhao, Xingyao Zhang, Arman Cohan, Mark Gerstein.
    ACL 2024 Findings
    "The first multi-agent framework within the medical context!"
    [PDF] [Abstract] [Bib]
    MedAgents
  • [S.6] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data?
    Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein.
    NAACL 2024 (Oral)
    [PDF] [Abstract] [Bib]
    Struc-Bench
  • [S.5] Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models
    Anni Zou, Zhuosheng Zhang, Hai Zhao, Xiangru Tang.
    IEEE Transactions on Audio, Speech and Language Processing (In Review)
    "Bridge the gap between performance and generalization when using the CoT prompting!"
    [PDF] [Abstract] [Bib]
    Meta-CoT
  • [S.4] Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning
    Xiangru Tang, Arman Cohan, Mark Gerstein.
    ACL 2023 Clinical Natural Language Processing
    [PDF] [Abstract] [Bib]
  • [S.3] GersteinLab at MEDIQA-Chat 2023: Clinical Note Summarization from Doctor-Patient Conversations through Fine-tuning and In-context Learning
    Xiangru Tang, Andrew Tran, Jeffrey Tan, Mark Gerstein.
    ACL 2023 Clinical Natural Language Processing
    [PDF] [Abstract] [Bib]
    MEDIQA
  • [S.2] CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning
    Xiangru Tang, Arjun Nair, Borui Wang, Bingyao Wang, Jai Desai, Aaron Wade, Haoran Li, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev.
    NAACL 2022 (Oral)
    [PDF] [Abstract] [Bib]
  • [S.1] Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
    Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev.
    NAACL 2022
    [PDF] [Abstract] [Bib]

Other Publications [ + Show 24 papers ]

Honors [ − Hide ]

2026   Best Paper Award · AAAI AI for Research Workshop
2025   Best Paper Runner-Up · ICML CFAgentic Workshop
2025   Yunfan “Rising Star” Award · World AI Conference (WAIC)
2024   Best Paper Award · IJCAI AI for Research Workshop
2025   Travel Grant · ICML Computer Use Agents Workshop
2025   Financial Assistance · ICLR

Recent Talks [ + Show 10 talks ]

Services [ + Show ]

Undergrad Advising [ + Show ]

Misc. [ + Show ]