Yadong (Adam) Lu
Principal researcher / enthusiastic Go player
I am a principal researcher at Microsoft Research, Redmond, working on computer-use agents and efficient pre- and post-training methods for large vision-language models. Before that, I received my Ph.D. in Statistics from UC Irvine (advisor: Pierre Baldi). I worked on a wide variety of scalable machine learning algorithms with applications to neural network model efficiency, image processing, and natural language processing.
Check out our recent work on OmniParser, a computer-use agent (ranked the #1 trending repo on GitHub and the HuggingFace model hub, with 24k+ stars so far), and on scaling synthetic trajectory data for web agents.
Selected Works
- Arxiv
- ACL 2025. Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents.
- Arxiv
- CVPR
- ICLR. Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling. ICLR 2024.
- TMLR
- Arxiv. An Empirical Study of Scaling Instruction-Tuned Large Multimodal Models. NeurIPS Workshop on Instruction Tuning and Instruction Following.
- Arxiv
- NeurIPS
- Patent
- Patent