Yadong (Adam) Lu

Principal researcher / enthusiastic go player

prof_pic.jpg

I am a principal researcher at Microsoft Research, Redmond working on computer use agent and efficient pre/post-training methodologies of large vision-language models. Before that I received my Ph.D. degree in Statistics at UC Irvine (advisor: Pierre Baldi). I worked on a wide variety of scalable machine learning algorithms with applications to neural network model efficiency, image processing, and natural language processing.

Check out our recent work on computer use agent OmniParser (ranked #1 Trending repo on GitHub and HuggingFace model hub, 24k+ star so far), and scaling synthetic trajectory data for web agent.

Selected Works

  1. Arxiv
    Beyond Clicking: A Step Towards Generalist GUI Grounding via Text Dragging
    Liao, Zeyi, Lu, Yadong, Gou, Boyu, Sun, Huan, and Awadallah, Ahmed
    2025
  2. ACL 2025
    Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
    Pahuja, Vardaan, Lu, Yadong, Rosset, Corby, Gou, Boyu, Mitra, Arindam, Whitehead, Spencer, Su, Yu, and Awadallah, Ahmed
    ACL 2025 2025
  3. Arxiv
    OmniParser for Pure Vision Based GUI Agent
    Lu, Yadong, Yang, Jianwei, Shen, Yelong, and Awadallah, Ahmed
    2024
  4. CVPR
    Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale
    Bonatti, Rogerio, Zhao, Dan, Bonacci, Francesco, Dupont, Dillon, Abdali, Sara, Li, Yinheng, Lu, Yadong, Wagle, Justin, Koishida, Kazuhito, Bucker, Arthur, Jang, Lawrence, and Hui, Zack
    CVPR 2025
  5. ICLR
    Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
    Ren, Liliang, Liu, Yang, Lu, Yadong, Shen, Yelong, Liang, Chen, and Chen, Weizhu
    ICLR 2024
  6. TMLR
    Multi-LoRA Composition for Image Generation
    Zhong, Ming, Shen, Yelong, Wang, Shuohang, Lu, Yadong, Jiao, Yizhu, Ouyang, Siru, Yu, Donghan, Han, Jiawei, and Chen, Weizhu
    TMLR 2024
  7. Arxiv
    An Empirical Study of Scaling Instruction-Tuned Large Multimodal Models
    Yadong, Lu, Chunyuan, Li, Haotian, Liu, Jianwei, Yang, Jianfeng, Gao, and Yelong, Shen
    NeurIPS, Workshop on Instruction Tuning and Instruction Following NeurIPS
  8. Arxiv
    Efficient RLHF: Reducing the Memory Usage of PPO
    Michael, Santacroce, Yadong, Lu, Han, Yu, Yuanzhi, Li, and Yelong, Shen
    Arxiv 2023
  9. NeurIPS
    In-Context Learning Unlocked for Diffusion Models
    Wang, Zhendong, Jiang, Yifan, Lu, Yadong, Shen, Yelong, He, Pengcheng, Chen, Weizhu, Wang, Zhangyang, and Zhou, Mingyuan
    NeurIPS Spotlight 2023
  10. Patent
    Progressive data compression using artificial neural networks
    Lu, Yadong, Yang, Yang, Zhu, Yinhao, Said, Amir, and Cohen, Taco
    US Patent 2022
  11. Patent
    Variable bit rate compression using neural network models
    Lu, Yadong, Yang, Yang, Zhu, Yinhao, Said, Amir, Pourreza, Reza, and Cohen, Taco
    US Patent 2022
  12. JHEP
    Resolving Extreme Jet Substructure
    Lu, Yadong, Romero, Alexis, Fenton, Michael James, Whiteson, Daniel, and Baldi, Pierre
    Journal of High Energy Physics 2022
  13. ECCV
    Differentiable Joint Pruning and Quantization for Hardware Efficiency
    Wang, Yin, Lu, Yadong, and Blankevoort, Tijmen
    European Conference on Computer Vision (ECCV) 2020
  14. ICIP
    Progressive Neural Image Compression With Nested Quantization And Latent Ordering
    Lu, Yadong*, Zhu, Yinhao*, Yang, Yang*, Said, Amir, and Cohen, Taco S
    In IEEE International Conference on Image Processing (ICIP) 2021