Dawei Gao

alt text 

Algorithm Engineer

Data Analytics and Intelligence Lab
Alibaba DAMO Academy

Jinhui Building, Beijing, China
gaodawei.gdw@alibaba-inc.com

About me

Dawei Gao is currently a staff in the Data Analytics and Intelligence Lab (DAIL) at Alibaba DAMO Academy. He receives his Ph.D. in 2022 in the Department of Computer Science and Engineering, Beihang University (Supervised by Prof. Ke Xu & Prof. Yongxin Tong). He receives his B.E. in 2016 in the Department of Computer Science and Engineering, Beihang University.

News

  1. [2024/02/22] Our novel multi-agent platform, AgentScope is released in GitHub!

Research

My recent research focuses on

Recent Publications

  1. [arXiv’24] Xuchen Pan, Dawei Gao(co-first author), Yuexiang Xie, Zhewei Wei, Yaliang Li, Bolin Ding, Ji-Rong Wen, Jingren Zhou. Very Large-Scale Multi-Agent Simulation in AgentScope, arXiv, 2024.

  2. [arXiv’24] Dawei Gao, Zitao Li (co-first author), Weirui Kuang, Xuchen Pan, Daoyuan Chen, Zhijian Ma, Bingchen Qian, Liuyi Yao, Lin Zhu, Chen Cheng, Hongzhu Shi, Yaliang Li, Bolin Ding, Jingren Zhou. AgentScope: A Flexible yet Robust Multi-Agent Platform, arXiv, 2024.

  3. [SIGMOD’24] Daoyuan Chen, Yilun Huang, Zhijian Ma, Hesen Chen, Xuchen Pan, Ce Ge, Dawei Gao, Yuexiang Xie, Zhaoyang Liu, Jinyang Gao, Yaliang Li, Bolin Ding, Jingren Zhou. Data-Juicer: A One-Stop Data Processing System for Large Language Models, In Proceedings of the ACM International Conference on Management of Data, 2024. [Industrial Track]

  4. [COLING’24] Peiyu Liu, Zikang Liu, Ze-Feng Gao, Dawei Gao, Wayne Xin Zhao, Yaliang Li, Bolin Ding, Ji-Rong Wen. Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study, In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024.

  5. [ACL’24] Yingqian Min, Kun Zhou, Dawei Gao, Wayne Xin Zhao, He Hu, Yaliang Li. Data-CUBE: Data Curriculum for Instruction-based Sentence Representation Learning, In Procedings of the Annual Meeting of the Association for Computational Linguistics, 2024.

  6. [VLDB’24] Dawei Gao, Haibin Wang (co-first author), Yaliang Li, Xiuyu Sun, Yichen Qian, Bolin Ding, Jingren Zhou. Text-to-SQL Empowered by Large Language Models: A Benchmark Evaluation, In Proceedings of the VLDB Endowment, GuangZhou, China, Aug 25 - Aug 29, 2024.

  7. [KDD’24] Weirui Kuang, Bingchen Qian, Zitao Li, Daoyuan Chen, Dawei Gao, Xuchen Pan, Yuexiang Xie, Yaliang Li, Bolin Ding, Jingren Zhou. FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Barcelona, Spain, Aug 25 - Aug 29, 2024.

  8. [ICML’23] Daoyuan Chen, Liuyi Yao, Dawei Gao, Yaliang Li, Bolin Ding. Efficient Personalized Federated Learning via Sparse Model-Adaptation, In Proceedings of the International Conference on Machine Learning, Honolulu, Hawei'i, Jul 23 - Jul 29, 2023.

  9. [VLDB’23] Dawei Gao, Daoyuan Chen (co-first author), Zitao Li, Yuexiang Xie, Xuchen Pan, Yaliang Li, Bolin Ding, Jingren Zhou. FS-Real: A Real-World Cross-Device Federated Learning Platform, In Proceedings of the VLDB Endownment, vol.15, no.6, Vancouver, Canada, Aug 29 - Sep 1, 2023. [Demostration Track]

  10. [KDD’23] Daoyuan Chen, Dawei Gao (co-first author), Yuexiang Xie, Xuchen Pan, Zitao Li, Yaliang Li, Bolin Ding, Jingren Zhou. FS-REAL: Towards Real-World Cross-Device Federated Learning, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Long Beach, California, USA, Aug 6 - Aug 10, 2023.

  11. [KDD’23] Ergute Bao, Dawei Gao, Xiaokui Xiao, Yaliang Li. Communication Efficient and Differentially Private Logistic Regression under the Distributed Setting, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Long Beach, California, USA, Aug 6 - Aug 10, 2023.

  12. [VLDB’23] Yuexiang Xie, Zhen Wang, Dawei Gao, Daoyuan Chen, Liuyi Yao, Weirui Kuang, Yaliang Li, Bolin Ding, Jingren Zhou. FederatedScope: A Flexible Federated Learning Platform for Heterogeneity, In Proceedings of the VLDB Endowment, vol.15, no.6, Vancouver, Canada, Aug 29 - Sep 1, 2023.

  13. [arXiv’22] Liuyi Yao, Dawei Gao, Zhen Wang, Yuexiang Xie, Weirui Kuang, Daoyuan Chen, Haohui Wang, Chenhe Dong, Bolin Ding, Yaliang Li. A Benchmark for Federated Hetero-Task Learning, arXiv, 2022.

  14. [NeurIPS’22] Daoyuan Chen, Dawei Gao, Weirui Kuang, Yaliang Li, Bolin Ding. pFL-Bench: A Comprehensive Benchmark for Personalized Federated Learning, In Proceedings of the Annual Conference on Neural Information Processing Systems, New Orleans, USA, Nov 27 - Dec 3, 2022. (Datasets and Benchmarks Track)

  15. [KDD’22] Dawei Gao, Yuexiang Xie, Zimu Zhou, Zhen Wang, Yaliang Li, Bolin Ding. Finding Meta Winning Ticket to Train Your MAML, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, Aug 14 - Aug 18, 2022. (Research Track)

  16. [CIKM’21] Dawei Gao, Xiaoxi He, Zimu Zhou*, Yongxin Tong, Lothar Thiele. Pruning Meta-Trained Networks for On-Device Adaptation, In Proceedings of the ACM International Conference on Information and Knowledge Management, Virtual, Nov 01 - Nov 05, 2021.

  17. [KDD’21] Xiaoxi He, Dawei Gao, Zimu Zhou*, Yongxin Tong, Lothar Thiele. Pruning-Aware Merging for Efficient Multitask Inference, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Virtual, Aug 14 - Aug 18, 2021. (Research Track)

  18. [KDD’20] Dawei Gao, Xiaoxi He, Zimu Zhou*, Yongxin Tong, Ke Xu, Lothar Thiele. Rethinking Pruning for Accelerating Deep Inference At the Edge, In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Virtual, Aug 23 - Aug 27, 2020. (Research Track)

  19. [DSE’17] Dawei Gao, Yongxin Tong*, Jieying She, Tianshu Song, Lei Chen, Ke Xu. Top-k Team Recommendation and Its Variants in Spatial Crowdsourcing, Data Science and Engineering, 2(2): 136-150, June 2017.

  20. [APWeb-WAIM’17] Dawei Gao, Yongxin Tong*, Yudian Ji, Ke Xu. Team-Oriented Task Planning in Spatial Crowdsourcing, In Proceedings of the 1st Asia Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint Conference on Web and Big Data, Pages 41-56, Beijing, China, July 7-9, 2017.

  21. [WAIM’16] Dawei Gao, Yongxin Tong, Jieying She, Tianshu Song, Lei Chen, Ke Xu. Top-k Teams Recommendation in Spatial Crowdsourcing, In Proceedings of the 17th International Conference on Web-Age Information Management, Pages 191-204, Nanchang, Jiangxi, China, June 3-5, 2016. (Best Paper Award)

Open Source Project

  1. AgentScope: An developer-oriented multi-agent platform.

  2. FederatedScope: An easy-to-use federated learning platform.

  3. FS-REAL: An efficient and scalable prototyping system for real-world cross-device federated learning.

  4. Data-Juicer: A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs!

  • Welcome, the -th visitor!