Authors: Dongyu Yao, Keling Yao, Harry Chi; Advisor: Prof. Katia Sycara and Dr. Yaqi Xie
About Team
Students
Dongyu Yao is a graduate student in the Master of Science in Computer Vision program at Carnegie Mellon University. He earned his Bachelor’s degree in Cyberspace Security at Wuhan University. His research interests focus on Data-centric AI and Reliable (Multi-modal) AI, encompassing key areas in Computer Vision (data-efficient scene understanding), Natural Language Processing, and Generative AI (LLMs and MLLMs).Keling Yao is a graduate student in the Master of Science in Computer Vision program at Carnegie Mellon University. He earned his Bachelor’s degree in Data Science Security at the Chinese University of Hong Kong. His research interests focus on Multimodal Language Model Post Training and Autonomous Driving System.Harry Chi is a graduate student in the Master of Science in Computer Vision program at Carnegie Mellon University. He earned his Bachelor’s degree in Computer Science and Technology at ShanghaiTech University. His research interests focus on Multimodal Machine Learning and Computer Vision, with extension to Reinforcement Learning and World Model.
Advisors
Prof. Katia Sycara is a Research Professor at the Robotics Institute in the School of Computer Science at Carnegie Mellon University and a part-time holder of the Sixth Century Chair in Computing Science at the University of Aberdeen. She directs the Advanced Agent-Robotics Technology Lab. Her research interests include multi-agent systems, human–robot interaction, adversarial and game-theoretic reasoning, machine learning, planning and scheduling, information fusion, and large-scale human–machine networked systems.Dr. Yaqi Xie is a Postdoctoral Fellow at the Robotics Institute at Carnegie Mellon University, where she is advised by Prof. Katia Sycara. She received her Ph.D. in Computer Science from the National University of Singapore under the supervision of Prof. Harold Soh. Her research focuses on human–AI synergy through neural-symbolic integration, combining symbolic reasoning and deep learning to build adaptive, interpretable, and trustworthy intelligence systems, with applications in perception, decision-making, and generative models.