Advancing Surgical Intelligence through Multi-Modal Representation Learning
Date: 2025/04/21
Academic Seminar: Advancing Surgical Intelligence through Multi-Modal Representation Learning
Speaker: Kun Yuan, Ph.D. student, Technical University of Munich
Time: Monday, April 21, 2025, 1:30 PM
Online link:
Abstract
Understanding surgical scenes across multiple modalities is essential for building context-aware and intelligent surgical systems. This talk focuses on integrating visual information from both laparoscopic and external cameras, capturing the internal operative field and the surrounding OR environment, to enable holistic perception and reasoning in surgery. Kun Yuan will present recent advances in surgical multi-modal representation learning, leveraging surgical foundation models trained on large-scale video-text data. Emphasis will be placed on knowledge-guided adaptation strategies, cross-view alignment, and the challenges of sparse supervision. Applications include surgical phase recognition, team interaction analysis, and enhanced decision support, with an outlook on building generalizable and trustworthy AI tools for the OR.
About the Speaker
Kun Yuan is a senior Ph.D. student jointly enrolled at the University of Strasbourg, France, and the Technical University of Munich, Germany, supervised by Prof. Nicolas Padoy and Prof. Nassir Navab. His research focuses on multi-modal learning methods for surgical video analysis, with applications in surgical workflow understanding and intelligent operating rooms. His work centers on cross-modal representation learning from laparoscopic and external OR video, contributing to the next generation of context-aware surgical AI systems.