Spatio-temporal Layers Based Intra-Operative Stereo Depth Estimation Network Via Hierarchical Prediction and Progressive Training.
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE(2024)
摘要
Background and Objective: Safety of robotic surgery can be enhanced through augmented vision or artificial constraints to the robotl motion, and intra-operative depth estimation is the cornerstone of these applications because it provides precise position information of surgical scenes in 3D space. High-quality depth estimation of endoscopic scenes has been a valuable issue, and the development of deep learning provides more possibility and potential to address this issue.Methods: In this paper, a deep learning-based approach is proposed to recover 3D information of intra-operative scenes. To this aim, a fully 3D encoder-decoder network integrating spatio-temporal layers is designed, and it adopts hierarchical prediction and progressive learning to enhance prediction accuracy and shorten training time.Results: Our network gets the depth estimation accuracy of MAE 2.55 +/- 1.51 (mm) and RMSE 5.23 +/- 1.40 (mm) using 8 surgical videos with a resolution of 1280x1024, which performs better compared with six other state-of-the-art methods that were trained on the same data.Conclusions: Our network can implement a promising depth estimation performance in intra-operative scenes using stereo images, allowing the integration in robot-assisted surgery to enhance safety.
更多查看译文
关键词
Robotic surgery,Intra-operative,Depth estimation,Deep learning,Stereo images
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要