BiliVLA: Scene-Aware Vision-Language-Action Model with Reinforcement Learning for Autonomous Biliary Endoscopic Navigation


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2606.23531