Automatic Surgical Skill Rating Using Stylistic Behavior Components
Autor: | Marzieh Ershad, Robert Rege, Ann Majewicz Fey |
---|---|
Rok vydání: | 2018 |
Předmět: |
Computer science
business.industry education Gold standard ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION 030232 urology & nephrology 030230 surgery computer.software_genre behavioral disciplines and activities Task (project management) 03 medical and health sciences 0302 clinical medicine Resource (project management) Robotic Surgical Procedures Surgical skills Task analysis Robot Crowdsourcing Artificial intelligence Clinical Competence business computer Natural language processing |
Zdroj: | EMBC |
ISSN: | 2694-0604 |
Popis: | A gold standard in surgical skill rating and evaluation is direct observation, which a group of experts rate trainees based on a likert scale, by observing their performance during a surgical task. This method is time and resource intensive. To alleviate this burden, many studies have focused on automatic surgical skill assessment; however, the metrics suggested by the literature for automatic evaluation do not capture the stylistic behavior of the user. In addition very few studies focus on automatic rating of surgical skills based on available likert scales. In a previous study we presented a stylistic behavior lexicon for surgical skill. In this study we evaluate the lexicon's ability to automatically rate robotic surgical skill, based on the 6 domains in the Global Evaluative Assessment of Robotic Skills (GEARS). 14 subjects of different skill levels performed two surgical tasks on da Vinci surgical simulator. Different measurements were acquired as subjects performed the tasks, including limb (hand and arm) kinematics and joint (shoulder, elbow, wrist) positions. Posture videos of the subjects performing the task, as well as videos of the task being performed were viewed and rated by faculty experts based on the 6 domains in GEARS. The paired videos were also rated via crowd-sourcing based on our stylistic behavior lexicon. Two separate regression learner models, one using the sensor measurements and the other using crowd ratings for our proposed lexicon, were trained for each domain in GEARS. The results indicate that the scores predicted from both prediction models are in agreement with the gold standard faculty ratings. |
Databáze: | OpenAIRE |
Externí odkaz: |