Incremental Domain Generalization with Graph Network for Surgical Scene Understanding

A domain generalized approach on surgical scene graphs to predict instrument-tissue interaction during robot-assisted surgery. We incorporate incremental learning to the feature extraction network and knowledge distillation-based student-teacher learning to the graph network, to accommodate new instruments and domain shifts in the new domain.
We design an enhanced curriculum by smoothing (E-CBS) based on Laplacian of Gaussian kernel and Gaussian kernel, and integrate with feature extraction network and visual-semantic graph attention network to improve the model performance.
Furthermore, we normalize the feature extraction and graph network’s logits by T-Norm and study its effect in calibrating the model.
The proposed SSU is trained on nephrectomy procedures video frames and then domain generalized to transoral robotics surgery video frames.

VS-GATs

To be added

Graph

Preliminary

To be added

Code Overview

In this project, we implement our method using the Pytorch and DGL library and there are three main folders:

Feature_extractor/: Used to extract features from dataset images to train the graph network.
datasets/: Contains the dataset needed to train the network.
model/: Contains network models.
utils/: Contains utility tools used for training and evaluation.
checkpoints/: Conatins trained weights

Library Prerequisities.

DGL

DGL is a Python package dedicated to deep learning on graphs, built atop existing tensor DL frameworks (e.g. Pytorch, MXNet) and simplifying the implementation of graph-based neural networks.

Prerequisites

Python 3.6
Pytorch 1.1.0
DGL 0.3
CUDA 10.0
Ubuntu 16.04

Dataset

Download feature extracted data for training and evalutation

gdrive_link for features To be added
Download the pretrain word2vec model on GoogleNews and put it into datasets/word2vec

Training

model_train.py
Checkpoints will be saved in checkpoints/ folder.

Testing

model_evaluation.py

Acknowledgement

Code adopted and modified from :

Visual-Semantic Graph Attention Network for Human-Object Interaction Detecion
- Paper Visual-Semantic Graph Attention Network for Human-Object Interaction Detecion.
- Official Pytorch implementation code.
End-to-End Incremental Learning
- Paper End-to-End Incremental Learning.
- Pytorch implementation code.
Curriculum by smoothing
- Paper Curriculum by smoothing.
- Pytorch implementation code.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
feature_extractor		feature_extractor
model		model
utils		utils
.gitignore		.gitignore
model_evaluation.py		model_evaluation.py
model_train.py		model_train.py
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Incremental Domain Generalization with Graph Network for Surgical Scene Understanding

VS-GATs

Graph

Preliminary

Code Overview

Library Prerequisities.

DGL

Prerequisites

Dataset

Download feature extracted data for training and evalutation

Training

Testing

Acknowledgement

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

lalithjets/Domain-Generalization-for-Surgical-Scene-Graph

Folders and files

Latest commit

History

Repository files navigation

Incremental Domain Generalization with Graph Network for Surgical Scene Understanding

VS-GATs

Graph

Preliminary

Code Overview

Library Prerequisities.

DGL

Prerequisites

Dataset

Download feature extracted data for training and evalutation

Training

Testing

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages