An object detection system that identifies the category and location of the object a (blind) user's hand is pointing at, based on the attention map computed from an image-classification neural network (hence weakly supervised: training images only need a categorical label).
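The core idea is that a trained classifier's attention map doubles as a localization signal. Below is a minimal Grad-CAM-style sketch of computing such a map; PyTorch, the choice of ResNet-18, and the hooked layer are illustrative assumptions, not this repo's actual code:

```python
import torch
import torch.nn.functional as F
from torchvision import models

model = models.resnet18(weights=None).eval()  # any image classifier works

feats = {}
def hook(module, inputs, output):
    feats["act"] = output
    output.retain_grad()  # keep gradients w.r.t. the feature map

model.layer4.register_forward_hook(hook)  # last conv block

def attention_map(image, class_idx):
    """image: (1, 3, H, W) tensor; returns an (H, W) attention map in [0, 1]."""
    logits = model(image)
    model.zero_grad()
    logits[0, class_idx].backward()
    act, grad = feats["act"], feats["act"].grad       # (1, C, h, w)
    weights = grad.mean(dim=(2, 3), keepdim=True)     # per-channel importance
    cam = F.relu((weights * act).sum(dim=1, keepdim=True))
    cam = F.interpolate(cam, size=image.shape[2:], mode="bilinear",
                        align_corners=False)[0, 0]
    return (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
```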
A summary paper is available in summary paper.pdf.
This work builds on Point-to-Tell-and-Touch.
The main programs are under cnn/Guided-Attention-Inference-Network. crop_hands.py crops hands from a video by color masking and saves them to a folder as PNGs, along with metadata such as the fingertip location (see the first sketch below). gt_transfer.py reads the VOC SBD dataset and overlays those hand PNGs on top of objects of a class of interest (see the second sketch below).
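A hedged sketch of the color-masking step in crop_hands.py using OpenCV; the HSV thresholds, file paths, and fingertip heuristic are assumptions, not the script's actual values:

```python
import json
import os
import cv2

os.makedirs("hands", exist_ok=True)
cap = cv2.VideoCapture("hands.mp4")  # hypothetical input video
frame_idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    hsv = cv2.cvtColor(frame, cv2.COLOR_BGR2HSV)
    # Rough skin-tone range in HSV; the real script may target a different color.
    mask = cv2.inRange(hsv, (0, 40, 60), (25, 255, 255))
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_SIMPLE)
    if contours:
        hand = max(contours, key=cv2.contourArea)
        x, y, w, h = cv2.boundingRect(hand)
        # Take the topmost contour point as a crude fingertip estimate.
        tip = [int(v) for v in hand[hand[:, :, 1].argmin()][0]]
        # Save an RGBA crop: the color mask becomes the alpha channel.
        crop = cv2.cvtColor(frame[y:y+h, x:x+w], cv2.COLOR_BGR2BGRA)
        crop[:, :, 3] = mask[y:y+h, x:x+w]
        cv2.imwrite(f"hands/{frame_idx:05d}.png", crop)
        meta = {"fingertip": [tip[0] - x, tip[1] - y]}  # crop-relative coords
        with open(f"hands/{frame_idx:05d}.json", "w") as f:
            json.dump(meta, f)
    frame_idx += 1
cap.release()
```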
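And a sketch of the overlay step in gt_transfer.py: alpha-blending a cropped hand PNG onto an object region of a dataset image. The file paths and placement point are hypothetical; in the actual pipeline the placement would be derived from the SBD segmentation of the class of interest:

```python
import cv2

def overlay_hand(image, hand_rgba, x, y):
    """Paste hand_rgba (H, W, 4) onto a BGR image with its top-left at (x, y)."""
    h, w = hand_rgba.shape[:2]
    roi = image[y:y+h, x:x+w]
    alpha = hand_rgba[:, :, 3:4].astype(float) / 255.0
    blended = alpha * hand_rgba[:, :, :3] + (1.0 - alpha) * roi
    image[y:y+h, x:x+w] = blended.astype(image.dtype)
    return image

scene = cv2.imread("VOCdevkit/VOC2012/JPEGImages/2007_000027.jpg")
hand = cv2.imread("hands/00000.png", cv2.IMREAD_UNCHANGED)  # keeps alpha
out = overlay_hand(scene, hand, x=100, y=150)
cv2.imwrite("composited.png", out)
```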
The machine learning code is based on code by GitHub user @alokwhitewolf and the paper Tell Me Where to Look: Guided Attention Inference Network.
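For context, GAIN's attention-mining idea (per the paper, not necessarily this repo's exact implementation) erases the regions the network attends to and penalizes it if the erased image still scores high for the true class. A minimal sketch, with the soft-threshold parameters assumed:

```python
import torch

def attention_mining_loss(model, image, class_idx, cam, sigma=0.5, omega=10.0):
    """image: (1, 3, H, W); cam: (H, W) attention map in [0, 1] for the true class."""
    # Soft threshold of the attention map: a smooth step around sigma.
    mask = torch.sigmoid(omega * (cam - sigma))
    masked_image = image * (1.0 - mask)  # erase the attended regions
    logits = model(masked_image)
    # The true class should no longer be recognizable from what remains.
    return torch.sigmoid(logits)[0, class_idx]
```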