Scene Classification using Visual Words

This repo represents images by their responses to a bank of conventional filters (Gaussian, Laplacian of Gaussian, etc.). K-Means is run over the filter responses collected from all training images, and each cluster centroid is defined as a visual word.
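The dictionary step described above could be sketched roughly as follows. This is a minimal illustration, not the code in visual_filters.py: the function names (`filter_responses`, `build_dictionary`, `word_map`), the two filter scales, the grayscale input, and the number of sampled pixels are all assumptions made for the example, and SciPy's `kmeans2` stands in for whichever K-Means implementation the repo actually uses.

```python
import numpy as np
from scipy import ndimage
from scipy.cluster.vq import kmeans2


def filter_responses(image, sigmas=(1.0, 2.0)):
    """Stack Gaussian and Laplacian-of-Gaussian responses for every pixel.

    image: 2-D float array (grayscale, an assumption of this sketch).
    Returns an array of shape (H, W, 2 * len(sigmas)).
    """
    responses = []
    for sigma in sigmas:
        responses.append(ndimage.gaussian_filter(image, sigma))
        responses.append(ndimage.gaussian_laplace(image, sigma))
    return np.stack(responses, axis=-1)


def build_dictionary(images, n_words=8, samples_per_image=200, seed=0):
    """Cluster sampled filter responses; each centroid is one visual word."""
    rng = np.random.default_rng(seed)
    sampled = []
    for image in images:
        resp = filter_responses(image)
        feats = resp.reshape(-1, resp.shape[-1])
        # Sample a subset of pixels so K-Means stays cheap (hypothetical choice).
        n = min(samples_per_image, feats.shape[0])
        idx = rng.choice(feats.shape[0], size=n, replace=False)
        sampled.append(feats[idx])
    data = np.vstack(sampled)
    centroids, _ = kmeans2(data, n_words, minit="++")
    return centroids  # shape (n_words, n_filters)


def word_map(image, dictionary):
    """Label each pixel with the index of its nearest visual word."""
    resp = filter_responses(image)
    feats = resp.reshape(-1, resp.shape[-1])
    # Squared Euclidean distance from every pixel to every centroid.
    dists = ((feats[:, None, :] - dictionary[None, :, :]) ** 2).sum(axis=2)
    return dists.argmin(axis=1).reshape(image.shape)
```

Under these assumptions, `word_map(image, build_dictionary(train_images))` yields an integer array with the same height and width as the image, which is the kind of wordmap that the later histogram steps consume.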

The histogram of visual words in an image can then be used as an image descriptor. For robustness, each image is described by a spatial pyramid of such histograms.
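As an illustration of the descriptor, here is a small sketch of a spatial pyramid built from a wordmap. The names `word_histogram` and `spatial_pyramid` and the default of three levels are assumptions for this example and may not match spm.py. All levels are weighted equally here for simplicity; spatial pyramid matching is often implemented with heavier weights on finer levels.

```python
import numpy as np


def word_histogram(wordmap, n_words):
    """L1-normalized count of each visual word in a wordmap region."""
    hist = np.bincount(wordmap.ravel(), minlength=n_words).astype(float)
    total = hist.sum()
    return hist / total if total > 0 else hist


def spatial_pyramid(wordmap, n_words, n_levels=3):
    """Concatenate per-cell histograms over grids of 1x1, 2x2, 4x4, ... cells."""
    features = []
    for level in range(n_levels):
        cells = 2 ** level
        row_blocks = np.array_split(np.arange(wordmap.shape[0]), cells)
        col_blocks = np.array_split(np.arange(wordmap.shape[1]), cells)
        for rows in row_blocks:
            for cols in col_blocks:
                cell = wordmap[np.ix_(rows, cols)]
                features.append(word_histogram(cell, n_words))
    descriptor = np.concatenate(features)
    # Normalize so descriptors of different images are directly comparable.
    return descriptor / descriptor.sum()
```

With `n_levels=3` the descriptor has `n_words * (1 + 4 + 16)` entries: one histogram for the whole image, four for the quadrants, and sixteen for the finest grid.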

Repo Structure:

visual_filters: Extracts filter responses using a pre-defined filter bank. The K-Means centroids form the dictionary of visual words.
spm: Code for the spatial pyramid of histograms used to describe an image.
recognition: Computes a descriptor for each image in the training set. Includes code for the nearest-neighbor classifier used for inference.
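The nearest-neighbor inference step might look like the sketch below. It assumes histogram intersection as the similarity measure, which is a common choice for histogram descriptors but is not confirmed by this README. The function names and the scene labels are hypothetical.

```python
import numpy as np


def histogram_intersection(query, train_features):
    """Similarity between one descriptor and each row of train_features."""
    return np.minimum(query, train_features).sum(axis=1)


def predict(query, train_features, train_labels):
    """Return the label of the most similar training descriptor."""
    sims = histogram_intersection(query, train_features)
    return train_labels[int(np.argmax(sims))]


# Toy usage with made-up three-bin descriptors and hypothetical labels.
train_features = np.array([[0.8, 0.2, 0.0],
                           [0.1, 0.1, 0.8]])
train_labels = np.array(["kitchen", "desert"])
print(predict(np.array([0.7, 0.3, 0.0]), train_features, train_labels))
```

Because the descriptors are normalized histograms, the intersection score lies between 0 and 1, and a larger score means a closer match.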

The experiments are performed on a subset of the SUN dataset.

Visualization of an image (left) and its corresponding wordmap (right)

It is interesting to note that this conventional approach achieves an accuracy of ~73% on the test data.

Notes

A major part of the code was developed for the 16720 Intro to CV course at Carnegie Mellon University. This repo is under development.

To contact the author, feel free to write to [email protected]
