SLIC Superpixels & Image Segmentation

This project was composed of implementing SLIC superpixel from scratch and image segmentation. We have trained two model oneOur model can achieve the accuracy of 84.661% by using multi-resolution for predicting superpixels. .

Dataset

You can download the MSRC V1 dataset here

Superpixels

A superpixel can be defined as a group of pixels that share similar characteristics (such as intensity, or distance). Superpixel algorithms have been widely applied to various tasks like Image Segmentation and Object detection.

In the first part of the project, we implement Kmean and SLIC superpixel algorithms.

Perform k-means on image pixels `(r, g, b, x, y)`

The k-means clustering algorithm is an unsupervised algorithm which, for some items and for some specified number of clusters represented by cluster centers, minimizes the distance between items and their associated cluster centers. It does so by iteratively assigning items to a cluster and recomputing the cluster center based on the assigned items.

We implement the pixel clustering function Kmean_superpixel.py. It takes input an image (shape = (n, m, 3)) and number of clusters. Each pixel should be represented by a vector with 3 values: (r, g, b, x, y)

SLIC superpixel

SLIC (Simple Linear Iterative Clustering) algorithm generates superpixels by clustering pixels based on color similarity and proximity in the image plane. We implement SLIC algorithm SLIC_superpixel.py from scratch and the detail of the algorithm can be found here

Image Segmentation

The goal of the second part is to build a segmentation network, which uses SLIC Superpixels as input. In essense, it will be a classifier for superpixels. The end product is a system which, when given an image, computes superpixels and classifies each superpixel as one of the 14 classes of MSRC v1.

Superpixel Dataset

For the purpose of consistency, we adopt the existing SLIC implementation from the scikit-learn machine learning package.

In the superpixel_dataset.py, for each image :

Get superpixels sp_i for image x. We adopt 100 segments in this assignment, segments = slic(image, n_segments=100, compactness=10).
For every superpixel sp_i in the image
2.1. find the smallest rectangle which can enclose sp_i
2.2. Dilate the rectangle by 3 pixels.
2.3. Get the same region from the segmentation image (from the file with similar name with *_GT). The class for this sp_i is mode of segmentation classes in that same region. Save the dilated region as npy (jpg is lossy for such small patches).

Segmentation Network

Basic: We have applied the VGG pre-trained network and replaced the last few layers with a fully connected layer. The inputs are the superpixels and the outputs are the label of superpixels.
Multi-resolution: To improve the performance of the superpixel prediction, we utilized the multi-resolution technique from [1] which inputs region maps to the separated VGG networks. These features capture information at various scales, ranging from fine details to more global contextual information. The figure below, cited from [1], demonstrates the concept of zoom-out features of the superpixel.

Visualization Results

Model	Test Acc.
VGG-19	72.42%
VGG-19 (Multi-resoulotion)	84.66%

Sample	Ground Truth	Prediction (multi-resolution)

References

Mostajabi, M., Yadollahpour, P., & Shakhnarovich, G. (2015). Feedforward semantic segmentation with zoom-out features. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3376-3385).

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
data		data
images		images
util		util
Kmean_superpixel.py		Kmean_superpixel.py
README.md		README.md
SLIC_superpixel.py		SLIC_superpixel.py
Solver.py		Solver.py
data_loader.py		data_loader.py
model.py		model.py
superpixel_dataset.py		superpixel_dataset.py
train_model.py		train_model.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SLIC Superpixels & Image Segmentation

Dataset

Superpixels

Perform k-means on image pixels `(r, g, b, x, y)`

SLIC superpixel

Image Segmentation

Superpixel Dataset

Segmentation Network

Visualization Results

References

About

Releases

Packages

Languages

ychen921/slic-superpixels-image-segmentation

Folders and files

Latest commit

History

Repository files navigation

SLIC Superpixels & Image Segmentation

Dataset

Superpixels

Perform k-means on image pixels (r, g, b, x, y)

SLIC superpixel

Image Segmentation

Superpixel Dataset

Segmentation Network

Visualization Results

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Perform k-means on image pixels `(r, g, b, x, y)`

Packages