gcn_clustering

Linkage-based object matching using Graph Convolution Network. One of the advantages of using graph-based approach is that it can make prediction by taking into account of neighbors' information. In this task, it is to predict the relationship between cap, jockey body and saddlecloth bboxes (i.e. which of them belong to the same jockey). Most of the time, these 3 objects are highly overlapped. Given the condition that each jockey can at most have one cap, one body and one saddlecloth, it is possible to group them correctly by looking at neighbors' group (e.g. Given 2 caps placing close to a jockey, if one of them is taken by the neighbor, then we know the other remaining cap should probably be the correct one.). The experimental results show the robustness of this graph-based approach in this challenging object grouping/clustering task.

Problem formulation

The idea from the CVPR'19 paper Linkage-based Face Clustering via GCN is adopted for the task. With my own further modification, the approach is changed to suit of the Multi-object clustering task in this repo. My modification and contribution are as follows:

Train a triplet model specifically for Multi-object clustering

Instead of using pre-trained ImageNet models, I train a triplet model from scratch for encoding the graph information of the data. It allows the GCN to learn the Multi-object clustering more easily as the feature of graph nodes is already providing basic information for grouping (i.e. nodes that are in different groups should have a higher distance while lower for the same group.)
Extend the prediction pipeline to batch Multi-object clustering

In prediction, it needs to run through triplet model and GCN. There are 2 data processings inside and it is time-consuming to predict a bunch of images with one image at a time. In order to speed up the process, parallelizing data processings between each prediction is implemented. The handling of multiple objects in a multiple images is quite complicated but time-efficient.

Usage

Prepare data for triplet model

Run prepare_triplet_data.ipynb

Train triplet model on the triplet data

Run triplet_train.ipynb

Use the best triplet weight to extract graph data

Run extract_graph_data.ipynb

Train the GCN

python train.py

See the prediction result

Run Prediction.ipynb

Visualization

In the predictions below, the model looks so robust that it clusters the objects accurately even though the bounding boxes of the objects are highly overlapped. Some of the group colours are close, but if downloading the image and zoom in, the colour difference can be seen.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
model		model
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
Prediction.ipynb		Prediction.ipynb
README.md		README.md
extract_graph_data.ipynb		extract_graph_data.ipynb
feeder.py		feeder.py
prepare_triplet_data.ipynb		prepare_triplet_data.ipynb
train.py		train.py
triplet_train.ipynb		triplet_train.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

gcn_clustering

Problem formulation

Usage

Visualization

About

Releases

Packages

Languages

License

jaysonph/gcn-multi-object-clustering

Folders and files

Latest commit

History

Repository files navigation

gcn_clustering

Problem formulation

Usage

Visualization

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages