cityscapes-segmentation

This repo explores different self-supervised pretext task for semantic segmentation on cityscapes dataset (original resolution 1024x2048).

Pretext tasks we are implementing(using 15,000 frames of video data from cityscapes dataset):

Our target tasks include semantic segmentation and future frame prediction.

For the target task of semantic segmentation, we evaluate a UNet and a Deeplabv3 model on the cityscapes dataset(5000 examples with fine annotations).
For the task of future frame prediction, we evaluate an encoder-decoder temporal network on the cityscapes dataset.

/unet: code for running UNet for semantic segmentation (work based on Pytorch-UNet)
/DeepLabv3: code for running DeepLabv3 for semantic segmentation (work based on DeepLabv3.pytorch)
/triplet: pretext task to generate embeddings for video frames using triplet loss (work based on uzkent/MMVideoPredictor)
/spatioTemporal: pretext task doing video frame order prediction (work based on uzkent/MMVideoPredictor)
/colorization: pretext task doing video frame colorization
/MMVideoPredictor: future frame generation using custom temporal network (work based on uzkent/MMVideoPredictor)

Model	Setup	mIoU (acc)
UNet (lr=0.001, ReduceLROnPlateau, RMSprop(weight_decay=1e-8, momentum=0.9), CrossEntropyLoss)	downsample 2x, bs=1, 30 epochs	0.5153 (0.8653)
UNet(...)	downsample 4x, bs=8, 30 epochs	0.4613 (0.85)
UNet(...)	downsample 8x, bs=64, 30 epochs	0.45 (0.85)
DeepLabv3(resnet101)	ImagetNet pretrained, 10 epochs	59.31
DeepLabv3(resnet101)	scratch, 10 epochs	27.24
DeepLabv3(resnet101)	scratch, 100 epochs	49.79

Name		Name	Last commit message	Last commit date
Latest commit History 197 Commits
DeepLabv3		DeepLabv3
MMVideoPredictor		MMVideoPredictor
colorization		colorization
spatioTemporal		spatioTemporal
triplet		triplet
unet		unet
visualization		visualization
.gitignore		.gitignore
README.md		README.md
environment_mac.yml		environment_mac.yml
generate_filelist.py		generate_filelist.py
video_test_filelist.csv		video_test_filelist.csv
video_train_filelist.csv		video_train_filelist.csv
video_val_filelist.csv		video_val_filelist.csv

Provide feedback