Generation of histopatological images with Pix2pix GAN

This project deals with training a Pix2pix GAN for generating new histopathological images starting from semantic segmentation mask. From left to right: mask, real image, synthetic image.

Abstract

Colorectal cancer (CRC) is a type of cancer that begins in the large intestine (colon), the final part of the digestive tract. It typically affects older individuals, though it can occur at any age. It usually begins as small, noncancerous (benign) clumps of cells called polyps that form on the inside of the colon. Over time some of these polyps can become colon cancers. There are various parameters to determine the malignant potential of polyps that pathologists analyze: the type of polyps, their size and the degree of dysplasia. In this scope, a proper screening can help to find malignant polyps at an early stage, preventing their transformation into cancer.

In this work we study UniToPatho, a dataset of annotated high-resolution colorectal images, comprising different histological samples of colorectal polyps, collected from patients undergoing cancer screening.

Deep Learning based systems can help doctors in the delicate task of detecting and diagnosing the different types of colorectal polyps and their associated risk. In fact, Deep Learning techniques are able to get an extraordinary accuracy in medical pattern recognition, however they require large sets of annotated training images. So, the main goal of this work is to do data augmentation on UniToPatho, so that with a larger dataset it is expected that Deep Learning algorithms are more likely to get a higher accuracy. In particular, this work dealt with doing data augmentation by producing new samples of histopathological tissue starting from the semantic segmentation masks, using a particular kind of Generative Adversarial Network (GAN): a Pix2pix GAN. The idea is that if we generate new samples starting from the segmentation masks, we can produce highly precise, detailed and manageable outputs.

Overview of Code

dataset directory includes some dataset implemented in Pytorch, used to train the GAN.

pannuke.py consists of the colon images taken from PanNuke dataset. You can download the dataset from:
- here: the dataset split is 0.9 training and 0.1 test, or
- here: the dataset split is 0.7 training and 0.3 test.
unitopatho.py is an implementation of UniToPatho, which was taken from here.
unitopatho_mask.py inherits from unitopatho.py and adds the feature of the masks to samples of UniToPatho.

train_utils.py contains some general-purpose training methods used to train the GAN. For example it contains the method to train the generator and the discriminator for an epoch, or the methods to do wandb stuffs.

In order to train a GAN, you can launch the following scripts:

train_pannuke.py: train the GAN on PanNuke.
train_pannuke_ddp.py: train the GAN on PanNuke with multi-GPU training.
train_utp.py: train the GAN on UniToPatho.
train_utp_ddp.py: train the GAN on UniToPatho with multi-GPU training.

config.py contains the hyperparameters relative to the training and some other parameters.

utils.py contains some general-purpose methods.

generator_model.py and discriminator_model.py are the implementations for the generator and discriminator architectures used for our trainings

test.py is a file which you can launch for generating the synthetic images of the test set of UniToPatho with a specific model which can be loaded from wandb.

Special thanks

Special thanks goes to Aladdin Persson and his Github repo, from which we took some of his code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Generation of histopatological images with Pix2pix GAN

Abstract

Overview of Code

Special thanks

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
dataset		dataset
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
Thesis.pdf		Thesis.pdf
config.py		config.py
discriminator_model.py		discriminator_model.py
generator_model.py		generator_model.py
pod-train.yaml		pod-train.yaml
requirements.txt		requirements.txt
result_unitopatho.png		result_unitopatho.png
run.bat		run.bat
test.py		test.py
train_pannuke.py		train_pannuke.py
train_pannuke_ddp.py		train_pannuke_ddp.py
train_utils.py		train_utils.py
train_utp.py		train_utp.py
train_utp_ddp.py		train_utp_ddp.py
utils.py		utils.py

davideruby/HistopathoPix2pixGan

Folders and files

Latest commit

History

Repository files navigation

Generation of histopatological images with Pix2pix GAN

Abstract

Overview of Code

Special thanks

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages