Pytorch code for SBRT 2017 paper Foreground Segmentation for Anomaly Detection in Surveillance Videos Using Deep Residual Networks available here
The aim of this work (under deepeye folder) is to detect and segment anomalies in a target video given a temporally aligned reference video (anomaly-free). The output segmentation map has the same resolution as the input video frame.
For our experiments, we used CDNET database. A database for identification of changing or moving areas in the field of view of a camera, covering a wide range of detection challenges and are representative of typical indoor and outdoor visual data captured today in surveillance:
- Dynamic background
- Camera jitter
- Intermittent object motion
- Shadows
- Thermal signatures
- Challenging weather
- Low frame-rate
- Acquisition at night
- PTZ capture
- Air turbulence
In this preliminary work, instead of a entire reference video, we use a single still reference frame by taking the median of each pixel throughout the first 150 frames of the considered target video. Although not ideal, this does not have much influence since videos in CDNET are recorded with a stationary camera (except for the PTZ class, for which the algorithm's performance naturally is worse). It is worth emphasizing that our algorithm allows the more general setting of using a whole video (with egomotion) for reference, and not a single still image, which is compared frame per frame with the target video.
The idea is now use it on the VDAO, a video database containing annotated videos in a cluttered industrial environment, in which the videos were captured using a camera on a moving platform. You can have a bunch of useful tools to play with VDAO database in the VDAO_Access Project.
Once you have installed Python you can just prompt:
$ cd data; python download.py
This script allows training all models using a command-line interface. The call should be something like:
$ main.py --manifest TRAIN --img_path DIR --arch ARCH train \
--epochs N --lr LR
Example of call which instantiates and trains a 20-layer ResNet with reconstruction by bilinear upsampling:
python main.py --img-dir ~/Documents/database/cdnet2014/dataset --shape 2,192,256 --arch resnet20 --arch-params 'up_mode=upsample' --manifest data/manifest.train --loss bce -b 16 train --epochs 90 --aug --lr 0.01 --wd 0.0002 --val data/manifest.val --save models/resnet20-bilinear.pth.tar
For more details, you may prompt
$ main.py --help
or just check out main.py.
This script will automatically save the model at every epoch.
Evaluating a trained model can be done by simply
$ main.py --manifest EVAL --img_path DIR --arch ARCH \
--load PATH eval
All models are defined by a class defined in the models package. A custom model can be defined as
# Filename: codes/models/customnet.py
# this line is necessary
__all__ = ['CustomNet', 'customnet']
class CustomNet(nn.Module):
def __init__(self):
super(CustomNet, self).__init__()
...
def forward(self, x):
...
return out
# This method is required by the main script
def customnet(**kwargs):
...
return CustomNet(**kwargs)
To make the CustomNet
visible in main.py
, we have to append the following code to init script
# Filename: codes/models/__init__.py
from .customnet import *
All callbacks must inherit Callback and can optionally implement one of 8 calls. The default cycle is:
- on_begin
- on_epoch_begin
- on_step_begin
- on_batch_begin
- on_batch_end
- on_step_end
- on_epoch_end
- on_end
A simple custom callback that prints at the beginning and at the end of each epoch is given:
class CustomCallback(object):
def on_epoch_begin(self, epoch):
print('epoch begin')
def on_epoch_end(self, metrics):
print('epoch end')
- Python
- 7za
- zip
- pytorch
- torchvision
- numpy
- pandas
- matplotlib
- pillow
- glob2
- inflection
- tqdm
- visdom
If you use this code in your research, please use the following BibTeX entry.
@inproceedings{cinelli2017,
title = {Foreground Segmentation for Anomaly Detection in Surveillance Videos Using Deep Residual Networks},
author = {Cinelli, Lucas P and Thomaz, Lucas A and da Silva, Allan F and da Silva, Eduardo AB and Netto, Sergio L},
booktitle = {Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBRT)},
month = September,
year = {2017}
}
THe download script, main.py structure, parts of readme, callbacks, and many others were done by Igor Macedo Quintanilha, a good friend and colleague.
See LICENSE.md