init

peteryuX · Mar 16, 2020 · 69f2512 · 69f2512
commit 69f2512
Show file tree

Hide file tree

Showing 26 changed files with 1,935 additions and 0 deletions.
diff --git a/.gitattributes b/.gitattributes
@@ -0,0 +1,2 @@
+* linguist-vendored
+*.py linguist-vendored=false
diff --git a/.gitignore b/.gitignore
@@ -0,0 +1,15 @@
+.vscode/*
+checkpoints/*
+logs/*
+**/__pycache__/*
+**/*.tfrecord
+**/*.zip
+data/*/**/*.png
+data/*/**/*.jpg
+data/widerface/**/*.txt
+widerface_evaluate/*
+results/*
+normal
+normal.pdf
+reduction
+reduction.pdf
diff --git a/LICENSE b/LICENSE
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2020 Kuan-Yu Huang
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
diff --git a/README.md b/README.md
@@ -0,0 +1,218 @@
+# [pcdarts-tf2](https://github.com/peteryuX/pcdarts-tf2)
+
+![Star](https://img.shields.io/github/stars/peteryuX/pcdarts-tf2)
+![Fork](https://img.shields.io/github/forks/peteryuX/pcdarts-tf2)
+![License](https://img.shields.io/github/license/peteryuX/pcdarts-tf2)
+
+:fire: PC-DARTS (PC-DARTS: Partial Channel Connections for Memory-Efficient Differentiable Architecture Search, published in ICLR 2020) implemented in Tensorflow 2.0+. This is an unofficial implementation. :fire:
+
+> PC-DARTS is a memory efficient differentiable architecture search method, which can be trained with a larger batch size and, consequently, enjoys both faster speed and higher training stability. Experimental results achieve an error rate of **2.57%** on CIFAR10 with merely **0.1 GPU-days** for architecture search.
+
+Original Paper: &nbsp; [Arxiv](https://arxiv.org/abs/1907.05737) &nbsp; [OpenReview](https://openreview.net/forum?id=BJlS634tPr)
+
+Offical Implementation: &nbsp; [PyTorch](https://github.com/yuhuixu1993/PC-DARTS)
+
+<p align="center">
+    <img src="photo/architecture.jpg" width="75%">
+</p>
+
+****
+
+## Contents
+:bookmark_tabs:
+
+* [Installation](#Installation)
+* [Usage](#Training-and-Testing)
+* [Benchmark](#Benchmark)
+* [Models](#Models)
+* [References](#References)
+
+***
+
+## Installation
+:pizza:
+
+Create a new python virtual environment by [Anaconda](https://www.anaconda.com/) or just use pip in your python environment and then clone this repository as following.
+
+### Clone this repo
+```bash
+git clone https://github.com/peteryuX/pcdarts-tf2.git
+cd pcdarts-tf2
+```
+
+### Conda
+```bash
+conda env create -f environment.yml
+conda activate pcdarts-tf2
+```
+
+### Pip
+
+```bash
+pip install -r requirements.txt
+```
+
+****
+
+## Usage
+:lollipop:
+
+### Config File
+You can modify your own dataset path or other settings of model in [./configs/*.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs) for training and testing, which would like below.
+
+```python
+# general setting
+batch_size: 128
+input_size: 32
+init_channels: 36
+layers: 20
+num_classes: 10
+auxiliary_weight: 0.4
+drop_path_prob: 0.3
+arch: PCDARTS
+sub_name: 'pcdarts_cifar10'
+using_normalize: True
+
+# training dataset
+dataset_len: 50000  # number of training samples
+using_crop: True
+using_flip: True
+using_cutout: True
+cutout_length: 16
+
+# training setting
+epoch: 600
+init_lr: 0.025
+lr_min: 0.0
+momentum: 0.9
+weights_decay: !!float 3e-4
+grad_clip: 5.0
+
+val_steps: 1000
+save_steps: 1000
+```
+
+Note:
+- The `sub_name` is the name of outputs directory used in checkpoints and logs folder. (make sure of setting it unique to other models)
+- The `save_steps` is the number interval steps of saving checkpoint file.
+- The [./configs/pcdarts_cifar10_search.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs/pcdarts_cifar10_search.yaml) and [./configs/pcdarts_cifar10.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs/pcdarts_cifar10.yaml) are used by [train_search.py](https://github.com/peteryuX/pcdarts-tf2/tree/master/train_search.py) and [train.py](https://github.com/peteryuX/pcdarts-tf2/tree/master/train.py) respectively, which have different settings for small proxy model training(architecture searching) and full-size model training. Please make sure you use the correct config file in related script. (The example yaml script above is [./configs/pcdarts_cifar10.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs/pcdarts_cifar10.yaml).)
+
+### Architecture Searching on CIFAR-10 (using small proxy model)
+
+**Step1**: Search cell architecture on CIFAR-10 using small proxy model.
+
+```bash
+python train_search.py --cfg_path="./configs/pcdarts_cifar10_search.yaml" --gpu=0
+```
+
+Note:
+- The `--gpu` is used to choose the id of your avaliable GPU devices with `CUDA_VISIBLE_DEVICES` system varaible.
+- You can visualize the training status on tensorboard by running "`tensorboard --logdir=./logs/`"
+- You can visualize the learning rate scheduling by running "`python ./modules/lr_scheduler.py`".
+- You can visualize the dataset augmantation by running "`python ./dataset_checker.py`".
+
+**Step2**: After the searching completed, you can find the result genotypes in `./logs/{sub_name}/search_arch_genotype.py`. Open it and copy the latest genotype into the [./modules/genotypes.py](https://github.com/peteryuX/pcdarts-tf2/tree/master/modules/genotypes.py), which will be used for further training later. The genotype like bellow:
+
+```python
+TheNameYouWantToCall = Genotype(
+    normal=[
+        ('sep_conv_3x3', 1),
+        ('skip_connect', 0),
+        ('sep_conv_3x3', 0),
+        ('dil_conv_3x3', 1),
+        ('sep_conv_5x5', 0),
+        ('sep_conv_3x3', 1),
+        ('avg_pool_3x3', 0),
+        ('dil_conv_3x3', 1)],
+    normal_concat=range(2, 6),
+    reduce=[
+        ('sep_conv_5x5', 1),
+        ('max_pool_3x3', 0),
+        ('sep_conv_5x5', 1),
+        ('sep_conv_5x5', 2),
+        ('sep_conv_3x3', 0),
+        ('sep_conv_3x3', 3),
+        ('sep_conv_3x3', 1),
+        ('sep_conv_3x3', 2)],
+    reduce_concat=range(2, 6))
+```
+
+Note:
+- You can visualize the genotype by running "`python ./visualize_genotype.py TheNameYouWantToCall`".
+<p align="center">
+    <img src="photo/genotype_normal.jpg" width="20%">
+    <img src="photo/genotype_reduction.jpg" width="70%">
+</p>
+
+### Training on CIFAR-10 (using full-sized model)
+
+**Step1**: Make sure that you already modifed the flag `arch` in [./configs/pcdarts_cifar10.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs/pcdarts_cifar10.yaml) to match the genotype you want to use in [./modules/genotypes.py](https://github.com/peteryuX/pcdarts-tf2/tree/master/modules/genotypes.py). 
+
+Note:
+- The default flag `arch` (`PCDARTS`) is the genotype proposed by official paper. You can train this model by yourself, or use dowload it from [BenchmarkModels](#Models).
+
+**Step2**: Train the full-sized model on CIFAR-10 with specific genotype.
+
+```bash
+python train.py --cfg_path="./configs/pcdarts_cifar10.yaml" --gpu=0
+```
+
+### Testing on CIFAR-10 (using full-sized model)
+
+To evaluate the full-sized model with the corresponding cfg file on the testing dataset. You can also download my trained model for testing from [Models](#Models) without training it yourself, which default `arch` (`PCDARTS`) is the best cell proposed in paper.
+
+```bash
+python test.py --cfg_path="./configs/pcdarts_cifar10.yaml" --gpu=0
+```
+
+****
+
+## Benchmark
+:coffee:
+
+### Results on CIFAR-10
+| Method | Search Method | Params(M) | Test Error(%)| Search-Cost(GPU-days) |
+| ------ | ------------- | --------- | ------------ | --------------------- |
+| [NASNet-A](https://arxiv.org/abs/1611.01578) | RL | 3.3 | 2.65 | 1800 |
+| [AmoebaNet-B](https://arxiv.org/abs/1802.01548) | Evolution | 2.8 | 2.55 | 3150 |
+| [ENAS](https://arxiv.org/abs/1802.03268) | RL | 4.6 | 2.89 | 0.5 |
+| [DARTSV1](https://arxiv.org/abs/1806.09055) | gradient-based | 3.3 | 3.00 | 0.4 |
+| [DARTSV2](https://arxiv.org/abs/1806.09055) | gradient-based | 3.3 | 2.76 | 1.0 |
+| [SNAS](https://arxiv.org/abs/1812.09926)    | gradient-based | 2.8 | 2.85 | 1.5 |
+| [PC-DARTS](https://github.com/yuhuixu1993/PC-DARTS) (official PyTorch version) | gradient-based | 3.63 | **2.57** | **0.1** |
+| PC-DARTS TF2 (paper architecture) | gradient-based | 3.63 | 2.73 | - |
+| PC-DARTS TF2 (searched by myself) | gradient-based | 3.56 | 2.88 | 0.12 |
+
+Note:
+- Above results are referenced from [official repository](https://github.com/yuhuixu1993/PC-DARTS) and [orignal paper](https://arxiv.org/abs/1907.05737).
+- There still have a slight performance gap between my PC-DARTS TF2 and official version. In both cases, we used Nvidia 1080ti (11G memory). My PC-DARTS TF2 pre-trained model can be found in [Models](#Models).
+- If you get unsatisfactory results with the archecture searched by yourself, you might try to search it more than one time. (see the discussions [here](https://github.com/yuhuixu1993/PC-DARTS/issues/7))
+
+****
+
+## Models
+:doughnut:
+
+Dowload these models bellow, then extract them into `./checkpoints/` for restoring.
+
+| Model Name          | Config File | `arch` | Download Link |
+|---------------------|-------------|--------|---------------|
+| PC-DARTS (CIFAR-10, paper architecture) | [pcdarts_cifar10.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master//configs/pcdarts_cifar10.yaml) | `PCDARTS` | [GoogleDrive](https://drive.google.com/file/d/1BhLlktX78z90yOaORXvch_GAnIWWkYrX/view?usp=sharing) |
+| PC-DARTS (CIFAR-10, searched by myself) | [pcdarts_cifar10_TF2.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master//configs/pcdarts_cifar10_TF2.yaml) | `PCDARTS_TF2_SEARCH` | [GoogleDrive](https://drive.google.com/file/d/1UgeZzEnQZ6oeMKpr01rEDflVi9X1dzeq/view?usp=sharing) |
+
+Note:
+- You can find the training settings of the models in the corresponding [./configs/*.yaml](https://github.com/peteryuX/pcdarts-tf2/tree/master/configs) files, and make sure that the `arch` flag in it is matched with the genotypes name in [./modules/genotypes.py](https://github.com/peteryuX/pcdarts-tf2/tree/master/modules/genotypes.py).
+
+****
+
+## References
+:hamburger:
+
+Thanks for these source codes porviding me with knowledges to complete this repository.
+
+- https://github.com/yuhuixu1993/PC-DARTS (Official)
+    - PC-DARTS:Partial Channel Connections for Memory-Efficient Differentiable Architecture Search
+- https://github.com/quark0/darts
+    - Differentiable architecture search for convolutional and recurrent networks https://arxiv.org/abs/1806.09055
+- https://github.com/zzh8829/yolov3-tf2
+    - YoloV3 Implemented in TensorFlow 2.0
diff --git a/configs/pcdarts_cifar10.yaml b/configs/pcdarts_cifar10.yaml
@@ -0,0 +1,30 @@
+# general setting
+batch_size: 96
+val_batch_size: 512
+input_size: 32
+init_channels: 36
+layers: 20
+num_classes: 10
+auxiliary_weight: 0.4
+drop_path_prob: 0.3
+arch: PCDARTS
+sub_name: 'pcdarts_cifar10'
+using_normalize: True
+
+# training dataset
+dataset_len: 50000  # number of training samples
+using_crop: True
+using_flip: True
+using_cutout: True
+cutout_length: 16
+
+# training setting
+epoch: 600
+init_lr: 0.025
+lr_min: 0.0
+momentum: 0.9
+weights_decay: !!float 3e-4
+grad_clip: 5.0
+
+val_steps: 520
+save_steps: 520
diff --git a/configs/pcdarts_cifar10_TF2.yaml b/configs/pcdarts_cifar10_TF2.yaml
@@ -0,0 +1,30 @@
+# general setting
+batch_size: 96
+val_batch_size: 512
+input_size: 32
+init_channels: 36
+layers: 20
+num_classes: 10
+auxiliary_weight: 0.4
+drop_path_prob: 0.3
+arch: PCDARTS_TF2_SEARCH
+sub_name: 'pcdarts_cifar10_tf2_arch'
+using_normalize: True
+
+# training dataset
+dataset_len: 50000  # number of training samples
+using_crop: True
+using_flip: True
+using_cutout: True
+cutout_length: 16
+
+# training setting
+epoch: 600
+init_lr: 0.025
+lr_min: 0.0
+momentum: 0.9
+weights_decay: !!float 3e-4
+grad_clip: 5.0
+
+val_steps: 520
+save_steps: 520
diff --git a/configs/pcdarts_cifar10_search.yaml b/configs/pcdarts_cifar10_search.yaml
@@ -0,0 +1,30 @@
+# general setting
+batch_size: 256
+input_size: 32
+init_channels: 16
+layers: 8
+num_classes: 10
+sub_name: 'pcdarts_cifar10_search'
+using_normalize: True
+
+# training dataset
+dataset_len: 50000  # number of training samples
+train_portion: 0.5
+using_crop: True
+using_flip: True
+using_cutout: False
+cutout_length: 16
+
+# training setting
+epoch: 50
+start_search_epoch: 15
+init_lr: 0.1
+lr_min: 0.0
+momentum: 0.9
+weights_decay: !!float 3e-4
+grad_clip: 5.0
+
+arch_learning_rate: !!float 6e-4
+arch_weight_decay: !!float 1e-3
+
+save_steps: 97
diff --git a/dataset_checker.py b/dataset_checker.py
@@ -0,0 +1,20 @@
+import cv2
+import numpy as np
+import tensorflow as tf
+import tensorflow_datasets as tfds
+from modules.utils import set_memory_growth
+from modules.dataset import load_cifar10_dataset
+
+
+set_memory_growth()
+
+dataset = load_cifar10_dataset(batch_size=1, split='train', shuffle=False,
+                               using_normalize=False)
+
+for (img, labels)in dataset:
+    img = img.numpy()[0]
+    print(img.shape, labels.shape, labels.numpy())
+
+    cv2.imshow('img', cv2.cvtColor(img, cv2.COLOR_RGB2BGR))
+    if cv2.waitKey(0) == ord('q'):
+        exit()
diff --git a/environment.yml b/environment.yml
@@ -0,0 +1,15 @@
+name: pcdarts-tf2
+channels:
+  - conda-forge
+
+dependencies:
+  - python==3.7
+  - pip
+  - pip:
+    - tensorflow-gpu==2.1.0
+    - tensorflow_datasets
+    - numpy
+    - opencv-python
+    - PyYAML
+    - ipython
+    - python-graphviz
diff --git a/modules/__init__.py b/modules/__init__.py
Original file line number	Diff line number	Diff line change
		@@ -0,0 +1,2 @@
		* linguist-vendored
		*.py linguist-vendored=false