diff --git a/README.md b/README.md
index 09f88e7e7a..629db7428e 100644
--- a/README.md
+++ b/README.md
@@ -48,7 +48,7 @@ Welcome to PaddleSeg! PaddleSeg is an end-to-end image segmentation development
 - [x] Pascal VOC
 - [x] ADE20K
 - [x] Pascal Context
-- [ ] COCO stuff
+- [x] COCO stuff
 
 ## Installation
 
diff --git a/README_CN.md b/README_CN.md
index 8b95895b4e..60eecba13c 100644
--- a/README_CN.md
+++ b/README_CN.md
@@ -48,7 +48,7 @@ PaddleSeg是基于飞桨[PaddlePaddle](https://www.paddlepaddle.org.cn)开发的
 - [x] Pascal VOC
 - [x] ADE20K
 - [x] Pascal Context
-- [ ] COCO stuff
+- [x] COCO stuff
 
 ## 安装
 
diff --git a/configs/_base_/coco_stuff.yml b/configs/_base_/coco_stuff.yml
new file mode 100644
index 0000000000..d57fbbf22e
--- /dev/null
+++ b/configs/_base_/coco_stuff.yml
@@ -0,0 +1,45 @@
+batch_size: 4
+iters: 80000
+
+train_dataset:
+  type: CocoStuff
+  dataset_root: data/cocostuff/
+  transforms:
+    - type: ResizeStepScaling
+      min_scale_factor: 0.5
+      max_scale_factor: 2.0
+      scale_step_size: 0.25
+    - type: RandomPaddingCrop
+      crop_size: [520, 520]
+    - type: RandomHorizontalFlip
+    - type: RandomDistort
+      brightness_range: 0.4
+      contrast_range: 0.4
+      saturation_range: 0.4
+    - type: Normalize
+  mode: train
+
+val_dataset:
+  type: CocoStuff
+  dataset_root: data/cocostuff/
+  transforms:
+    - type: Normalize
+  mode: val
+
+
+optimizer:
+  type: sgd
+  momentum: 0.9
+  weight_decay: 4.0e-5
+
+learning_rate:
+  value: 0.01
+  decay:
+    type: poly
+    power: 0.9
+    end_lr: 0.0
+
+loss:
+  types:
+    - type: CrossEntropyLoss
+  coef: [1]
diff --git a/docs/data_prepare.md b/docs/data_prepare.md
index c641a59e74..b07a548194 100644
--- a/docs/data_prepare.md
+++ b/docs/data_prepare.md
@@ -1,124 +1,142 @@
-# 数据集准备
-
-PaddleSeg目前支持CityScapes、ADE20K、Pascal VOC等数据集的加载，在加载数据集时，如若本地不存在对应数据，则会自动触发下载(除Cityscapes数据集).
-
-## 关于CityScapes数据集
-Cityscapes是关于城市街道场景的语义理解图片数据集。它主要包含来自50个不同城市的街道场景，
-拥有5000张（2048 x 1024）城市驾驶场景的高质量像素级注释图像，包含19个类别。其中训练集2975张， 验证集500张和测试集1525张。
-
-由于协议限制，请自行前往[CityScapes官网](https://www.cityscapes-dataset.com/)下载数据集，
-我们建议您将数据集存放于`PaddleSeg/data`中，以便与我们配置文件完全兼容。数据集下载后请组织成如下结构：
-
-    cityscapes
-    |
-    |--leftImg8bit
-    |  |--train
-    |  |--val
-    |  |--test
-    |
-    |--gtFine
-    |  |--train
-    |  |--val
-    |  |--test
-
-运行下列命令进行标签转换：
-```shell
-pip install cityscapesscripts
-python tools/convert_cityscapes.py --cityscapes_path data/cityscapes --num_workers 8
-```
-其中`cityscapes_path`应根据实际数据集路径进行调整。 `num_workers`决定启动的进程数，可根据实际情况进行调整大小。
-
-## 关于Pascal VOC 2012数据集
-[Pascal VOC 2012](http://host.robots.ox.ac.uk/pascal/VOC/)数据集以对象分割为主，包含20个类别和背景类，其中训练集1464张，验证集1449张。
-通常情况下会利用[SBD(Semantic Boundaries Dataset)](http://home.bharathh.info/pubs/codes/SBD/download.html)进行扩充，扩充后训练集10582张。
-运行下列命令进行SBD数据集下载并进行扩充：
-```shell
-python tools/voc_augment.py --voc_path data/VOCdevkit --num_workers 8
-```
-其中`voc_path`应根据实际数据集路径进行调整。
-
-**注意** 运行前请确保在PaddleSeg目录下执行过下列命令：
-```shell
-export PYTHONPATH=`pwd`
-# windows下请执行相面的命令
-# set PYTHONPATH=%cd%
-```
-
-## 关于ADE20K数据集
-[ADE20K](http://sceneparsing.csail.mit.edu/)由MIT发布的可用于场景感知、分割和多物体识别等多种任务的数据集。
-其涵盖了150个语义类别，包括训练集20210张，验证集2000张。
-
-## 关于Pascal Context数据集
-Pascal Context是基于PASCAL VOC 2010数据集额外标注的像素级别的语义分割数据集。我们提供的转换脚本支持59个类别，其中训练集4996, 验证集5104张.
-
-
-在使用Pascal Context数据集前， 请先下载[VOC2010](http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar)，随后自行前往[Pascal-Context主页](https://www.cs.stanford.edu/~roozbeh/pascal-context/)下载数据集及[标注](https://codalabuser.blob.core.windows.net/public/trainval_merged.json)
-我们建议您将数据集存放于`PaddleSeg/data`中，以便与我们配置文件完全兼容。数据集下载后请组织成如下结构：
-    
-    VOC2010
-    |
-    |--Annotations
-    |
-    |--ImageSets
-    |
-    |--SegmentationClass
-    |  
-    |--JPEGImages
-    |
-    |--SegmentationObject
-    |
-    |--trainval_merged.json
-    
-其中，标注图像的标签从1，2依次取值，不可间隔。若有需要忽略的像素，则按0进行标注。在使用Pascal Context数据集时，需要安装[Detail](https://github.com/zhanghang1989/detail-api).
-
-
-## 自定义数据集
-
-如果您需要使用自定义数据集进行训练，请按照以下步骤准备数据.
-
-1.推荐整理成如下结构
-
-    custom_dataset
-        |
-        |--images
-        |  |--image1.jpg
-        |  |--image2.jpg
-        |  |--...
-        |
-        |--labels
-        |  |--label1.jpg
-        |  |--label2.png
-        |  |--...
-        |
-        |--train.txt
-        |
-        |--val.txt
-        |
-        |--test.txt
-
-其中train.txt和val.txt的内容如下所示：
-
-    images/image1.jpg labels/label1.png
-    images/image2.jpg labels/label2.png
-    ...
-
-2.标注图像的标签从0,1依次取值，不可间隔。若有需要忽略的像素，则按255进行标注。
-
-可按如下方式对自定义数据集进行配置：
-```yaml
-train_dataset:
-  type: Dataset
-  dataset_root: custom_dataset
-  train_path: custom_dataset/train.txt
-  num_classes: 2
-  transforms:
-    - type: ResizeStepScaling
-      min_scale_factor: 0.5
-      max_scale_factor: 2.0
-      scale_step_size: 0.25
-    - type: RandomPaddingCrop
-      crop_size: [512, 512]
-    - type: RandomHorizontalFlip
-    - type: Normalize
-  mode: train
-```
+# 数据集准备
+
+PaddleSeg目前支持CityScapes、ADE20K、Pascal VOC等数据集的加载，在加载数据集时，如若本地不存在对应数据，则会自动触发下载(除Cityscapes数据集).
+
+## 关于CityScapes数据集
+Cityscapes是关于城市街道场景的语义理解图片数据集。它主要包含来自50个不同城市的街道场景，
+拥有5000张（2048 x 1024）城市驾驶场景的高质量像素级注释图像，包含19个类别。其中训练集2975张， 验证集500张和测试集1525张。
+
+由于协议限制，请自行前往[CityScapes官网](https://www.cityscapes-dataset.com/)下载数据集，
+我们建议您将数据集存放于`PaddleSeg/data`中，以便与我们配置文件完全兼容。数据集下载后请组织成如下结构：
+
+    cityscapes
+    |
+    |--leftImg8bit
+    |  |--train
+    |  |--val
+    |  |--test
+    |
+    |--gtFine
+    |  |--train
+    |  |--val
+    |  |--test
+
+运行下列命令进行标签转换：
+```shell
+pip install cityscapesscripts
+python tools/convert_cityscapes.py --cityscapes_path data/cityscapes --num_workers 8
+```
+其中`cityscapes_path`应根据实际数据集路径进行调整。 `num_workers`决定启动的进程数，可根据实际情况进行调整大小。
+
+## 关于Pascal VOC 2012数据集
+[Pascal VOC 2012](http://host.robots.ox.ac.uk/pascal/VOC/)数据集以对象分割为主，包含20个类别和背景类，其中训练集1464张，验证集1449张。
+通常情况下会利用[SBD(Semantic Boundaries Dataset)](http://home.bharathh.info/pubs/codes/SBD/download.html)进行扩充，扩充后训练集10582张。
+运行下列命令进行SBD数据集下载并进行扩充：
+```shell
+python tools/voc_augment.py --voc_path data/VOCdevkit --num_workers 8
+```
+其中`voc_path`应根据实际数据集路径进行调整。
+
+**注意** 运行前请确保在PaddleSeg目录下执行过下列命令：
+```shell
+export PYTHONPATH=`pwd`
+# windows下请执行相面的命令
+# set PYTHONPATH=%cd%
+```
+
+## 关于ADE20K数据集
+[ADE20K](http://sceneparsing.csail.mit.edu/)由MIT发布的可用于场景感知、分割和多物体识别等多种任务的数据集。
+其涵盖了150个语义类别，包括训练集20210张，验证集2000张。
+
+## 关于Coco Stuff数据集
+Coco Stuff是基于Coco数据集的像素级别语义分割数据集。它主要覆盖172个类别，包含80个'thing'，91个'stuff'和1个'unlabeled',
+其中训练集118k, 验证集5k.
+
+在使用Coco Stuff数据集前， 请自行前往[COCO-Stuff主页](https://github.com/nightrome/cocostuff)下载数据集，或者下载[coco2017训练集原图](http://images.cocodataset.org/zips/train2017.zip), [coco2017验证集原图](http://images.cocodataset.org/zips/val2017.zip)及[标注图](http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/stuffthingmaps_trainval2017.zip)
+我们建议您将数据集存放于`PaddleSeg/data`中，以便与我们配置文件完全兼容。数据集下载后请组织成如下结构：
+
+    cocostuff
+    |
+    |--images
+    |  |--train2017
+    |  |--val2017
+    |
+    |--annotations
+    |  |--train2017
+    |  |--val2017
+    
+其中，标注图像的标签从0,1依次取值，不可间隔。若有需要忽略的像素，则按255进行标注。
+
+## 关于Pascal Context数据集
+Pascal Context是基于PASCAL VOC 2010数据集额外标注的像素级别的语义分割数据集。我们提供的转换脚本支持59个类别，其中训练集4996, 验证集5104张.
+
+
+在使用Pascal Context数据集前， 请先下载[VOC2010](http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar)，随后自行前往[Pascal-Context主页](https://www.cs.stanford.edu/~roozbeh/pascal-context/)下载数据集及[标注](https://codalabuser.blob.core.windows.net/public/trainval_merged.json)
+我们建议您将数据集存放于`PaddleSeg/data`中，以便与我们配置文件完全兼容。数据集下载后请组织成如下结构：
+    
+    VOC2010
+    |
+    |--Annotations
+    |
+    |--ImageSets
+    |
+    |--SegmentationClass
+    |  
+    |--JPEGImages
+    |
+    |--SegmentationObject
+    |
+    |--trainval_merged.json
+    
+其中，标注图像的标签从1，2依次取值，不可间隔。若有需要忽略的像素，则按0进行标注。在使用Pascal Context数据集时，需要安装[Detail](https://github.com/zhanghang1989/detail-api).
+
+## 自定义数据集
+
+如果您需要使用自定义数据集进行训练，请按照以下步骤准备数据.
+
+1.推荐整理成如下结构
+
+    custom_dataset
+        |
+        |--images
+        |  |--image1.jpg
+        |  |--image2.jpg
+        |  |--...
+        |
+        |--labels
+        |  |--label1.jpg
+        |  |--label2.png
+        |  |--...
+        |
+        |--train.txt
+        |
+        |--val.txt
+        |
+        |--test.txt
+
+其中train.txt和val.txt的内容如下所示：
+
+    images/image1.jpg labels/label1.png
+    images/image2.jpg labels/label2.png
+    ...
+
+2.标注图像的标签从0,1依次取值，不可间隔。若有需要忽略的像素，则按255进行标注。
+
+可按如下方式对自定义数据集进行配置：
+```yaml
+train_dataset:
+  type: Dataset
+  dataset_root: custom_dataset
+  train_path: custom_dataset/train.txt
+  num_classes: 2
+  transforms:
+    - type: ResizeStepScaling
+      min_scale_factor: 0.5
+      max_scale_factor: 2.0
+      scale_step_size: 0.25
+    - type: RandomPaddingCrop
+      crop_size: [512, 512]
+    - type: RandomHorizontalFlip
+    - type: Normalize
+  mode: train
+```
diff --git a/paddleseg/datasets/cocostuff.py b/paddleseg/datasets/cocostuff.py
new file mode 100644
index 0000000000..88a8c8a903
--- /dev/null
+++ b/paddleseg/datasets/cocostuff.py
@@ -0,0 +1,82 @@
+# Copyright (c) 2020 PaddlePaddle Authors. All Rights Reserved.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+import os
+import glob
+
+from paddleseg.datasets import Dataset
+from paddleseg.cvlibs import manager
+from paddleseg.transforms import Compose
+
+@manager.DATASETS.add_component
+class CocoStuff(Dataset):
+    """
+    COCO-Stuff dataset `https://github.com/nightrome/cocostuff`.
+    The folder structure is as follow:
+
+        cocostuff
+        |
+        |--images
+        |  |--train2017
+        |  |--val2017
+        |
+        |--annotations
+        |  |--train2017
+        |  |--val2017
+
+
+    Args:
+        transforms (list): Transforms for image.
+        dataset_root (str): Cityscapes dataset directory.
+        mode (str): Which part of dataset to use. it is one of ('train', 'val'). Default: 'train'.
+    """
+
+    def __init__(self, transforms, dataset_root, mode='train'):
+        self.dataset_root = dataset_root
+        self.transforms = Compose(transforms)
+        self.file_list = list()
+        mode = mode.lower()
+        self.mode = mode
+        self.num_classes = 172
+        self.ignore_index = 255
+
+        if mode not in ['train', 'val']:
+            raise ValueError(
+                "mode should be 'train', 'val', but got {}.".format(mode))
+
+        if self.transforms is None:
+            raise ValueError("`transforms` is necessary, but it is None.")
+
+        img_dir = os.path.join(self.dataset_root, 'images')
+        label_dir = os.path.join(self.dataset_root, 'annotations')
+        if self.dataset_root is None or not os.path.isdir(
+                self.dataset_root) or not os.path.isdir(
+                    img_dir) or not os.path.isdir(label_dir):
+            raise ValueError(
+                "The dataset is not Found or the folder structure is nonconfoumance."
+            )
+
+        label_files = sorted(
+            glob.glob(
+                os.path.join(label_dir, mode+'2017', '*.png')))
+        
+        img_files = sorted(
+            glob.glob(os.path.join(img_dir, mode+'2017',  '*.jpg')))
+
+        self.file_list = [[
+            img_path, label_path
+        ] for img_path, label_path in zip(img_files, label_files)]
+
+
+