This is the first 2,000 images of our internal comma10k dataset. After we clean up these new labels, we'll release more. Learn more from the Medium post, or on the comma.ai discord in the #comma-pencil channel.
It's 10,000 pngs of real driving captured from the comma fleet. It's MIT license, no academic only restrictions or anything.
Run
./viewer.pyto see them with segnet overlay.
imgs/ -- The png image files masks/ -- PNG segmentation masks (update these!) segs/ -- The outputs in probability from our internal segnet (unreleased, too big)
1 - #402020 - road (all parts, anywhere nobody would look at you funny for driving) 2 - #ff0000 - lane markings (don't include non lane markings like turn arrows and crosswalks) 3 - #808060 - undrivable 4 - #00ff66 - movable (vehicles and people/animals) 5 - #cc00ff - my car (and anything inside it, including wires, mounts, etc. No reflections)
-
Visit the Google Spreadsheet (request access to edit the spreadsheet if you haven't already) and put your discord username in the "labeller" column for the mask(s) you're working on and change the status to "In Progress"
-
Start labelling! Useful label tools:
- img-labeler
- The included comma pencil tool
- An external image manipulation tool such as GIMP/Krita (Free) or Adobe Photoshop (Paid) If you choose to use an external tool please ensure your color mode is set to 8-bit, and that antialiasing doesn't change the colors on the edges of your mask.
-
Fork this repository to your account using the "Fork" button in the top right
-
Create a new branch from the master branch, and use your labelling tool of choice to label some images
-
Open a pull request from your new branch to the master branch in the official repository to submit your changes!
-
Visit the #comma-pencil channel on the comma.ai Discord for the latest news and chat about the project.
See the pencil
folder.
cd pencil
pip install -r requirements.txt
python server.py
Then open a browser window to http://localhost:5000/
comma10k is still a work in progress. For now, just cite the GitHub link. Once we reach 10k images, we'll release a paper, a train/test split, and a benchmark model.