Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

🚸 Walkthrough on object detection with bounding boxes #49

Merged
merged 9 commits into from
Sep 7, 2022

Conversation

weiji14
Copy link
Owner

@weiji14 weiji14 commented Sep 2, 2022

Tutorial on preparing vector bounding box labels for an object detection task. Using Microsoft Building Footprints over Kampong Ayer, Brunei 🇧🇳

Preview at https://zen3geo--49.org.readthedocs.build/en/49/object-detection-boxes.html

image vector
Buildings over Kampong Ayer, Brunei Digitized vector building footprints over Kampong Ayer, Brunei

TODO:

Context geo-tiles in Python!
Initial draft tutorial on preparing vector bounding box labels for an object detection task. Will be looking at building footprints over Kampong Ayer, Brunei on ESRI World Imagery. Also added intersphinx links for contextily and numpy.
@weiji14 weiji14 added the documentation Improvements or additions to documentation label Sep 2, 2022
@weiji14 weiji14 added this to the 0.4.0 milestone Sep 2, 2022
@weiji14 weiji14 self-assigned this Sep 2, 2022
Fsspec-compatible Azure Datalake and Azure Blob Storage access!
Show how to load the Microsoft Building Footprints from GeoParquet files! Using `geopandas.read_parquet` for now (should it become a DataPipe?) to demonstrate things work. Have tried `pyogrio.read_dataframe(asset.href)` but got `DataSourceError: abfs://footprints/global/2022-07-06/ml-buildings.parquet/RegionName=Brunei: No such file or directory`. Might need to figure out https://gdal.org/user/virtual_file_systems.html#vsiadls auth issues somehow.
Show how to turn a numpy.ndarray image into an xarray.DataArray grid with coordinates assigned. Had to first change the image from channel-last to channel-first format (Pytorch/Chainer style) and drop the alpha channel (since there's no mask anyway. Afterwards is just setting the coordinates with xarray/rioxarray.
New GeoPandasRectangleClipper DataPipe to use!
Show how to use small chips (generated with xbatcher) as masks to clip an input vector geometry. Included an important note that GeoPandasRectangleClipper only works with one global vector database, but it will reproject and clip to each chip's crs and bounding box extent. Showed a side by side image and vector polygon visualization, and did a bit of Brunei Tourism marketing 😆 Also had to change `.rio.set_crs` to `.rio.write_crs` to preserve the crs property on xbatcher slicing.
Show how to turn things upside down! First step is to derive geographic bounding boxes from the polygons. Second step is to convert the boxes from geographical coordinates to image/pixel coordinates. Also added an intersphinx link for torchvision to reference the `torchvision.ops.box_convert` function.
Show how to create a list of bounding box tensors, looping over batches of object detection data, and georeferencing of the 'predicted' output. Metadata is critical to tie everything together! Also added an intersphinx link for mmdetection, shoutout to their fantastic Model Zoo!
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant