Deep-Vision-System-with-5G-Edge-GPU-Server

Wheel detection based on CNNs and fiducial markers for plane detection. The frames are streamed from the delta controller using 5G.

Features

Plane Detection
- Uses ArUco/AprilTag markers for plane recognition.
- Calculates homography transformations for planar mapping.
- Added under the plane_detector submodule.
Object Detection
- Employs a neural network (e.g., MobileNet or ResNet) for object detection.
- Supports object tracking and visualization with bounding boxes and confidence scores.
Data Communication
- Posts detection results to a server endpoint in JSON format.
Real-Time Performance
- Processes frames from a camera stream and displays results in real time.
- Uses multi-threading and multiprocessing to optimize performance.

Requirements

Libraries

Python 3.9
PyTorch
OpenCV (cv2)
NumPy
Requests
Custom modules:
- Detection_models
- PlaneDetection from plane_computation.plane_detection
- test_model_Delta_v3 from conv_net_detect

Hardware

A camera device (Raspberry Pi Camera or other video sources).
GPU for running neural networks (CUDA-supported).

Setup

Clone the repository with submodule

git clone --recursive  <repository-url>
cd <repository-folder>

Install dependencies
Ensure Python dependencies are installed:
```
pip install <missing module>
```
Prepare Configuration Files
Add the following required configuration files to the specified paths:
- Camera Configuration:
  - camera_matrix_rpi.txt
  - distortion_rpi.txt
- Plane Points:
  - plane_points_new_tray.json
  - plane_points_old_tray.json
- Neural Network Model:
  - Pre-trained weights (MOBILENET_V2_FINER_GRID_2_weights_saved.pt or RESNET_18_FINER_GRID_2_weights_saved.pt).

Directory Structure
Ensure the following structure:

.
├── conv_net_detect/
│   ├── disk_centroid_template_1.png
│   ├── disk_centroid_template_2.png
│   └── disk_centroid_template_3.png
├── plane_computation/
│   └── plane_detection.py
├── model_configs/
│   └── [MODEL_WEIGHTS_FILES]
├── vision_configs/
│   └── [CAMERA_CONFIG_FILES]
└── main.py

Usage

Run the Application
```
python capture_stream.py
```
Key Operations
- ESC: Stop the program.
- S: Save frames for debugging or analysis.
Modify Parameters
Update the parameters in the config dictionary to adapt to your setup:
- Change IS_ONLINE to toggle between a live camera and a video file.
- Update paths for models and camera configurations.

Components Overview

`robot_perception`

Processes camera frames, performs plane and object detection, and overlays the results.

`cam_reader`

Captures video frames from the camera or video source.

`post_detections`

Posts detection data to a specified server endpoint.

`plot_boxes`

Draws bounding boxes and labels on detected objects.

Troubleshooting

Camera Not Opening:
Check cam_source in the configuration.
Model Loading Issues:
Ensure the pre-trained model weights are in the correct path.
Slow Performance:
Use a GPU and verify CUDA installation.

License

This project is licensed under the MIT License.

Acknowledgments

Darknet2PyTorch library for object detection.
OpenCV for computer vision operations.
PyTorch for deep learning models.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
__pycache__		__pycache__
assets		assets
capture_stream_yolo_v4		capture_stream_yolo_v4
conv_net_detect		conv_net_detect
deprecated		deprecated
model_configs		model_configs
plane_detector @ b00b468		plane_detector @ b00b468
rpi_delta_controller_client		rpi_delta_controller_client
tool		tool
vision_configs		vision_configs
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
capture_stream.py		capture_stream.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep-Vision-System-with-5G-Edge-GPU-Server

Features

Requirements

Libraries

Hardware

Setup

Usage

Components Overview

`robot_perception`

`cam_reader`

`post_detections`

`plot_boxes`

Troubleshooting

License

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

License

david-s-martinez/Deep-Vision-System-with-5G-Edge-GPU-Server

Folders and files

Latest commit

History

Repository files navigation

Deep-Vision-System-with-5G-Edge-GPU-Server

Features

Requirements

Libraries

Hardware

Setup

Usage

Components Overview

robot_perception

cam_reader

post_detections

plot_boxes

Troubleshooting

License

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

`robot_perception`

`cam_reader`

`post_detections`

`plot_boxes`

Packages