Team DeepThings (Mez Gebre and I) won the Best Product category at the Deep Learning Hackathon in San Francisco. In three days we developed a real-time system that identifies objects and speaks what it sees, conceived as a tool for the visually impaired, since it could make navigation easier. The proof of concept ran on a laptop; the final model ran on Android.
This is only the first prototype, for Windows. The main steps were to:
- Get the webcam feed without bottlenecks (see the sketch after this list).
- Recognize images using Inception v3.
- Convert text to speech with the Google TTS API.
- Build a functional model.
- Tune the parameters.
- Display the results visually.
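A rough sketch of how these pieces can fit together is below. This is a minimal illustration, not the project's actual code: it assumes OpenCV for capture, Keras' bundled InceptionV3 weights (the original likely loaded a frozen TensorFlow graph), and the gTTS package for the Google TTS API. A background thread keeps the latest frame available so classification never stalls the feed:

```python
# Hypothetical pipeline sketch: threaded webcam capture -> Inception v3 -> gTTS.
import threading

import cv2
import numpy as np
from gtts import gTTS
from tensorflow.keras.applications.inception_v3 import (
    InceptionV3, decode_predictions, preprocess_input)


class WebcamStream:
    """Grab frames on a background thread so inference never blocks capture."""

    def __init__(self, src=0):
        self.cap = cv2.VideoCapture(src)
        self.ok, self.frame = self.cap.read()
        self.stopped = False
        threading.Thread(target=self._update, daemon=True).start()

    def _update(self):
        while not self.stopped:
            self.ok, self.frame = self.cap.read()

    def read(self):
        return self.frame

    def stop(self):
        self.stopped = True
        self.cap.release()


model = InceptionV3(weights="imagenet")  # downloads weights on first run
stream = WebcamStream()

frame = stream.read()
# Inception v3 expects 299x299 RGB input scaled to [-1, 1].
img = cv2.cvtColor(cv2.resize(frame, (299, 299)), cv2.COLOR_BGR2RGB)
batch = preprocess_input(np.expand_dims(img.astype("float32"), axis=0))
label = decode_predictions(model.predict(batch), top=1)[0][0][1]

# Speak the top label; playback of speech.mp3 is left to any audio player.
gTTS(text=f"I see a {label}").save("speech.mp3")
stream.stop()
```

In a real-time loop the classification and speech steps would run repeatedly on `stream.read()`, which always returns the most recent frame instead of a backlogged one.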
This module requires:
- TensorFlow (for Inception v3)
- OpenCV (for the webcam feed)
- gTTS (for the Google TTS API)
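Assuming the stack above, one way to install the dependencies is:

```
pip install tensorflow opencv-python gTTS
```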
Just run:

```
python classify_real_time_v2.py
```
The output should look like this:
For more information, check my Medium post here.
This project is Copyright © 2016-2017 Lucas Gago. It is free software and may be redistributed under the terms specified in the MIT License.