Problem Statement
"BizCardX-Extracting-Business-Card-Data-with-OCR" appears to involve the challenge of efficiently extracting data from business cards using OCR (Optical Character Recognition) technology. This suggests a desire to automate and streamline the process of capturing information from business cards, potentially eliminating manual data entry and enhancing the overall efficiency of handling business card data. The goal is likely to develop a solution that can accurately and quickly extract relevant information, improving the user experience and reducing the time and effort traditionally associated with managing business card data.
Technology Stack Used:
- Python
- Mysql
- streamlit
- numpy
- easyocr
- cv2
- re
Installation:
pip install pandas
pip install numpy
pip install cv2
pip install mysql.connector
pip install re
pip install stramlit
pip install easyocr
Features
- streamline the process of extracting text information from business card images.
- Leveraging EasyOCR, it employs OpenCV for image preprocessing tasks like resizing, cropping, and enhancement.
- The use of regular expressions (RegEx) facilitates the parsing and extraction of specific fields such as name, designation, company, and contact details.
- To ensure easy retrieval and analysis, the extracted information is stored in a MySQL database.
- For user convenience, a friendly interface built with Streamlit allows users to effortlessly upload images, extract information, and interact with the database, providing a seamless experience in managing business card data.
Acknowledgments
*Streamlit: Used for building interactive web applications with ease, providing a user-friendly interface for tasks like uploading images, extracting information, and interacting with the database.
*EasyOCR: Employed for text extraction from images, facilitating the process of extracting relevant information from business card images.
*OpenCV: Utilized for image preprocessing and manipulation tasks such as resizing, cropping, and enhancing, ensuring that the input images are optimized for further processing.
*MySQL: Chosen as the database management system to store and manage the extracted information, making it easily retrievable and available for analysis.