Skip to content

Classification/Recognition and clustering of English letters using ML algorithms.

License

Notifications You must be signed in to change notification settings

tekboart/ML-letter_recognition

Repository files navigation


               

Classification and Clustering of English letters using Machine Learning (ML) algorithms

Python Pandas Sklearn Matplotlib seaborn

The ML algorithms used:

  • Classification
    1. Decision Tree
    2. Support Vector Machine (SVM)
    3. Ada Boost (an ensemble method)
  • Clustering
    1. Hierarchical clustering (Agglomerative)

Requirements

Python Pandas

Project Dir Structure

.
├── data
│   ├── 1. original
│   └── 2. edited
├── images
│   └── logos
├── logs
├── outputs
├── reports
└── utils
    └── jqmcvi

10 directories

Data

The data was obtained from here and was firt used and introduced by Frey and Slate [1].

Dataset info:

  • Title: Letter Image Recognition Data
  • Source Information
    • Creator: David J. Slate

      Odesta Corporation; 1890 Maple Ave; Suite 115; Evanston, IL 60201

    • Donor: David J. Slate (dave@math.nwu.edu) (708) 491-3867
    • Date: January, 1991
  • Number of Instances: 20000
  • Number of Attributes: 17 (Letter category and 16 numeric features)
  • Attribute Information:
    1. lettr capital letter (26 values from A to Z)
    2. x-box horizontal position of box (integer)
    3. y-box vertical position of box (integer)
    4. width width of box (integer)
    5. high height of box (integer)
    6. onpix total # on pixels (integer)
    7. x-bar mean x of on pixels in box (integer)
    8. y-bar mean y of on pixels in box (integer)
    9. x2bar mean x variance (integer)
    10. y2bar mean y variance (integer)
    11. xybar mean x y correlation (integer)
    12. x2ybr mean of x * x * y (integer)
    13. xy2br mean of x * y * y (integer)
    14. x-ege mean edge count left to right (integer)
    15. xegvy correlation of x-ege with y (integer)
    16. y-ege mean edge count bottom to top (integer)
    17. yegvx correlation of y-ege with x (integer)

Contact

Should you have any questions, feel free to contact TekBoArt @tekboart.

Reference

[1] Frey, P. W., & Slate, D. J. (1991). Letter recognition using Holland-style adaptive classifiers.Machine learning,6(2), 161-182.

License

Shield: CC BY-NC-SA 4.0

  • Refer to the file LICENSE for more information regarding the license of this repository.

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

CC BY-NC-SA 4.0

Releases

No releases published

Packages

No packages published