Skip to content

Latest commit

 

History

History
25 lines (19 loc) · 937 Bytes

README.md

File metadata and controls

25 lines (19 loc) · 937 Bytes

Hadoop face detection system

Technologies

  • Python
    • PyArrow - Apache Arrow Python
  • Hadoop
    • Hadoop streaming
  • OpenCV
    • Haar cascade algorithm

Pipeline

image

Mapper

image

MapReduce Job Execution Flow

image

Evaluation

We have used F1-score as metric for evaluation.

The average result on testing dataset was 68,40%

image

For the purpose of increasing accuracy, we may utilize a larger dataset and a more complex face detection algorithm, such as YOLO.