Skip to content

Latest commit

 

History

History
15 lines (8 loc) · 599 Bytes

File metadata and controls

15 lines (8 loc) · 599 Bytes

Large-Scale-Data-Computation-Word-Count-project

Final project of the course CDLE (Computação de Dados em Larga Escala)

Mestrado Engenharia Informática e Multimédia - ISEL


Technologies used: Java, Apache Hadoop, HDFS

This work uses the MapReduce programming model to solve large-scale computational problems (Big Data).

This work presents a word count project using Apache Hadoop for analyzing a set of documents by counting the frequency of n-grams.

The project also utilizes compressed data, collects statistical data using counters, and make use of a distributed cache.