A notebook for analyzing BORME, Spain's Official Bulletin of the Companies House
Jupyter notebook adapted from Marcel Caraciolo's excellent tutorial on TF-IDF: http://aimotion.blogspot.com.es/2011/12/machine-learning-with-python-meeting-tf.html
1.- Follow the instructions on BORME's official webpage to download the data into a folder of your choice: http://www.boe.es/datosabiertos/
2.- Convert the downloaded PDFs into plain text
3.- Run the notebook
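
For orientation, here is a minimal sketch of the kind of TF-IDF weighting the notebook is built around, applied to a few short strings that stand in for the converted BORME text. It is not the notebook's own code: the function names (`tokenize`, `tf_idf`), the sample strings, and the smoothed IDF formula are assumptions chosen for illustration, and it uses only the standard library, whereas the referenced tutorial relies on scikit-learn.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def tf_idf(documents):
    """Return one {term: weight} dict per document.

    TF is the raw term count. IDF is a smoothed variant (an assumption):
    idf(t) = ln((1 + N) / (1 + df(t))) + 1
    where N is the number of documents and df(t) the number containing t.
    """
    tokenized = [tokenize(doc) for doc in documents]
    n_docs = len(tokenized)

    # Count in how many documents each term appears at least once.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        weights.append({
            term: count * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return weights


# Hypothetical stand-ins for text extracted from BORME PDFs.
sample_docs = [
    "constitucion sociedad limitada madrid",
    "cese administrador sociedad limitada",
    "nombramiento administrador madrid",
]
sample_weights = tf_idf(sample_docs)

# Print the terms of the first document, highest weight first.
for term, weight in sorted(sample_weights[0].items(), key=lambda kv: -kv[1]):
    print(f"{term}: {weight:.3f}")
```

Terms that occur in fewer documents receive a higher weight, which is what makes TF-IDF useful for spotting the distinctive vocabulary of each bulletin entry.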