KinderMiner 2.0

This repository provides the Python scripts for KinderMiner 2.0, a general text-mining system for finding associations between any two terms across ~30 million PubMed articles. The project was developed by the Stewart Computational Biology Group (https://morgridge.org/research/regenerative-biology/bioinformatics/) within the Thomson Lab (https://morgridge.org/research/regenerative-biology/thomson-lab/) at the Morgridge Institute for Research, Madison, WI, USA.

A local version of the PubMed database is required to run KinderMiner 2.0. See the GitHub project https://github.com/iross/km_indexer/releases/tag/v1.2 for details.

---- RUN ON THE COMMAND LINE ----

$ python kinderminer2.py TARGETS_FILE KEYPHRASE_FILE OUTPUT_DIRECTORY

---- EVALUATE AND RANK TARGETS ----

To evaluate at the default Fisher's exact test (FET) p-value threshold, 1.0E-05:
$ python evaluate_fisher_exact_fetpvalue_and_ratio_sorted.py OUTPUT_DIRECTORY/OUTPUT_FILE OUTPUT_DIRECTORY/OUTPUT_FILE_EVALUATION_RESULT

To evaluate at a customized FET p-value threshold (e.g. 0.05):
$ python evaluate_fisher_exact_fetpvalue_and_ratio_sorted.py OUTPUT_DIRECTORY/OUTPUT_FILE OUTPUT_DIRECTORY/OUTPUT_FILE_EVALUATION_RESULT 0.05
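
The evaluation step can be illustrated with a minimal sketch. It is not the code in evaluate_fisher_exact_fetpvalue_and_ratio_sorted.py; it only shows the general idea that the script name suggests: compute a Fisher's exact test p-value per target from article counts, keep targets at or below the threshold, and sort the rest by co-occurrence ratio. The function names, the count layout (both, target only, key phrase only, neither), the one-sided test, the descending sort, and the example counts are all assumptions made for illustration.

```python
from math import exp, lgamma


def log_comb(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def fisher_exact_greater(both, target_only, keyphrase_only, neither):
    """One-sided Fisher's exact test p-value for a 2x2 table of article counts.

    Tests whether target and key phrase co-occur more often than expected
    by chance (hypergeometric upper tail). The one-sided choice is an
    assumption of this sketch.
    """
    with_target = both + target_only
    with_keyphrase = both + keyphrase_only
    total = both + target_only + keyphrase_only + neither
    log_denominator = log_comb(total, with_target)
    p_value = 0.0
    # Sum the probabilities of tables at least as extreme as the observed one.
    for k in range(both, min(with_target, with_keyphrase) + 1):
        p_value += exp(
            log_comb(with_keyphrase, k)
            + log_comb(total - with_keyphrase, with_target - k)
            - log_denominator
        )
    return min(p_value, 1.0)


def rank_targets(counts, threshold=1.0e-05):
    """Keep targets with p-value <= threshold, sorted by co-occurrence ratio."""
    results = []
    for target, (both, target_only, keyphrase_only, neither) in counts.items():
        p_value = fisher_exact_greater(both, target_only, keyphrase_only, neither)
        if p_value <= threshold:
            with_target = both + target_only
            ratio = both / with_target if with_target else 0.0
            results.append((target, p_value, ratio))
    # Highest ratio first (assumed ordering).
    return sorted(results, key=lambda row: row[2], reverse=True)


# Hypothetical counts: (both, target only, key phrase only, neither).
example_counts = {
    "geneA": (40, 60, 200, 99700),
    "geneB": (1, 999, 239, 98761),
}

for target, p_value, ratio in rank_targets(example_counts):
    print(target, p_value, ratio)
```

Passing a different threshold, for example `rank_targets(example_counts, threshold=0.05)`, mirrors the optional third command-line argument shown above. The sketch uses only the standard library, so it also runs on Python 3.7.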

KinderMiner 2.0 was developed with Python 3.7.2.

Authors: Kalpana Raja and John Steill

Affiliation: Morgridge Institute for Research, Madison, WI, USA.
