Skip to content

Latest commit

 

History

History
82 lines (48 loc) · 6.7 KB

README.md

File metadata and controls

82 lines (48 loc) · 6.7 KB

Topological Descriptors for Parkinson’s Disease Classification and Regression Analysis

This repository contains sample code for understanding the process of persistence image extraction referenced in the paper.

This file contains information on the function of each of the files and folders in the repository. At the end of the document, you can find some troubleshooting information.

Primary Files:

The most important files to use to understand the proposed method are sublevel_set_TDA.ipynb and Sublevel-Set-TDA.py. In these files, we utilize sample signals meant to mimic different classes of subjects to run through the proposed method. We process the data in the same way we would real-world data. The sublevel_set_TDA.ipynb is meant to be a visual aid using only a couple sample signals, while the Sublevel-Set-TDA.py script is meant to analyze a full dataset. Refer to the “Files” section for more detailed information about the contents of the repository, including these primary files.

Files:

sublevel_set_TDA.ipynb : Jupyter Notebook script designed to familiarize the user with the method of persistence diagram and persistence image extraction. A couple of sample signals are used, and visuals are provided to aid in this learning process. A more complex example of persistence diagram extraction, using sample_signal.mat, is provided at the end of the script.

Sublevel-Set-TDA.py: Python script that takes information from the Dummy_Dataset to exemplify the proposed method. Preprocesses all of the data in the Dummy_Dataset, creates persistence diagrams from the processed signals, and then creates persistence images from the persistence diagrams. The images are then run through an SVM for classification. The persistence images could also be used in regression tasks. With the current settings, SVM classification is 100% because this is merely meant to be an exercise in how to implement the proposed method rather than an indication of real-world accuracy values.

utils.py: Python script that contains helper functions that are used in the Sublevel-Set-TDA.py and sublevel_set_TDA.ipynb files.

Create_Dummy_Dataset.py: Python script to enable the user to create a collection of their own sample signals from which to test on. This script was used to create the signals in the Dummy_Dataset folder.

Dummy_Dataset: Folder containing sample signals to highlight the method explained in the paper. Contains 50 signals with an average amount of distortion in the Normal_Distortion folder, and 30 signals with heightened distortion in the Increased_Distortion folder.

Mean_Normalized_Signals: Folder containing the same signals in the Dummy_Dataset, however, the signals have been processed to be mean normalized.

Range_Enforced_Signals: Folder containing the same signals as in the Mean_Normalized_Signals, however, the signals y-values have been enforced to lie between [-1, 1]

Persistence_Diagrams: Folder containing Persistence Diagrams extracted using the signals in the Range_Enforced_Signals folder.

Persistence_Images: Folder containing Persistence Images extracted from the Persistence_Diagrams folder, and using parameters specified by the Sublevel-Set-TDA.py script

Helper_Scripts: Folder containing many of the same functions found in utils.py, however, they are separated into their own scripts

sample_signal.mat: A complex sample signal displayed in the sublevel_set_TDA.ipynb script. The purpose of this signal is to highlight many of the different kinds of pairings that are possible when Persistence Diagrams are extracted.

jpg/png files: Supplementary image files for sample_signal.mat

Paper Conference Video.pptx: EMBC Conference presentation PowerPoint slides

Paper Conference Video.pdf: EMBC Conference presentation PDF (of slides)

Required Packages:

The following is a list of required packages for the code to run. In parenthesis, the version used when creating this repository is included.

numpy (version 1.15.4)

matplotlib (version 3.2.1)

scipy (version 1.4.1)

ripser (version 0.3.0)

persim (version 0.1.2)

sklearn (version 0.0)

os

sys

Trouble Shooting

If any of your code fails to run, please check the version of your numpy and persim packages. A specific function that is used in persim (which is not developed by us) utilizes an old version of numpy that may not work if you have a newer version of numpy installed. The version used to develop the repository are listed in the “Packages” section. Also note that reinstalling persim will update your numpy to the most recent version of numpy. Therefore, if you have to move to a different version of numpy, do so after installing persim.

You can use the command “pip list” to view your current package versions.

Link to Paper:

Topological Descriptors for Parkinson's Disease Classification and Regression Analysis

Nawar, A., Rahman, F., Krishnamurthy, N., Som, A. and Turaga, P., Topological Descriptors for Parkinson's Disease Classification and Regression Analysis. arXiv preprint arXiv:2004.07384., 2020.

Bibtex:

@article{nawar2020topological,
  title={Topological Descriptors for Parkinson's Disease Classification and Regression Analysis},
  author={Nawar, Afra and Rahman, Farhan and Krishnamurthi, Narayanan and Som, Anirudh and Turaga, Pavan},
  journal={arXiv preprint arXiv:2004.07384},
  year={2020}
}