A Medication Compliance Monitoring System using Machine Learning and Distributed Computing

By: Katja Wittfoth, Hai Vu Le, Donya Fozoonmayeh and Diane Woodbridge

This is a research project in cooperation with professor Diane Woodbridge at University of San Francisco. My team and I presented our project's result at the Data Institute SF Conference 2019. We have submitted a scientific paper which is currently in review. Our work is built upon and continuation of the project which was published Jul 2018.

Research Overview

This research aims to detect medication intake based on accelerometer and gyroscope data collected by smartwatches. The end goal is to develop a smart machine learning application that would provide patients with accurate and timely reminders to improve their medication adherence.

Why medication compliance is important?

Poor medication intake threatens a patient's health and puts an economic burden on the health-care system. The annual medication-adherence rate in the US is between 25% to 50% which accounts for about $300 billion avoidable healthcare costs.

Data processing and engineering

We collected three-dimensional sensor data of following activities: medication intake (pill and liquid), texting, writing, drinking bottled water, and walking. The data is from 24 study subjects using LG Watch Sport devices. To account for the variation in activity duration and the difference in subjects' pace, all records were discretized into the same number of time windows. For each dimension, we extract 18 descriptive statistics out of each window as features to train ML models.

To tackle the challenge of processing massive high frequency sensor data in real time, we take a distributed computing approach with an end-to-end pipeline built on distributed systems.

Results

For efficient medication adherence application, it is important to have an algorithm which would not only yield highest precision but also would be cost-efficient in production meaning fast training and prediction time.

We trained various ML models for this classification problem, including Logistic Regression, Random Forest, and XGBoost. Our best RF model yields a precision score of 0.98 with data discretized into 40 windows. It outperformed other models in terms of speed with 14 seconds training and 0.17 seconds prediction time.

Tools

Python, PySpark, SparkML, MongoDB, AWS S3, AWS EMR

This repository is a duplication of the private repository with confidential contents removed.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
images		images
nbs		nbs
DSCO19-poster.pdf		DSCO19-poster.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Medication Compliance Monitoring System using Machine Learning and Distributed Computing

Research Overview

Why medication compliance is important?

Data processing and engineering

Results

Tools

About

Releases

Packages

Languages

katjawittfoth/Med_Adherence

Folders and files

Latest commit

History

Repository files navigation

A Medication Compliance Monitoring System using Machine Learning and Distributed Computing

Research Overview

Why medication compliance is important?

Data processing and engineering

Results

Tools

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages