This project was conducted alongside 3 other group members for the Computational and Graphical Statistics class in Fall 2021, taught by professor Min Xu. Content in this repository was largely written by Kris Sagarino.
The dataset featured is the UCI Breast Cancer dataset, found here: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic)
• 'dataExploration.R' combs through the data (wdbc.data) and creates a usable dataframe • 'final project.R' conducts a ridge regularized logistic regression and LASSO analysis using the glmnet package • 'final_proj_logistc_reg.R' is a more refined version of the previous R file, featuring comparisons between the logistic regression and LASSO analysis and printouts of important information regarding analysis.