rohansattarapu18/chicago_crime_rate_prediction

chicago_crime_rate_prediction

This repository contains data, scripts, and documentation for analyzing and predicting crime rates in Chicago from weather data. It uses Linear Regression and Random Forest models for prediction and ARIMA for time series analysis, all written in R and developed in RStudio.

Contents

  • Data Collection and Cleaning: Scripts and methodologies for acquiring and cleaning the crime and weather datasets.
  • Exploratory Data Analysis (EDA): Initial analysis to uncover patterns and insights within the data.
  • Feature Engineering: Techniques used to create predictive features from the raw data.
  • Data Modeling: Implementation of statistical models, including Linear Regression and Random Forest Regression.
  • Time Series Analysis: ARIMA models used to understand and predict temporal patterns in crime.
  • Documentation: Detailed project reports and presentation slides explaining the methodologies, results, and implications of the findings.
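
The Feature Engineering and Data Modeling steps listed above could look roughly like the following. This is a minimal sketch, not the repository's actual code: the data is synthetic, every column name (temp_c, precip_mm, crime_count, is_weekend) is hypothetical, and it uses base R's lm() only, so that it runs without extra packages.

```r
# Synthetic daily data. All names and values here are assumptions for
# illustration; they are not taken from the repository's datasets.
set.seed(42)
n <- 730
dates <- seq(as.Date("2020-01-01"), by = "day", length.out = n)
day_of_year <- as.integer(format(dates, "%j"))

temp_c <- 10 + 12 * sin(2 * pi * (day_of_year - 100) / 365) + rnorm(n, sd = 3)
precip_mm <- pmax(0, rnorm(n, mean = 2, sd = 4))
crime_count <- round(600 + 8 * temp_c - 3 * precip_mm + rnorm(n, sd = 25))

daily <- data.frame(date = dates, temp_c, precip_mm, crime_count)

# Feature engineering: derive a weekend flag from the date.
# POSIXlt wday is 0 for Sunday and 6 for Saturday.
daily$is_weekend <- as.POSIXlt(daily$date)$wday %in% c(0, 6)

# Chronological split: first 80% of days for training, the rest held out.
split_at <- floor(0.8 * n)
train_set <- daily[seq_len(split_at), ]
test_set <- daily[(split_at + 1):n, ]

# Linear regression of daily crime count on weather and calendar features.
fit <- lm(crime_count ~ temp_c + precip_mm + is_weekend, data = train_set)

# Evaluate on the held-out days with root mean squared error.
pred <- predict(fit, newdata = test_set)
rmse <- sqrt(mean((test_set$crime_count - pred)^2))

print(coef(fit))
print(rmse)
```

The split is chronological rather than random because the observations are ordered in time. A Random Forest could be fitted on the same train_set with the randomForest package listed under Tools Used.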

Repository Structure

  • scripts/ - Contains R scripts for data preprocessing, exploratory analysis, and modeling.
  • data/ - Raw and processed datasets used in the analyses.
  • docs/ - Project reports and presentation materials.
  • figures/ - Generated plots and figures to visualize insights and model results.

Key Findings

  • Predictive Modeling: The Linear Regression and Random Forest models produce forecasts of crime occurrences that can support proactive public safety planning.
  • Temporal Analysis: Time series analysis highlights significant seasonal and temporal trends in crime rates, which are crucial for planning law enforcement interventions.
  • Impact of Weather: The analysis explores correlations between weather conditions and crime rates, highlighting the influence of environmental factors on crime.
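
As an illustration of the temporal and weather analyses summarized above, here is a minimal sketch on synthetic monthly data. It is not the project's code and does not reproduce its results: the series, the variable names (avg_temp, crimes), and the model order are all assumptions. It uses arima() from base R's stats package rather than the forecast package so that it has no dependencies.

```r
# Synthetic monthly series. All values are made up for illustration.
set.seed(1)
n_months <- 72
month_idx <- seq_len(n_months)

# Hypothetical average temperature with a yearly cycle, plus noise.
avg_temp <- 10 + 12 * sin(2 * pi * (month_idx - 4) / 12) +
  rnorm(n_months, sd = 1.5)

# Hypothetical monthly crime counts that rise with temperature.
crimes <- 20000 + 250 * avg_temp + rnorm(n_months, sd = 600)

# Weather impact: Pearson correlation between temperature and crime counts.
r <- cor(avg_temp, crimes)
print(r)

# Temporal analysis: monthly time series with a 12-month seasonal period.
crime_ts <- ts(crimes, start = c(2015, 1), frequency = 12)

# Seasonal ARIMA(1,0,0)(0,1,0)[12]: seasonal differencing plus an AR(1) term.
# The order is chosen by hand here, not selected from the data.
fit <- arima(crime_ts,
             order = c(1, 0, 0),
             seasonal = list(order = c(0, 1, 0), period = 12))

# Forecast the next 12 months.
fc <- predict(fit, n.ahead = 12)
print(round(fc$pred))
```

With the forecast package installed, auto.arima() could be used to select the model order instead of fixing it by hand.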

Tools Used

  • R: For all data processing and analysis tasks.
  • Libraries: dplyr, tidyr, ggplot2, forecast, caret, and randomForest.
  • Software: RStudio for script execution and development.
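
Before running the scripts, it may help to check that the libraries listed above are available. The snippet below is a small sketch that reports which ones are missing; the variable names are illustrative, and the install line is left commented out so that nothing is installed without your say-so.

```r
# Packages named in the Tools Used list.
required <- c("dplyr", "tidyr", "ggplot2", "forecast", "caret", "randomForest")

# Compare against the packages installed in the current R library paths.
installed <- rownames(installed.packages())
missing_pkgs <- setdiff(required, installed)

if (length(missing_pkgs) > 0) {
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
  # install.packages(missing_pkgs)  # uncomment to install from CRAN
} else {
  message("All required packages are installed.")
}
```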

How to Use

  1. Clone this repository.
  2. Navigate to the scripts/ directory.
  3. Run the scripts in the order specified in the README.md.

Contributors

  • Rohan Sattarapu
