Welcome to StatFlow_EDA_APP, a web application built using Streamlit, Pandas Profiling, and Python. This app is designed to provide you with a new and efficient way to perform Exploratory Data Analysis (EDA). With StatFlow, you can easily visualize your data and conduct statistical analysis, gaining valuable insights to make informed decisions.
-
Interactive Data Visualization: StatFlow_EDA_APP allows you to interactively explore your data using various charts, including line plots, bar plots, scatter plots, histograms, and more. Visualizing data patterns and relationships has never been easier!
-
Statistical Analysis: Perform a wide range of statistical analysis on your dataset, including descriptive statistics, correlation analysis, and inferential statistics. StatFlow_EDA_APP aims to make complex statistical procedures accessible to everyone.
-
Pandas Profiling Report: Leveraging the power of Pandas Profiling, StatFlow_EDA_APP automatically generates a comprehensive report that provides a detailed overview of your data. This report includes summary statistics, data types, missing value analysis, and more, helping you to better understand your dataset quickly.
To use StatFlow_EDA_APP, follow these simple steps:
-
Install the required dependencies by running:
pip install streamlit pandas-profiling
. -
Clone the StatFlow_EDA_APP repository to your local machine.
-
Navigate to the project directory and run the app using the following command:
streamlit run app.py
. -
Once the app is running, you can upload your dataset in CSV format or any other compatible format.
-
Explore your data by selecting variables and visualizations of interest.
-
Utilize the generated Pandas Profiling report to get an in-depth understanding of your dataset.
Streamlit is an easy-to-use Python library that allows you to create web applications for data science and machine learning projects. With its simple syntax, you can quickly turn your data scripts into shareable web apps.
To learn more about Streamlit, visit their official documentation: Streamlit Documentation
Pandas Profiling is an open-source Python library that generates an exploratory analysis report for a given dataset. It provides a comprehensive overview of the dataset's characteristics, including basic statistics, data types, correlations, missing values, and more.
To learn more about Pandas Profiling, check out their GitHub repository: Pandas Profiling on GitHub
We welcome contributions to enhance the functionality and user experience of StatFlow_EDA_APP. If you encounter any issues, have ideas for new features, or want to fix bugs, please feel free to submit a pull request.
StatFlow_EDA_APP is the Apache-2.0 license. Feel free to use and modify the application as per the terms of the license.