Skip to content

Latest commit

 

History

History
183 lines (126 loc) · 8.2 KB

README.md

File metadata and controls

183 lines (126 loc) · 8.2 KB

CloudAISim: A toolkit for modelling and simulation of modern applications in AI-driven cloud computing environments

Abhimanyu Bhowmik, Madhushree Sannigrahi, Deepraj Chowdhury, Ajoy Dey, Sukhpal Singh Gill

paper paper dataset models slides

Abstract: There is a very significant knowledge gap between Artificial Intelligence (AI) and a multitude of industries that exist in today’s modern world. This is primarily attributable to the limited availability of resources and technical expertise. However, a major obstacle is that AI needs to be flexible enough to work in many different applications, utilising a wide variety of datasets through cloud computing. As a result, we developed a benchmark toolkit called CloudAISim to make use of the power of AI and cloud computing in order to satisfy the requirements of modern applications. The goal of this study is to come up with a strategy for building a bridge so that AI can be utilised in order to assist those who are not very knowledgeable about technological advancements. In addition, we modelled a healthcare application as a case study in order to verify the scientific reliability of the CloudAISim toolkit and simulated it in a cloud computing environment using Google Cloud Functions to increase its real-time efficiency. A non-expert-friendly interface built with an interactive web app has also been developed. Any user without any technical knowledge can operate the entire model, which has a 98% accuracy rate. The proposed use case is designed to put AI to work in the healthcare industry, but CloudAISim would be useful and adaptable for other applications in the future.



CloudAISim


Contributions

  1. Proposing a toolkit called CloudAISim for efficient explainable machine learning technique modelling and implementation in the healthcare domain.

  2. Finding the most accurate and responsive machine learning model for chronic as well as infectious diseases like diabetes, heart disease, breast cancer and COVID-19 in the healthcare domain.

  3. Simulating a prototype web application for the validation of CloudAISim to provide a visual display for data, models and the explainability of results.

  4. Implementing the CloudAISim in a cloud computing environment using Google Cloud Functions to increase real-time efficiency.

  5. Highlighting the promising future directions.

Dataset

Firstly, the “Breast Cancer Wisconsin (Diagnostic) Data Set” by “UCI ML Repository” is implemented on the novel methodology for the paper. The dataset contains tabular data with 32 features and over 569 data points. A fine needle aspirate (FNA) of a breast lump is used to generate the features from a digital image in 3-dimensional space as described by Bannett et al. They characterise the properties of all the observable cell nuclei in the image. Every data point is classified into either Benign(B) or Malignant(M) class.

Secondly, the architecture is applied to the “Heart Disease Cleveland dataset” Dataset by “UCI ML Repository”. The dataset constitutes over 300 patients’ data with 75 attributes, However, only 14 of the feature are taken into consideration for determining whether a patient has heart disease or not.

Thirdly, the “Diabetes dataset”, originally from the National Institute of Diabetes and Digestive and Kidney Diseases, is used in this paper. The goal is to determine if a patient has diabetes based on diagnostic parameters. The implemented Diabetes dataset is a subset of an enormous dataset with 10 attributes and 768 instances. All patients are Pima Indian females who are at least 21 years old.

Finally the “Covid-19” is a dataset, used in the paper which contains data from 800 people and 26 attributes such as their profession, health parameters and lifestyle parameter, and the risk factor of getting infection by covid is mentioned. The higher the risk factor the higher chance of getting infected by Covid. So we classified the person with a risk factor of more than 0.5 as high (1) and less than 0.5 as low (0).

Proposed Methodology



Overall view of the proposed model: CloudAISim

Code Execution Setup

Prerequisites:

  • Pip
    Python 3.7+
    autokeras==1.0.19
    keras==2.12.0
    tensorflow==2.12.0
    streamlit==1.21.0
    
  • Clone the Project Repository : git@github.com:abhimanyubhowmik/CloudAISim.git

  • Add these in Google Cloud functions:

    Feature_Engineering
    Auto_ML New
    Explainable Ai
    
  • Download the dependencies from "requirements.txt" pip install -r CloudAISim/App/requirements.txt

  • Create a Google BigQuery storage client with name auto-ml-and-explainable-ai and connect it with the project.

  • Connect the Streamlit App with GCS Storage Service: https://docs.streamlit.io/knowledge-base/tutorials/databases/gcs

  • Run the Stramlit App: streamlit run CloudAISim/App/1_📈_Data.py

Results

Case I: Breast Cancer Wisconsin Diagnosis

ROC-AUC curve and Confusion Matrix 75:25 train-validation ratio

Case II: Heart Disease Cleveland Dataset

ROC-AUC curve and Confusion Matrix 75:25 train-validation ratio



Case III: Diabetes Dataset

ROC-AUC curve and Confusion Matrix 75:25 train-validation ratio



Case IV: COVID-19 Dataset

ROC-AUC curve and Confusion Matrix 75:25 train-validation ratio




CloudAISim Application


The Dataset page with Breast Cancer Dataset


The EDA page with Breast Cancer Dataset



The Feature Engineering page with Breast Cancer Dataset



The Model page with Breast Cancer Dataset



The Results page with Breast Cancer Dataset



The LIME Explainer page with Breast Cancer Dataset




Cite our work

@article{bhowmik2024cloudaisim,
  title={CloudAISim: A toolkit for modelling and simulation of modern applications in al-driven cloud computing environments},
  author={Bhowmik, Abhimanyu and Sannigrahi, Madhushree and Chowdhury, Deepraj and Dey, Ajoy and Gill, Sukhpal Singh},
  journal={BenchCouncil Transactions on Benchmarks, Standards and Evaluations},
  pages={100150},
  year={2024},
  publisher={Elsevier}
}

Contact

For any queries, please contact: bhowmikabhimnayu@gmail.com