Skip to content

Dataiku DSS is a Automated Data Science Application from Dataiku which automates all the Data Science tasks starting from Data Cleaning to Deploying Model in Production. Worked Extensively on onboarding the application on three instances of the application : POC, Sandbox, Production

Notifications You must be signed in to change notification settings

vignan98/Citizen-Data-Science-Platform

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 

Repository files navigation

☑️ Citizen Data Science Platform :

DATAIKU  PYTHON  LINUX  DOCKER  KUBERNETES  MYSQL  AZURE  AWS  POSTGRESQL 

INTRODUCTION 🙋‍♂️

  • As a Data Scientist at Ericsson India Global Services I worked on configuring the entire Environmental set up of a complex, automated Data Science Application called DATAIKU which was onboared under the name Citizen Data Science Platform.
  • Co-ordinated the three instances of the application POC, Sandbox, Production in Microsfot Azure & Linux Environments.
  • Understanding the application and its design, Configuring Azure VMs, Handling production issues, onboarding new features were the primary tasks involved during the installation process
  • This Application requires knowledge of Data Science, Machine Learning to understand different workflows that were developed to automate the steps in Machine Learning

USE CASE & MY WORK 🖥️

  • Different business teams like Supply, Sourcing, Innovation have their Data in different database instances. The primary goal of these business teams is to analyse their data efficiently & develop a Machine Learning model that would cater the needs of different teams
  • As a part of Application-owners team our duties include understanding the use case,suggesting further steps in onboarding the data from postgresql/Mysql databases & supporting the business teams from Data Cleaning to Model deployment stage.
  • Documentation of the entire installation process was carried out to track the progress and install the same application on different instances.

APPLICATION IN PRODUCTION 🏃‍♂️

  • Various Security questions regarding Data privacy, visibility of the Data, leakage of data from databases, User controls, admin controls were brainstormed and handled effectively.
  • Once the application was pushed to production, users were onboarded to work on different use case projects. The issues related to server Management, log Management, Memory Management were handled efficiently & quickly.
  • The repeated tasks related to Log management, Disk management were automated using bash scripting.
  • Integration of the application with new tools (ex: Tableau) was made after co-ordinating with the vendor team(Dataiku).

About

Dataiku DSS is a Automated Data Science Application from Dataiku which automates all the Data Science tasks starting from Data Cleaning to Deploying Model in Production. Worked Extensively on onboarding the application on three instances of the application : POC, Sandbox, Production

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published