Dynamic Risk Assessment System

Project for ML DevOps Engineer Nanodegree, unit 5.

Description

A company with 10,000 corporate clients needs to create, deploy, and monitor a risk assessment ML model that estimates the attrition risk of each client. If the model is accurate, client managers can contact the clients at highest risk and avoid losing clients and revenue.

Creating and deploying the model isn't the end of the work, though. The industry is dynamic and constantly changing, and a model that was created a year or even a month ago may no longer be accurate today. Because of this, the model needs regular monitoring to ensure that it remains accurate and up to date. This project provides scripts to re-train, re-deploy, monitor, and report on the ML model. In this way, the company can get risk assessments that are as accurate as possible and minimize client attrition.

Prerequisites

Python 3
A Linux environment may be needed; on Windows, one is available through WSL

Dependencies

The project's dependencies are listed in the requirements.txt file.

Installation

Use the pip package manager to install the dependencies from requirements.txt. It's recommended to install them in a separate virtual environment.

pip install -r requirements.txt

or through pipenv:

sudo apt install pipenv
pipenv shell
pipenv install

Steps Overview

Data ingestion: Automatically check whether new data is available for model training. Compile all training data into a single training dataset and save it to the output folder.
Training, scoring, and deploying: Write scripts that train an ML model to predict attrition risk and score that model. Save the model and the scoring metrics.
Diagnostics: Determine and save summary statistics for a dataset. Time the performance of some functions. Check for dependency changes and package updates.
Reporting: Automatically generate plots and a PDF document that report on model metrics and diagnostics. Provide API endpoints that return model predictions and metrics.
Process Automation: Create a script and a cron job that automatically run all previous steps at regular intervals.
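
The process automation step amounts to chaining the scripts listed under Usage. A minimal sketch of such a runner follows, assuming the scripts are run in the order shown and that a non-zero exit code should stop the run. The names PIPELINE and run_pipeline are hypothetical, and the repository's fullprocess.py may apply additional logic that this README does not describe.

```python
import subprocess
import sys

# Script names as listed in the Usage section; the order is an assumption.
PIPELINE = [
    "ingestion.py",
    "training.py",
    "scoring.py",
    "deployment.py",
    "diagnostics.py",
    "reporting.py",
]


def run_pipeline(scripts=PIPELINE, runner=subprocess.run):
    """Run each script in order and stop at the first one that fails."""
    completed = []
    for script in scripts:
        # Use the current interpreter so the active virtual environment is kept.
        result = runner([sys.executable, script])
        if result.returncode != 0:
            raise RuntimeError(f"{script} exited with code {result.returncode}")
        completed.append(script)
    return completed
```

The runner parameter exists only so the function can be exercised without launching real subprocesses; calling run_pipeline() from the repository root uses subprocess.run.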

Usage

1- Edit config.json file to use practice data

"input_folder_path": "practicedata",
"output_folder_path": "ingesteddata", 
"test_data_path": "testdata", 
"output_model_path": "practicemodels", 
"prod_deployment_path": "production_deployment"

2- Run data ingestion

python ingestion.py

Artifacts output:

ingesteddata/finaldata.csv
ingesteddata/ingestedfiles.txt
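
To illustrate how the config values and the two artifacts relate, here is a minimal sketch of an ingestion step. It assumes that config.json is a JSON object holding the keys shown in step 1, that the input folder contains CSV files with identical columns, and that exact duplicate rows should be dropped. The helper names load_config and ingest are hypothetical, and the sketch uses only the standard library, whereas the repository's ingestion.py may well use pandas.

```python
import csv
import json
from pathlib import Path


def load_config(path="config.json"):
    """Return the settings stored in config.json as a dict."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def ingest(input_dir, output_dir):
    """Merge every CSV in input_dir into one de-duplicated finaldata.csv."""
    input_dir, output_dir = Path(input_dir), Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)

    header, rows, seen, ingested = None, [], set(), []
    for csv_path in sorted(input_dir.glob("*.csv")):
        with open(csv_path, newline="", encoding="utf-8") as handle:
            reader = csv.reader(handle)
            file_header = next(reader, None)
            if file_header is None:
                continue  # skip empty files
            if header is None:
                header = file_header
            elif file_header != header:
                raise ValueError(f"{csv_path.name} has a different header")
            for row in reader:
                key = tuple(row)
                if key not in seen:  # drop exact duplicate records
                    seen.add(key)
                    rows.append(row)
        ingested.append(csv_path.name)

    with open(output_dir / "finaldata.csv", "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        if header is not None:
            writer.writerow(header)
        writer.writerows(rows)
    # Record which source files went into the merged dataset.
    (output_dir / "ingestedfiles.txt").write_text("\n".join(ingested), encoding="utf-8")
    return ingested
```

With the practice configuration, config = load_config() followed by ingest(config["input_folder_path"], config["output_folder_path"]) would read from practicedata and write both artifacts to ingesteddata.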

3- Model training

python training.py

Artifacts output:

practicemodels/trainedmodel.pkl
practicemodels/encoder.pkl

4- Model scoring

python scoring.py

Artifacts output:

practicemodels/latestscore.txt
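
This README does not say which metric scoring.py writes to latestscore.txt. As an illustration only, the sketch below assumes a binary F1 score, with label 1 marking a client who leaves, and stores it as plain text. The function names binary_f1 and save_score are hypothetical.

```python
from pathlib import Path


def binary_f1(y_true, y_pred):
    """F1 score for binary labels, where 1 is the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives, so the score is defined as zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def save_score(score, model_dir):
    """Write the score as plain text to <model_dir>/latestscore.txt."""
    model_dir = Path(model_dir)
    model_dir.mkdir(parents=True, exist_ok=True)
    (model_dir / "latestscore.txt").write_text(str(score), encoding="utf-8")
```

Keeping the score in a one-line text file makes it easy for a later step to compare a new model's score with the deployed one.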

5- Model deployment

python deployment.py

Artifacts output:

production_deployment/ingestedfiles.txt
production_deployment/trainedmodel.pkl
production_deployment/latestscore.txt
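
Judging by the artifact list, deployment copies three existing files into the production folder. A minimal sketch under that assumption follows; the function name deploy is hypothetical, and the folder arguments would come from config.json.

```python
import shutil
from pathlib import Path


def deploy(ingested_dir, model_dir, prod_dir):
    """Copy the ingestion record, the model, and its score to prod_dir."""
    prod_dir = Path(prod_dir)
    prod_dir.mkdir(parents=True, exist_ok=True)
    sources = [
        Path(ingested_dir) / "ingestedfiles.txt",
        Path(model_dir) / "trainedmodel.pkl",
        Path(model_dir) / "latestscore.txt",
    ]
    copied = []
    for source in sources:
        target = prod_dir / source.name
        # copy2 also preserves file timestamps.
        shutil.copy2(source, target)
        copied.append(target)
    return copied
```

With the practice configuration this would be called as deploy("ingesteddata", "practicemodels", "production_deployment").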

6- Run diagnostics

python diagnostics.py
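
Two of the diagnostics named in the Steps Overview, summary statistics and timing, can be sketched with the standard library alone. Which statistics diagnostics.py actually computes is not stated here, so mean, median, and standard deviation are assumptions, and the function names column_summary and time_script are hypothetical.

```python
import statistics
import subprocess
import sys
import time


def column_summary(values):
    """Return mean, median, and sample standard deviation of one numeric column."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        # stdev needs at least two data points
        "stdev": statistics.stdev(values) if len(values) > 1 else 0.0,
    }


def time_script(script, *args):
    """Run a Python script in a subprocess and return the elapsed seconds."""
    start = time.perf_counter()
    subprocess.run([sys.executable, script, *args], check=True)
    return time.perf_counter() - start
```

For example, time_script("ingestion.py") and time_script("training.py") would give rough wall-clock timings for those two steps.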

7- Run reporting

python reporting.py

Artifacts output:

practicemodels/confusionmatrix.png

8- Run Flask App

python app.py

9- Run API endpoints

python apicalls.py

Artifacts output:

practicemodels/apireturns.txt

10- Edit config.json file to use production data

"input_folder_path": "sourcedata",
"output_folder_path": "ingesteddata",
"test_data_path": "testdata",
"output_model_path": "models",
"prod_deployment_path": "production_deployment"

Train production model:

python training.py

11- Full process automation

python fullprocess.py

12- Cron job

Start cron service

sudo service cron start

Edit crontab file

sudo crontab -e

Select option 3 to edit the file with the vim text editor
Press i to enter insert mode
Add the cron job from cronjob.txt, which runs fullprocess.py every 10 minutes
To save, press Esc, then type :wq and press Enter

View crontab file

sudo crontab -l
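
The contents of cronjob.txt are not reproduced in this README. As a sketch of what a 10-minute schedule looks like in crontab syntax, the hypothetical helper below builds such a line; the command and path passed to it are placeholders, not the repository's actual values.

```python
def cron_entry(command, every_minutes=10):
    """Build a crontab line that runs `command` every N minutes (1 to 59)."""
    if not 1 <= every_minutes <= 59:
        raise ValueError("every_minutes must be between 1 and 59")
    # Fields: minute, hour, day of month, month, day of week, then the command.
    # "*/N" in the minute field means "every N minutes".
    return f"*/{every_minutes} * * * * {command}"
```

Calling cron_entry("python /path/to/fullprocess.py") returns the line `*/10 * * * * python /path/to/fullprocess.py`, which is the form a crontab entry for this project would take.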
