
1st MLSecOps Project (C2) - High Level Overview

This is a Machine Learning 'operationalization' project [showcasing the know-how of determining deployment targets, enabling foresights, baselining performance metrics, scanning logs, and benchmarking], which was part of the ML Engineer for Microsoft Azure Nanodegree Scholarship (preparing for the Microsoft [Designing and Implementing a Data Science Solution on Azure] Exam DP-100).

The Problem Statement:

The business goal/objective was binary classification inferencing [for decision-intel foresights] on product sales: predicting whether a bank client will subscribe to the product (a term deposit) when contacted. ... more on the Bank [Direct] Marketing dataset whitepaper

The Solution:

Rather than the usual "VotingEnsemble", the best performing model was an Azure AutoML [MaxAbsScaler & LightGBM] algorithm, which scored a 95.09% weighted AUC (in contrast to the other run type, a pipeline whose best model with the same algorithm scored a 94.5% weighted AUC). AutoML was used to train the best ML model before its OpenAPI endpoint was deployed for production-ready consumption by the other ecosystem microservices (including Azure Automation and/or Logic Apps) for optimal 'Decision Intelligence MLSecOps' synergy.
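As a rough sketch, a run like this could be configured with the Azure ML Python SDK (v1); the dataset name matches the "Bankmarketing" dataset registered later, but the label column name (`y`) and timeout are assumptions, not the project's verified values:

```python
from azureml.core import Dataset, Workspace
from azureml.train.automl import AutoMLConfig

ws = Workspace.from_config()  # reads config.json downloaded from the portal
ds = Dataset.get_by_name(ws, name="Bankmarketing")

# Classification on the term-deposit subscription column, optimizing the
# same primary metric reported above (weighted AUC).
automl_config = AutoMLConfig(
    task="classification",
    primary_metric="AUC_weighted",
    training_data=ds,
    label_column_name="y",          # assumed label column
    compute_target="automl-cluster",  # assumed cluster name
    experiment_timeout_minutes=30,
    max_concurrent_iterations=4,
)
```

The config would then be submitted with `Experiment(ws, "1st-AutoML").submit(automl_config)`, and the best run retrieved via `run.get_output()`.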

Architectural Diagram

Below is the formal diagram related to the visualization of the project's main steps (as listed in the next section):

Project-key-steps

Areas of Improvement

Potential future improvements could be identified as:

  • Retraining Hyperparameter Tuning with HyperDrive using Bayesian and/or 'Entire Grid' sampling.
  • Extending AutoML config to include more parameters.
  • Documenting XAI and [operationalizing] model [explanation] as a web service, using the [interpretability package] to explain ML models & predictions.
  • Exporting Models with ONNX to be deployed to a DLT connected Dapp device.
  • Integrating App Insights with Sentinel and/or CASB.

[Optional] Standout Suggestions

Standout suggestions that were attempted include:

1st MLSecOps Project (C2) - Low Level Specification (Key Steps)

Screencast Recording Documentation

The above recordings' links are for the MLSecOps demo of the entire process of the actionable [production-ready] project, including showcasing of:

  1. Working deployed ML model endpoint.
  2. Deployed Pipeline.
  3. Available AutoML Model.
  4. Successful API requests to the endpoint with a JSON payload.

Required Screenshots' Documentation

Below are the required screenshots demonstrating [along with the above screencasts and descriptions] that the completed project's [overall end-to-end workflow's] critical stages meet the criteria of its various main steps' specifications:

Part I: Machine Learning Ops Principles

Key Step #1: MLSecOps Prepping & Authentication:

Mainly creating a Service Principal account and associating it with a specific workspace.

Role Owner authorization to perform roleAssignments/write over scope - NO ERROR OR TRACEBACK

```shell
az ml workspace share -w workspace1st -g dsvm1st --user --- --role owner
```

1st-AutoML-001 Role Owner - NO ERROR OR TRACEBACK (local; already exists)

Udacity: no authorization to perform roleAssignments/write over scope
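The full authentication step could be sketched as below with the Azure CLI and its v1 `azure-cli-ml` extension; `ml-auth` is an assumed Service Principal name, and `<clientId>`/`<objectId>` are placeholders taken from the command output:

```shell
# Create a Service Principal for non-interactive authentication
az ad sp create-for-rbac --name ml-auth

# Look up its object id using the clientId printed above
az ad sp show --id <clientId> --query objectId -o tsv

# Associate the Service Principal with the workspace in the Owner role
az ml workspace share -w workspace1st -g dsvm1st --user <objectId> --role owner
```

Note that `az ml workspace share` succeeds only when the logged-in account itself has `roleAssignments/write` over the scope, which is why the Udacity lab account failed above.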

Part II: Deploy model in Azure ML Studio

Key Step #2: Automating an ML Experiment:

Creating an experiment using Automated ML, configuring a compute cluster, and using it to run an experiment.
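The compute-cluster part of this step might look like the following SDK (v1) sketch; the cluster name, VM size, and node counts are assumptions for illustration:

```python
from azureml.core import Workspace
from azureml.core.compute import AmlCompute, ComputeTarget

ws = Workspace.from_config()

cluster_name = "automl-cluster"  # assumed name, reused by the AutoML config
try:
    # Reuse the cluster if it already exists in the workspace
    compute_target = ComputeTarget(workspace=ws, name=cluster_name)
except Exception:
    # Otherwise provision a small CPU cluster for the experiment
    config = AmlCompute.provisioning_configuration(
        vm_size="STANDARD_DS3_V2",
        min_nodes=1,
        max_nodes=4,
    )
    compute_target = ComputeTarget.create(ws, cluster_name, config)
    compute_target.wait_for_completion(show_output=True)
```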

Create a new AutoML run

The submission includes screenshots of:
  • “Registered Datasets” in ML Studio shows "Bankmarketing" dataset available:

1st-AutoML-002 Registered Datasets in ML Studio shows [Bankmarketing] dataset available

  • The Auto ML experiment is shown as completed:

Auto ML experiment is shown as completed

  • Screenshot of the Auto ML Best Model after the experiment completes:

Auto ML Best Model after experiment completes

Key Step #3: Deploying the Best Model:

Deploying the AutoML run's Best Model for interacting with the HTTP API service and/or the model [over POST requests].

Deploy a model and consume a model endpoint via an HTTP API

The submission includes screenshots of:
  • Endpoints section in Azure ML Studio, showing that “Application Insights enabled” says “true”:

1st-AutoML-003b Endpoints section in Azure ML Studio, showing that 'Application Insights enabled' says 'true'

Key Step #4: Logs Enabling:

Retrieving logs by enabling Application Insights at deploy time [with a check-box, or later by running code].

  • Logging is enabled by running the provided logs.py script:

1st-AutoML-004a Logging is enabled by running the provided logs.py script
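A sketch of what such a logs.py could contain with SDK v1; the deployed service name here is an assumption:

```python
from azureml.core import Workspace
from azureml.core.webservice import Webservice

ws = Workspace.from_config()

# Look up the deployed service by name (assumed name for illustration)
service = Webservice(workspace=ws, name="automl-deploy-1")

# Enable Application Insights after deployment, then dump the service logs
service.update(enable_app_insights=True)
print(service.get_logs())
```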

Key Step #5: Documenting Swagger:

Consuming the deployed model using the [OpenAPI] Swagger JSON file to interact with the trained model.

  • Swagger runs on localhost showing the HTTP API methods and responses for the model:

1st-AutoML-005x Swagger runs on localhost showing the HTTP API methods and responses for the model
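The localhost Swagger setup can be sketched as follows, assuming the spec URL is taken from the SDK's `service.swagger_uri` and the ports are free to choose:

```shell
# Download the deployed service's Swagger/OpenAPI spec
curl -o swagger.json "<service.swagger_uri>"

# Serve swagger.json locally so the UI can load it
python3 -m http.server 8000 &

# Run Swagger UI on localhost
docker run -d -p 9000:8080 swaggerapi/swagger-ui
```

Browsing to http://localhost:9000 and pointing the explore bar at http://localhost:8000/swagger.json then shows the model's HTTP API methods and responses.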

Key Step #6: Consuming Model Endpoints:

Consuming the deployed model using the endpoint's scoring_uri and its key that was generated post deployment.

  • endpoint.py script runs against the API producing JSON output from the model:

1st-AutoML-004b endpoint and benchmark
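A minimal, standard-library-only sketch of such an endpoint.py; the scoring URI, key, and feature names below are placeholders, not the project's real values:

```python
import json
import urllib.request


def build_request(scoring_uri: str, key: str, records: list) -> urllib.request.Request:
    """Build the authenticated POST request the scoring endpoint expects."""
    body = json.dumps({"data": records}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {key}",
    }
    return urllib.request.Request(scoring_uri, data=body, headers=headers)


if __name__ == "__main__":
    # Placeholder values; replace with the endpoint's real scoring_uri and key
    req = build_request(
        "http://localhost:6789/score",
        "REPLACE_WITH_KEY",
        [{"age": 35, "job": "technician", "marital": "married"}],
    )
    print(req.get_method(), req.full_url)
    # Sending it (requires the live endpoint) would return the model's JSON:
    #   with urllib.request.urlopen(req) as resp:
    #       print(json.loads(resp.read()))
```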

  • [OPTIONAL] Apache Benchmark (ab) runs against the HTTP API using authentication keys to retrieve performance results:

AutoML - Apache Benchmark (ab) runs against the HTTP API using authentication keys
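The benchmark run can be sketched with standard `ab` flags; `data.json` (the same JSON payload), `PRIMARY_KEY`, and `SCORING_URI` are placeholders to be filled from the deployed endpoint:

```shell
# POST the JSON payload 10 times and report latency statistics
ab -n 10 -v 4 \
   -p data.json -T "application/json" \
   -H "Authorization: Bearer ${PRIMARY_KEY}" \
   "${SCORING_URI}"
```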

Part III: Publish an ML Pipeline

Key Step #7: Pipeline Creating and Publishing:

Updating a Jupyter Notebook with the already created [same AutoML run's] keys, URI, dataset, cluster, and model names.

Create and publish a pipeline
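The create-and-publish flow might be sketched like this with SDK v1; the experiment and pipeline names are assumptions, and the training step list stands in for the AutoML step built earlier in the notebook:

```python
from azureml.core import Experiment, Workspace
from azureml.pipeline.core import Pipeline

ws = Workspace.from_config()

# `steps` would hold the AutoML training step assembled earlier in the notebook
pipeline = Pipeline(workspace=ws, steps=[...])  # placeholder step list
pipeline_run = Experiment(ws, "pipeline-rest-endpoint").submit(pipeline)
pipeline_run.wait_for_completion()

# Publishing exposes the REST endpoint shown as ACTIVE in the screenshots
published = pipeline_run.publish_pipeline(
    name="Bankmarketing Train",
    description="Training bankmarketing pipeline",
    version="1.0",
)
print(published.endpoint)
```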

The submission includes screenshots of:

  • The pipeline section of Azure ML studio, showing that the pipeline has been created:

Udacity - pipeline section of Azure ML studio, showing that the pipeline has been created

  • The Bankmarketing dataset with the AutoML module:

Udacity - Registered Datasets in ML Studio shows [Bankmarketing] dataset available

  • The “Published Pipeline overview”, showing a REST endpoint and a status of ACTIVE:

Udacity - Published Pipeline overview, showing a REST endpoint and a status of ACTIVE

Udacity - Notebook, showing a REST endpoint and a status of ACTIVE

Configure a pipeline with the Python SDK

A screenshot of the Jupyter Notebook is included in the submission:

  • showing the “Use RunDetails Widget” with the step runs:

Udacity - Use RunDetails Widget with the step runs

Use a REST endpoint to interact with a Pipeline

The submission includes screenshots of:

  • ML studio showing the pipeline endpoint as Active:

ML studio showing the pipeline endpoint as Active

  • ML studio showing the scheduled run:

Udacity - ML studio showing the scheduled run
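Triggering the published pipeline over REST, and scheduling the run shown above, could be sketched as follows; the endpoint URL and experiment name are placeholders, and the auth header would come from the SDK's `InteractiveLoginAuthentication().get_authentication_header()`:

```python
import json
import urllib.request


def build_trigger_request(rest_endpoint: str, auth_header: dict,
                          experiment_name: str) -> urllib.request.Request:
    """Build the POST the published-pipeline REST endpoint expects to start a run."""
    body = json.dumps({"ExperimentName": experiment_name}).encode("utf-8")
    headers = {"Content-Type": "application/json", **auth_header}
    return urllib.request.Request(rest_endpoint, data=body, headers=headers)


# A recurring (scheduled) run would then use the SDK, e.g. (not runnable here):
#   from azureml.pipeline.core import Schedule, ScheduleRecurrence
#   recurrence = ScheduleRecurrence(frequency="Day", interval=1)
#   Schedule.create(ws, name="bankmarketing-daily",
#                   pipeline_id=published_pipeline.id,
#                   experiment_name="pipeline-rest-endpoint",
#                   recurrence=recurrence)
```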