FL Accelerator Changelog

April/May 2023

Our major updates for April and May are the addition of Variational Autoencoders to the CCFRAUD example, the integration of Private Set Intersection for Vertical FL, and the introduction of Private Sandboxes supporting private AzureML workspaces.

FL Experience

Enriched the CCFRAUD example to support Variational Autoencoders for faster Vertical FL (instructions to run the example here, additional information there).
Introduced support for Private Set Intersection (preliminary step to a Vertical FL workflow) by leveraging the APSI library from MSR. More information here.

Provisioning

Added a new type of sandbox (Private Sandboxes) that supports private AzureML workspaces (i.e. workspaces behind a VNet).

Documentation

Simplified the Kaggle credentials part of the Bank Marketing tutorial.
Revamped our documentation for Vertical FL to make it more clear and accessible.

Bug fixes

Fixed a few issues in FL Experience (pytorch-version related bug) and Provisioning (uncaught validation issues with silos identities, deployment name conflicts for sandboxes, old AKS API's).

March 2023

We are changing the release process of our FL Accelerator repository. We are moving from monthly releases to continuous releases. We will however keep updating this changelog once a month to highlight the most significant changes.

Our major updates for this month are the addition of resources to provision various flavors of FL sandboxes, a new example for Vertical FL, and better support for confidentiality (encryption at rest, communications in Vertical FL).

FL Experience

Added support for data encryption at rest to the CCFRAUD example (instructions here).
Introduced new Vertical FL example: Bank Marketing Campaign Prediction.
Improved the communications in Vertical FL jobs (support for redis streams and encrypted communications, better logging to measure communications overhead).

Provisioning

Released a new bicep script to deploy silos with AKS clusters using confidential computes and set up open orchestrator.
Added support for including Kaggle credentials during workspace provisioning (useful for downloading Kaggle data to run our examples); see instructions included in our real-world examples documentation, or standalone instructions here.
Added a "confidentiality keyvault" to our provisioning scripts to host a custom key for encryption at rest during preprocessing and training.

Documentation

Added some documentation and the associated bicep scripts to easily deploy sandboxes for exploring different variants of FL setups.
Added instructions on how to leverage MLOps to restrict FL to peer-reviewed code.
Updated our generic instructions to support encryption of the uploaded data using confidentiality keyvault.

Repository structure

Added more CI/CD tests for the examples introduced last month.
Added unit tests for communication and encryption components.

February 2023 release

We are excited to announce the release of the February iteration of our FL Accelerator repository.

Some of the major updates we have made include the launch of a vertical federated learning feature, an FL pipeline that offers a native AML FL experience integrated with the factory engine, and benchmark results that reveal a comprehensive comparison between FL and non-FL experiments.

FL Experience

Implemented Vertical Federated Learning and offered a tutorial to run MNIST or CCFRAUD examples.
Introduced a scatter-gather pipeline that delivers a real AML FL native experience.
Conducted a comprehensive comparison b/w FL and non-FL experiments and the benchmark report can be accessed here.

Provisioning

Provided instructions and a script to facilitate the upload of local data to a silo storage.
Incremental improvements:
- Enhanced the network security rules and minimized the workspace dependencies for provisioning resources.

To get started, go here!

If you find a bug or have a feature request, please open an issue on the GitHub repository.

January 2023 release

We are excited to announce the release of the January iteration of our FL Accelerator repository.

Here below we list all the new features. The most significant changes are the introduction of a guide to help onboard people to FL on Azure ML, the implementation of Differential Privacy in all 3 industry-relevant examples, the support of third-party FL frameworks (NVFlare and Flower), and the support of distributed training.

FL Experience

Implemented Differential Privacy in all 3 industry-relevant examples, and added a tutorial that explains the process.
Introduced experimental support for third-party FL frameworks. The pneumonia example can now be run using NVFlare (see tutorial) or Flower (see tutorial).
Implemented distributed training (in each silo individually) in all 3 industry-relevant examples for scalable training.
Introduced support for multiple computes per silo (for instance: to use CPU's for pre-processing and GPU's for training).
Introduced resources for Exploratory Data Analysis on the credit card fraud example.
Incremental improvements:
- Made sure component scripts can run locally to facilitate authoring and debugging.
- Fixed a bug about data loading for the MNIST example.

Provisioning

Added instructions on how to properly configure Confidential Computes so all of the resources can be properly utilized.
Updated the bicep templates to also allow for GPU's provisioning (not restricted to CPU's anymore).

Documentation

Introduced a guide to help people plan their onboarding to FL on Azure ML.

To get started, go here!

If you find a bug or have a feature request, please open an issue on the GitHub repository.

November 2022 release

We are excited to announce the release of the November iteration of our FL Accelerator repository.

Here below we list all the new features. The most significant ones are the addition of 3 industry-relevant examples (credit card fraud detection, named entity recognition, pneumonia detection), and the introduction of a Troubleshooting Guide.

FL Experience

Added 3 industry-relevant examples. The examples include 2 jobs. The first one (generic) downloads some public data, partitions them, and uploads them to the silos. The second job (example-specific) trains a model using FL. The 3 examples address the following topics:
- credit card fraud detection (finance example);
- named entity recognition (NLP example);
- pneumonia detection (medical imaging example).
Fixed a bug causing the contents of the config.json file to take precedence over the CLI arguments (subscription id, resource group, workspace name) when submitting a job. Now, the CLI arguments takes precedence over the config.yaml file, which itself takes precedence over the config.json file.

Documentation

Introduced a new Troubleshooting Guide to help you troubleshoot common issues. We will keep adding to it as we become aware of more common issues.

Repository structure

Improved the documentation home page by adding some pictures and introducing the industry-relevant examples.
Revisited our CI/CD processes for better agility (enable concurrent jobs, accommodate token expiration issue).

October 2022 release

We are excited to announce the release of the October iteration of our FL Accelerator repository.

Here are the new features.

Provisioning

A new provisioning cookbook so you can pick and choose the setup that best suits your needs, based on the provided templates (orchestrator behind vnet, silos behind vnets, silos using an AKS cluster with confidential compute, silos using an existing storage account...).
All bicep scripts can now be used as standalone to adapt to your specific infrastructure needs.

Documentation

Instructions on how to provision external silos.
Tutorial on how to adapt literal and factory code to your scenarios.
Introduced "one-click deployments" for all bicep templates.

FL Experience

Using AzureML SDK subgraphs (preview) as a basis for the factory code sample, so that writing an FL experiment is easier and more flexible, and the experiment graph is shown better in the Azure ML UI.
Added Subscription ID, Resource Group, and Workspace Name as CLI arguments for job submission.

Repository structure

Enabled CI/CD on the repo to ensure nothing breaks.

September 2022 release

We are excited to announce the release of the September iteration of our FL Accelerator repository.

Here are the new features.

Repository structure

Cleaned up repository - removed redundant directories/files.

Provisioning

Auto-provisioning scripts to create a complete sandbox using bicep templates (by default an open sandbox on 3 regions). An experimental script with VNets and private endpoints is also available.
A quickstart to run an FL demo in minutes, and easily run FL jobs on your own data!

Documentation

Single location for all documentation.

FL Experience

Easy navigation using Azure Storage Explorer to discover models of various iterations, here is a more intuitive path (<experiment-name>/<iteration-num>/<model-name>)
Introduction of "factory" code to simplify the creation of FL experiments.
- Decoupled components for more readability.
- Soft validation to check the correct permissions of your assets.
Metrics:
- The combined losses and model performances of several silos can now be seen at the pipeline level.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CHANGELOG.md

CHANGELOG.md

FL Accelerator Changelog

April/May 2023

FL Experience

Provisioning

Documentation

Bug fixes

March 2023

FL Experience

Provisioning

Documentation

Repository structure

February 2023 release

FL Experience

Provisioning

January 2023 release

FL Experience

Provisioning

Documentation

November 2022 release

FL Experience

Documentation

Repository structure

October 2022 release

Provisioning

Documentation

FL Experience

Repository structure

September 2022 release

Repository structure

Provisioning

Documentation

FL Experience

Files

CHANGELOG.md

Latest commit

History

CHANGELOG.md

File metadata and controls

FL Accelerator Changelog

April/May 2023

FL Experience

Provisioning

Documentation

Bug fixes

March 2023

FL Experience

Provisioning

Documentation

Repository structure

February 2023 release

FL Experience

Provisioning

January 2023 release

FL Experience

Provisioning

Documentation

November 2022 release

FL Experience

Documentation

Repository structure

October 2022 release

Provisioning

Documentation

FL Experience

Repository structure

September 2022 release

Repository structure

Provisioning

Documentation

FL Experience