Overview
This project explores the application of generative modeling, specifically diffusion models, to the construction of investment portfolios. By leveraging techniques typically used in image generation, we aim to automate and optimize the creation of investment portfolios based on criteria such as asset class, geography, and strategy. This is the final project for the course CS236: Deep Generative Models, taught by Professor Stefano Ermon at Stanford University.
Project Objectives
Apply generative modeling techniques to financial data for portfolio generation.
Utilize diffusion models, typically used in image generation, to replicate and construct investment portfolios.
Evaluate the effectiveness of these models in generating realistic and effective portfolios based on input prompts.

Methodology
The approach is inspired by text-to-image generation models, such as the SDXL model, adapted for financial data. The core components include:
Dataset and Tokenization:
Collected data from over 6,000 funds and ETFs focused on US equities. Tokenized both descriptive and quantitative data for model training; a minimal tokenization sketch follows this subsection. Below is an example of the input data:
For the model output, fund and investor portfolio compositions with their precise asset allocations were used.
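To make the tokenization step concrete, here is a minimal sketch of how descriptive (categorical) and quantitative fields could be turned into a numeric vector. The field names, vocabularies, and values are illustrative assumptions, not the project's actual schema.

```python
# Minimal tokenization sketch (hypothetical field names and vocabularies,
# not the project's actual preprocessing).
import numpy as np

# Descriptive fields are mapped to integer tokens via per-field vocabularies.
DESCRIPTIVE_VOCABS = {
    "fund_type": {"ETF": 0, "Closed-End Fund": 1, "Mutual Fund": 2},
    "strategy":  {"Value": 0, "Growth": 1, "Thematic": 2, "Blend": 3},
    "cap":       {"Large-Cap": 0, "Mid-Cap": 1, "Small-Cap": 2},
}

def tokenize_fund(descriptive: dict, quantitative: dict) -> np.ndarray:
    """Turn one fund record into a fixed-length numeric vector."""
    # Categorical text -> integer ids (unknown values fall back to -1).
    desc_tokens = [
        DESCRIPTIVE_VOCABS[field].get(value, -1)
        for field, value in descriptive.items()
    ]
    # Quantitative fields are kept as floats; in practice any scaling or
    # normalization would be fit on the training set.
    quant_values = list(quantitative.values())
    return np.asarray(desc_tokens + quant_values, dtype=np.float32)

example = tokenize_fund(
    {"fund_type": "ETF", "strategy": "Value", "cap": "Large-Cap"},
    {"aum_musd": 1200.0, "expense_ratio": 0.0009, "n_holdings": 50},
)
print(example)  # -> [0. 0. 0. 1200. 0.0009 50.]
```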
Diffusion Model:
A VAE has been implemented to encode and decode the data probabilistically, optimized for low portfolio composition error. Within the diffusion pipeline, the classical U-Net has been replaced with a multi-layer perceptron (MLP) supported by cross-attention modules. The pipeline is applied to generate new portfolio samples through a forward and reverse process that gradually adds and then removes noise.
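As a rough illustration of these two components, the sketch below pairs a small VAE over portfolio weight vectors with an MLP denoiser that attends to prompt embeddings via cross-attention. Class names, layer sizes, and dimensions are assumptions for illustration, not the project's actual implementation.

```python
import torch
import torch.nn as nn

class PortfolioVAE(nn.Module):
    """Encodes a portfolio weight vector to a Gaussian latent and decodes it back."""
    def __init__(self, n_assets: int, latent_dim: int = 64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(n_assets, 256), nn.ReLU())
        self.to_mu = nn.Linear(256, latent_dim)
        self.to_logvar = nn.Linear(256, latent_dim)
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim, 256), nn.ReLU(),
            nn.Linear(256, n_assets), nn.Softmax(dim=-1),  # weights sum to 1
        )

    def forward(self, x):
        h = self.encoder(x)
        mu, logvar = self.to_mu(h), self.to_logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization
        return self.decoder(z), mu, logvar

class MLPDenoiser(nn.Module):
    """MLP that predicts noise in latent space, conditioned on prompt embeddings
    via cross-attention (queries from the latent, keys/values from the prompt)."""
    def __init__(self, latent_dim: int = 64, cond_dim: int = 64, n_heads: int = 4):
        super().__init__()
        self.in_proj = nn.Linear(latent_dim + 1, cond_dim)  # +1 for the timestep
        self.cross_attn = nn.MultiheadAttention(cond_dim, n_heads, batch_first=True)
        self.mlp = nn.Sequential(
            nn.Linear(cond_dim, 256), nn.SiLU(), nn.Linear(256, latent_dim),
        )

    def forward(self, z_t, t, prompt_emb):
        # z_t: (B, latent_dim), t: (B, 1), prompt_emb: (B, n_tokens, cond_dim)
        q = self.in_proj(torch.cat([z_t, t], dim=-1)).unsqueeze(1)  # (B, 1, cond_dim)
        attn_out, _ = self.cross_attn(q, prompt_emb, prompt_emb)
        return self.mlp(attn_out.squeeze(1))  # predicted noise, (B, latent_dim)
```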
Results
The project demonstrated that diffusion models could indeed generate portfolios that, while not exact, closely replicate the structure of actual portfolios based on input prompts.
Denoising Sequence:
A reverse process showing how the model reconstructs portfolios from noisy data.
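The reverse process can be illustrated with a standard DDPM-style sampling loop in latent space. The sketch below assumes the hypothetical MLPDenoiser above and a linear beta schedule; it is not the project's exact sampler.

```python
import torch

@torch.no_grad()
def sample_portfolio_latent(denoiser, prompt_emb, latent_dim=64, n_steps=1000):
    betas = torch.linspace(1e-4, 0.02, n_steps)
    alphas = 1.0 - betas
    alpha_bars = torch.cumprod(alphas, dim=0)

    z = torch.randn(prompt_emb.size(0), latent_dim)  # start from pure noise
    for step in reversed(range(n_steps)):
        t = torch.full((z.size(0), 1), step / n_steps)  # normalized timestep
        eps_hat = denoiser(z, t, prompt_emb)            # predict the noise
        # Standard DDPM posterior mean for the previous latent.
        coef = betas[step] / torch.sqrt(1.0 - alpha_bars[step])
        z = (z - coef * eps_hat) / torch.sqrt(alphas[step])
        if step > 0:
            z = z + torch.sqrt(betas[step]) * torch.randn_like(z)  # noise except at t=0
    return z  # decode with the VAE decoder to obtain portfolio weights
```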

Example: Buffett portfolio as of 09/2023
An example portfolio was generated based on the prompt: "Buffett, Closed-End Fund, Thematic, Value, Large-Cap, Sep 2023". The generated portfolio closely matched Warren Buffett's actual portfolio, though with some differences in asset allocation.
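For illustration only, the hypothetical pieces sketched above could be combined as follows to go from a prompt to decoded portfolio weights; the random embedding used here is a stand-in for the project's actual prompt conditioning.

```python
import torch

n_assets, latent_dim, cond_dim = 500, 64, 64
vae = PortfolioVAE(n_assets, latent_dim)
denoiser = MLPDenoiser(latent_dim, cond_dim)

# Stand-in for the prompt encoder: one embedding per prompt token, e.g.
# "Buffett", "Closed-End Fund", "Thematic", "Value", "Large-Cap", "Sep 2023".
prompt_tokens = torch.randint(0, 1000, (1, 6))
prompt_emb = torch.nn.Embedding(1000, cond_dim)(prompt_tokens)

z = sample_portfolio_latent(denoiser, prompt_emb, latent_dim, n_steps=50)
weights = vae.decoder(z)    # (1, n_assets), sums to 1 via the softmax decoder
print(weights.sum(dim=-1))  # tensor([1.0000], ...)
```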

Future Work
Model Accuracy: Enhancements to the VAE and diffusion models to reduce data loss and improve portfolio generation precision.
Dimensionality Expansion: Increasing model scope to handle multiple asset classes and international markets.
Alternative Architectures: Exploring models that incorporate underlying market data to condition the diffusion process.
Acknowledgements
Special thanks to Professor Stefano Ermon at Stanford University for his advice and to Elyas Obbad for mentorship throughout this project.




