Add offline policy evaluation module and update dependencies #59

shaharbar1 · 2024-09-10T12:40:10Z

Changes

Introduced offline_policy_evaluator.py with classes for propensity score estimation and offline policy evaluation.
Introduced offline_policy_estimator.py with classes for offline policy estimation.
Updated pyproject.toml to include new dependencies: bokeh and optuna. Further adjusted existing dependencies to compatible versions and added python 3.12 support.
Changed .pre-commit-config.yaml to utilize nbstripout instead of nbdev_clean.
Added class method to PyBanditsBaseModel on base.py to allow seeing default values for arguments that were not passed to the model.
Added test_offline_policy_evaluator.py and test_offline_policy_estimator.py as a test suite for the OfflinePolicyEvaluator.
Added get_non_abstract_classes, visualize_via_bokeh and in_jupyter_notebook utility functions.

j3rom3c

Looks great to me, but my coding skills are not are your level. Just minor comments.
I would need a python notebook with an example to better understand how your new implementation works!

pyproject.toml

tests/test_offline_policy_evaluator.py

pybandits/offline_policy_evaluator.py

tests/test_offline_policy_evaluator.py

pybandits/offline_policy_evaluator.py

j3rom3c · 2024-09-17T13:31:46Z

Mainly general comments, since I am using the library for a project:

a print of the current "phase" of the process ("priors update", "q computation", "propensity score computation" ... )
a progress bar for MC sampling
Some processes may last because of the data (large contextual n dim ...), and it will inform user on the step currently processing ...

shaharbar1 · 2024-09-23T11:20:19Z

Mainly general comments, since I am using the library for a project:

a print of the current "phase" of the process ("priors update", "q computation", "propensity score computation" ... )

a progress bar for MC sampling
Some processes may last because of the data (large contextual n dim ...), and it will inform user on the step currently processing ...

@j3rom3c, note that:
First item - done.
Second item - this is already done using tqdm (See line #850 on offline_policy_evalutor.py).

### Changes * Introduced `offline_policy_evaluator.py` with classes for propensity score estimation and offline policy evaluation. * Introduced `offline_policy_estimator.py` with classes for offline policy estimation. * Updated `pyproject.toml` to include new dependencies: `bokeh` and `optuna`. Further adjusted existing dependencies to compatible versions and added python 3.12 support. * Changed .pre-commit-config.yaml to utilize nbstripout instead of nbdev_clean. * Added caching of dependencies on CI and CD. * Added class method to PyBanditsBaseModel on base.py to allow seeing default values for arguments that were not passed to the model. * Added test_offline_policy_evaluator.py and test_offline_policy_estimator.py as a test suite for the OfflinePolicyEvaluator. * Added `get_non_abstract_classes` and `visualize_via_bokeh` utility functions.

shaharbar1 requested review from adarmiento, dariodandrea and j3rom3c September 10, 2024 12:40

shaharbar1 force-pushed the feature/offline_policy_evaluation branch from 41a0a84 to b8ea4dd Compare September 10, 2024 12:41

shaharbar1 added the enhancement New feature or request label Sep 10, 2024

shaharbar1 force-pushed the feature/offline_policy_evaluation branch from b8ea4dd to cb05e5a Compare September 10, 2024 12:43

j3rom3c reviewed Sep 10, 2024

View reviewed changes

shaharbar1 force-pushed the feature/offline_policy_evaluation branch 14 times, most recently from b117ef0 to d5118c3 Compare September 15, 2024 11:12

shaharbar1 force-pushed the feature/offline_policy_evaluation branch 2 times, most recently from 2ba33e1 to d4d0dfc Compare September 23, 2024 11:08

shaharbar1 force-pushed the feature/offline_policy_evaluation branch 5 times, most recently from 0d618da to 703e25a Compare September 26, 2024 10:10

shaharbar1 force-pushed the feature/offline_policy_evaluation branch 17 times, most recently from 97e8e51 to c2d073f Compare October 10, 2024 08:52

shaharbar1 force-pushed the feature/offline_policy_evaluation branch 11 times, most recently from 92e979a to 2f136be Compare October 28, 2024 10:19

shaharbar1 force-pushed the feature/offline_policy_evaluation branch from 2f136be to 74da683 Compare October 28, 2024 10:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add offline policy evaluation module and update dependencies #59

Add offline policy evaluation module and update dependencies #59

shaharbar1 commented Sep 10, 2024 •

edited

Loading

j3rom3c left a comment •

edited

Loading

j3rom3c commented Sep 17, 2024

shaharbar1 commented Sep 23, 2024 •

edited

Loading

Add offline policy evaluation module and update dependencies #59

Are you sure you want to change the base?

Add offline policy evaluation module and update dependencies #59

Conversation

shaharbar1 commented Sep 10, 2024 • edited Loading

Changes

j3rom3c left a comment • edited Loading

Choose a reason for hiding this comment

j3rom3c commented Sep 17, 2024

shaharbar1 commented Sep 23, 2024 • edited Loading

shaharbar1 commented Sep 10, 2024 •

edited

Loading

j3rom3c left a comment •

edited

Loading

shaharbar1 commented Sep 23, 2024 •

edited

Loading