A list of my open source contributions, and ML Qs and As

kddubey/blog

Latest commit: f456afe · Mar 9, 2025

My blog

Here also lies a list of my contributions to open source software.

Posts

Mostly simulations for some questions and answers on stats.stackexchange.com and stackoverflow.com.

[post] select_on_test.ipynb: Demonstrate that a model can simultaneously be selected and evaluated on a test set

[post] train_on_test_features: For high-rank data and a small test set, train a PCA on test set features to boost test set performance!

precision_drop.ipynb: A simple answer to: why did precision drop in production?

[post] auprc.ipynb: Demonstrate that integral approximators are trying to hurt you

db_sampling_rate.ipynb: Calculate a sampling rate for a database query

[post] negative_vs_downsampling.ipynb: What's the need to formulate negative sampling for contrastive training? (not done). Also investigated in sigltt/train.ipynb.

[post] var_pred_var_error: Does higher variance in predictions result in a higher-variance error estimate?

[post] sample_via_gumbel: Demonstrate that one can sample directly in log-space

langchain_save_all: Save all method calls. Inspired by this issue

My dumber code dumps are in ./dumpy/.
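
To illustrate the claim in the sample_via_gumbel entry above, here is a minimal sketch of the Gumbel-max trick using only the standard library. It is not taken from the notebook: the function name `sample_index_from_logits` and the example numbers are assumptions made for illustration. The idea is that adding independent Gumbel(0, 1) noise to each log-probability and taking the argmax yields a draw from the corresponding categorical distribution, so the logits never need to be exponentiated or normalized.

```python
import math
import random
from collections import Counter


def sample_index_from_logits(logits, rng=random):
    """Return index i with probability softmax(logits)[i] (Gumbel-max trick).

    Hypothetical helper for illustration. The logits are never
    exponentiated or normalized.
    """
    best_index = -1
    best_score = -math.inf
    for index, logit in enumerate(logits):
        u = rng.random()
        while u == 0.0:  # random() lies in [0, 1); redraw so log(u) is defined
            u = rng.random()
        gumbel_noise = -math.log(-math.log(u))  # standard Gumbel(0, 1) draw
        score = logit + gumbel_noise
        if score > best_score:
            best_index, best_score = index, score
    return best_index


if __name__ == "__main__":
    # Log-probabilities shifted so far down that math.exp would underflow to 0.0
    probs = (0.1, 0.2, 0.7)
    logits = [math.log(p) - 1000.0 for p in probs]
    rng = random.Random(0)
    n_draws = 20_000
    counts = Counter(sample_index_from_logits(logits, rng) for _ in range(n_draws))
    for index, p in enumerate(probs):
        print(index, p, counts[index] / n_draws)
```

The empirical frequencies should land close to `probs`, even though exponentiating these shifted logits directly would give all zeros. The argmax is unaffected by adding a constant to every logit, which is why no normalization step is needed.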

Setup

Requires Python 3.8+.

Create a virtual environment named blog using venv:

cd /your/venvs

python -m venv blog

source blog/bin/activate

python -m pip install -r /path/to/blog/requirements.txt
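
As an optional sanity check after activating the environment, a short script along these lines can confirm that the interpreter satisfies the Python 3.8+ requirement and is running inside a virtual environment. This is a sketch, not part of the repository: the names `MIN_VERSION`, `meets_min_version`, and `in_virtual_env` are hypothetical.

```python
import sys

MIN_VERSION = (3, 8)  # assumption: taken from the "Python 3.8+" requirement


def meets_min_version(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) version is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum


def in_virtual_env():
    """Return True when running inside a venv-created environment.

    Inside a venv, sys.prefix points at the environment, while
    sys.base_prefix points at the base Python installation.
    """
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("Python version OK:", meets_min_version())
    print("Inside a virtual environment:", in_virtual_env())
```

Run it with the same `python` used for the install step, so that it reports on the interpreter inside the activated environment.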

If a notebook says that it needs to run on a GPU machine, and you have a Google account, open that notebook in Google Colab.

Usage

Interact with the code via Jupyter. I like VS Code notebooks.
