✨Hi there! ✨
This is the repository for the Min P paper! Here, you will find the following:
- Min P Code Implementation: The latest implementation of Min P sampling from the Hugging Face Transformers library as of June 2024.
- WandB logs of GPQA and GSM8K evals: Logs comparing results between Min P and Top P for both GPQA and GSM8K evaluations, at different truncation sampling parameters and temperature scaling values.
- Colab notebook to replicate GPQA and GSM8K evals: If you’d like to replicate the GPQA and GSM8K CoT (chain-of-thought) evaluations in the paper, you can do so with [PUBLIC]_Min_P_Evals_Replication_for_GPQA_and_GSM8K_COT.ipynb
- Logs for AlpacaEval Creative Writing: For logs of the independently run AlpacaEval Creative Writing evals for Min P, see https://github.com/IlyaGusev/quest (not affiliated with authors)
- Interactive Demo: For the independently created interactive demo, check out https://artefact2.github.io/llm-sampling/index.xhtml (not affiliated with authors)
- Please check whether Min P is already available in your inference stack before writing your own integration. It is currently supported in Transformers, vLLM, and many other libraries; Transformers merged Min P a few months back: huggingface/transformers#30639
To use it, simply pass `min_p` as a generation hyperparameter, the same way you would pass `top_p` or `temperature`:
```python
# Generate text with Min P sampling (min_p is passed just like top_p or temperature)
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # example checkpoint; any causal LM supported by Transformers works
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
input_ids = tokenizer("Once upon a time", return_tensors="pt").input_ids

output = model.generate(
    input_ids,
    do_sample=True,  # Enable sampling
    top_p=0.9,       # Cumulative probability threshold (optional; can be combined with min_p)
    min_p=0.1,       # Keep tokens with probability >= 0.1 * p(top token)
    max_length=50,   # Maximum length of generated text
)
```
- To integrate your own custom samplers, check out the changes in the above PR to see what you need to get them working. The actual implementation we copied into the paper lives in logits_process.py (https://github.com/huggingface/transformers/blob/80f2b1610fa17ebf582897c8611180cac38652f0/src/transformers/generation/logits_process.py#L4), but you would also need to change a number of other files that reference logits_process.py. What exactly needs to change depends entirely on how the inference engine is set up; for vLLM, the changes were much simpler: vllm-project/vllm#1642. A rough sketch of the core min_p filtering logic is included after this list.
- Do note that our evaluations were conducted with vLLM. This is important because vLLM applies temperature scaling before truncation sampling, whereas Hugging Face applies them in the reverse order, so you will see different behaviour depending on which engine you use (see the toy example after this list). I recommend vLLM for its faster speed and because the diversity gained from temperature is higher when it is applied before truncation (for creative writing, for example). You will probably get better benchmark scores with Hugging Face, but that somewhat defeats the purpose of using temperature sampling at all.
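For reference, here is a minimal sketch of the core min_p filtering logic written as a custom `LogitsProcessor`. This is a simplified illustration rather than the exact code in logits_process.py: it assumes the standard `LogitsProcessor.__call__(input_ids, scores)` interface and omits details such as a minimum number of tokens to keep.

```python
# Minimal sketch of a min_p-style filter as a custom LogitsProcessor.
# Simplified for illustration; not the exact Transformers implementation.
import torch
from transformers import LogitsProcessor


class MinPSketchLogitsProcessor(LogitsProcessor):
    def __init__(self, min_p: float, filter_value: float = -float("inf")):
        self.min_p = min_p
        self.filter_value = filter_value

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor) -> torch.FloatTensor:
        probs = torch.softmax(scores, dim=-1)
        top_probs, _ = probs.max(dim=-1, keepdim=True)
        # Mask out tokens whose probability falls below min_p * p(top token)
        tokens_to_remove = probs < (self.min_p * top_probs)
        return scores.masked_fill(tokens_to_remove, self.filter_value)
```

If your engine has no native min_p support, a processor like this can be passed to `model.generate(..., logits_processor=...)` wrapped in a `LogitsProcessorList`.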
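And here is a toy, engine-agnostic illustration of the ordering point above; the four-token logits and the parameter values are made up purely for demonstration.

```python
# Toy illustration of why the order of temperature scaling and min_p
# truncation matters; the logits and parameters below are arbitrary.
import torch

logits = torch.tensor([5.0, 3.0, 1.0, -1.0])
temperature, min_p = 3.0, 0.1

def kept_by_min_p(probs: torch.Tensor, min_p: float) -> int:
    # min_p keeps tokens with probability >= min_p * p(top token)
    return int((probs >= min_p * probs.max()).sum())

# Temperature applied before truncation (the order described above for vLLM):
probs_scaled_first = torch.softmax(logits / temperature, dim=-1)
print(kept_by_min_p(probs_scaled_first, min_p))  # 4 tokens survive truncation

# Truncation applied to the unscaled distribution (the reverse order):
probs_unscaled = torch.softmax(logits, dim=-1)
print(kept_by_min_p(probs_unscaled, min_p))  # only 2 tokens survive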
Let me know if you have other questions!