Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
cerebrium.toml		cerebrium.toml
main.py		main.py

README.md

Fast Stable Diffusion

This example includes the model weights in the base image, which loads faster. The total response time is around 18 seconds.

The key part is the shell_commands in the cerebrium.toml file

shell_commands = ["export HF_HOME=/cortex/.cache/huggingface","python3 -c \"import torch; from diffusers import StableDiffusionPipeline; StableDiffusionPipeline.from_pretrained('stabilityai/stable-diffusion-2-1', torch_dtype=torch.float16)\""]

This will download the model at build time.

pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16, local_files_only=True)

local_files_only=True will ensure that the model is loaded from the cache and not downloaded again.

Calling the API

curl --location 'https://api.cortex.cerebrium.ai/v4/YOUR-PROJECT_ID/22-fast-stable-diffusion/predict' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer YOUR_TOKEN' \
--data '{
    "item": {
        "prompt": "A golden retriever puppy sitting in the grass."
    }
}'```

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

5-fast-stable-diffusion

5-fast-stable-diffusion

README.md

Fast Stable Diffusion

Calling the API

Files

5-fast-stable-diffusion

Directory actions

More options

Directory actions

More options

Latest commit

History

5-fast-stable-diffusion

Folders and files

parent directory

README.md

Fast Stable Diffusion

Calling the API