Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Select initial models and make available via TF #2

Open
4 tasks
wesdottoday opened this issue Sep 26, 2023 · 1 comment
Open
4 tasks

Select initial models and make available via TF #2

wesdottoday opened this issue Sep 26, 2023 · 1 comment
Assignees
Labels
documentation Improvements or additions to documentation enhancement New feature or request

Comments

@wesdottoday
Copy link
Contributor

  • Look through existing models from SageMaker Jumpstart and select 2-3 embedding and text generation models
  • TF read Environment Variable (or in the TF code) to choose the models
  • TF put models on S3 bucket
  • Documentation of the models and how to update the code to support them

We need to have a few models for the users to choose from and then make them available for SageMaker inside of a S3 bucket.

@wesdottoday wesdottoday added documentation Improvements or additions to documentation enhancement New feature or request labels Sep 26, 2023
@wesdottoday wesdottoday self-assigned this Sep 26, 2023
@wesdottoday
Copy link
Contributor Author

wesdottoday commented Sep 26, 2023

Just a snippet of code I had that informed me of the fact that there are existing models that we can copy over:

# download JumpStart model_manifest file.
boto3.client("s3").download_file(
    f"jumpstart-cache-prod-{aws_region}", "models_manifest.json", "models_manifest.json"
)
with open("models_manifest.json", "rb") as json_file:
    model_list = json.load(json_file)

# filter-out all the Text Embedding models from the manifest list.
text_embedding_models = []
for model in model_list:
    model_id = model["model_id"]
    if "-tcembedding-" in model_id and model_id not in text_embedding_models:
        text_embedding_models.append(model_id)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant