Skip to content

Can't find any configuration for OpenAI Embeddings #875

@HasnainKhanNiazi

Description

@HasnainKhanNiazi

Hey, I am playing around with marqo, did multiple experiments and I am having a few questions.

  1. I can use any model given here to generate embeddings and create an index: https://docs.marqo.ai/2.8/Guides/Models-Reference/list_of_models/ ; But how can I use OpenAI text-embedding-03-large or any other model which is not available on huggingface.
  2. I used hf/multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large to generate embeddings and created an index and now search isn't working as expected. For example, if I type in a search query "bike" then the first 3-4 retrieved documents are not even related to bikes.
  3. I have also tried using filter_string but in the case of filter_string, the results are empty list.

This is how I am using marqo index;

docs = []

for index, row in data.iterrows():
    local_dict = {}
    local_dict["title"] = row["title"]
    local_dict["description"] = row["markdown"]
    local_dict["attributes"] = row["attributes"]
    docs.append(local_dict)



mq = marqo.Client(url='http://localhost:8882')
results = mq.index("my-first-index").delete()
mq.create_index("my-first-index", model='hf/multilingual-e5-large')

mq.index("my-first-index").add_documents(docs,
    tensor_fields=["title", "description", "attributes"], client_batch_size=64
)

results_with_filters = mq.index("my-first-index").search(
    q="Bike", filter_string="price:[0 To 1000]"
)

results_without_filters = mq.index("my-first-index").search(
    q="Bike"
)

And above both queries are not working as expected. Any help or guidance will be appreciated. Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions