💅 Update `text2vec-azure-openai` to utilize `isAzure: true` flag and mark `resourceName` + `deploymentId` as optional #196

flipace · 2024-09-18T08:55:34Z

This relates to the changes in weaviate/weaviate#5776

With this adjustment, devs can use the text2VecAzureOpenAI vectorizer, without specifying deploymentId or resourceName upfront for their collection.

Instead, they may provide the headers X-Azure-Deployment-Id and X-Azure-Resource-Name in their requests to set these.

vectorizer.text2VecAzureOpenAI({
  vectorizeCollectionName: false,
})

Internally, using text2VecAzureOpenAI will set the an isAzure: true flag for the OpenAI vectorizer, so it understands that the Azure logic must be used.

…rk `resourceName` + `deploymentId` as optional This relates to the changes in weaviate/weaviate#5776

weaviate-git-bot · 2024-09-18T10:01:37Z

Great to see you again! Thanks for the contribution.

beep boop - the Weaviate bot 👋🤖

PS:
Are you already a member of the Weaviate Slack channel?

tsmith023

Thanks for the contribution! Left a few comments mainly around house-keeping otherwise the PR looks great 😁

src/collections/configure/unit.test.ts

tsmith023 · 2024-09-20T10:11:36Z

src/collections/config/types/vectorizer.ts

+  /** Will automatically be set to true. You don't need to set this manually. */
+  isAzure?: true;


If this is always true, does it still need the ? operator? If not, can we remove it?

So the true is always set by the text2VecAzureOpenAI function internally, should not be necessary to be passed by the user - but due to how the types in general are structured I could not easily 1) remove it completely from the config object, nor 2) remove the optional operator, since then the user would be required to supply isAzure: true manually 🤔

Ahhh, okay I see now. I think this makes sense if the user makes use of the .azureOpenAI method but part of the API is still to allow users to work with the raw types if they so wish. As such, if a user did:

generative: { name: 'generative-openai', config: { deploymentId: config.deploymentId, resourceName: config.resourceName, baseURL: config.baseURL, } }

then the type system would allow it since isAzure is optional yet the runtime would interpret this as isAzure: undefined, which is a false-y value.

I like the idea of introducing the isAzure flag to the TS client but I think it may be better placed as a pure internal, e.g. not exposed in the user types, wdyt?

I would love to not expose it - but I could not figure out how to make it work so that it is not marked as unexpected internally where i place it 😬

This part here is what I mean:

I wouldn't know how to tell TS here that this is an expected prop without adding a // @ts-expect-error 🤔

My thinking is that isAzure should be removed from the ...Config types and instead interpreted by the client's runtime itself depending on the name of the module. So this would most likely require the addition of generative-azure-openai, alongside text2vec-azure-openai, that is then parsed appropriately in the collection creation logic

There we'd have some boolean clauses to determine whether the module is an azure one, based on the name, and then inject isAzure: true into the config appropriately. IMO, this would be the most consistent for the client/server relationship as I'm sure there will be future refactoring of the server that changes this behaviour. Then, we'd only break the internal relationship rather than the public API

We already do something similar here, wdyt about extending this logic as described above?

If you'd rather not then that's fine, I can add it to my backlog 😁 Also, sorry for the spaghetti of the collection.create method, I've not had the chance to refactor it into a better structure 😅

Got it - this definitely makes more sense 👍 I didn't look into this part.

I'm drowning a bit in other work right now, but I should be able to look into this in more detail hopefully next week or so 🙏

If I get round to it this week, I'll ping you on here to let you know. Thanks for your help so far!

src/collections/config/types/generative.ts

…passed at all Since the only indicator for the azureOpenAI config is now the isAzure: true flag, which is set in the vectorizer setup directly, no config object is necessary for it.

flipace mentioned this pull request Sep 18, 2024

💅 add support for sending azure deployment id and resource name via request headers weaviate/weaviate#5776

Merged

4 tasks

flipace force-pushed the feature/azure-openai-config branch from 1952e48 to 6b07490 Compare September 18, 2024 08:56

Update text2vec-azure-openai to utilize isAzure: true flag and ma…

9184e62

…rk `resourceName` + `deploymentId` as optional This relates to the changes in weaviate/weaviate#5776

flipace force-pushed the feature/azure-openai-config branch from 6b07490 to 9184e62 Compare September 18, 2024 09:06

Update GenerativeOpenAI to support new isAzure: true flag

9f072ca

tsmith023 reviewed Sep 20, 2024

View reviewed changes

chore: adjust types for text2VecAzureOpenAI to allow no object to be …

ec23daa

…passed at all Since the only indicator for the azureOpenAI config is now the isAzure: true flag, which is set in the vectorizer setup directly, no config object is necessary for it.

flipace requested a review from tsmith023 October 7, 2024 08:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

💅 Update `text2vec-azure-openai` to utilize `isAzure: true` flag and mark `resourceName` + `deploymentId` as optional #196

💅 Update `text2vec-azure-openai` to utilize `isAzure: true` flag and mark `resourceName` + `deploymentId` as optional #196

flipace commented Sep 18, 2024

weaviate-git-bot commented Sep 18, 2024

tsmith023 left a comment

tsmith023 Sep 20, 2024

flipace Sep 23, 2024

tsmith023 Oct 7, 2024

flipace Oct 7, 2024

tsmith023 Oct 7, 2024 •

edited

Loading

flipace Oct 7, 2024

tsmith023 Oct 7, 2024

		/** Will automatically be set to true. You don't need to set this manually. */
		isAzure?: true;

💅 Update text2vec-azure-openai to utilize isAzure: true flag and mark resourceName + deploymentId as optional #196

Are you sure you want to change the base?

💅 Update text2vec-azure-openai to utilize isAzure: true flag and mark resourceName + deploymentId as optional #196

Conversation

flipace commented Sep 18, 2024

weaviate-git-bot commented Sep 18, 2024

tsmith023 left a comment

Choose a reason for hiding this comment

tsmith023 Sep 20, 2024

Choose a reason for hiding this comment

flipace Sep 23, 2024

Choose a reason for hiding this comment

tsmith023 Oct 7, 2024

Choose a reason for hiding this comment

flipace Oct 7, 2024

Choose a reason for hiding this comment

tsmith023 Oct 7, 2024 • edited Loading

Choose a reason for hiding this comment

flipace Oct 7, 2024

Choose a reason for hiding this comment

tsmith023 Oct 7, 2024

Choose a reason for hiding this comment

💅 Update `text2vec-azure-openai` to utilize `isAzure: true` flag and mark `resourceName` + `deploymentId` as optional #196

💅 Update `text2vec-azure-openai` to utilize `isAzure: true` flag and mark `resourceName` + `deploymentId` as optional #196

tsmith023 Oct 7, 2024 •

edited

Loading