Please support rerank model from infinity #8475

Open
4 of 5 tasks
shizidushu opened this issue Sep 15, 2024 · 1 comment
@shizidushu

Self Checks

  • I have searched for existing issues, including closed ones.
  • I confirm that I am using English to submit this report (I have read and agree to the Language Policy).
  • [FOR CHINESE USERS] Please be sure to submit issues in English, otherwise they will be closed. Thank you! :)
  • Please do not modify this template :) and fill in all the required fields.

1. Is this request related to a challenge you're experiencing? Tell me about your story.

I want to use a rerank model deployed with infinity (https://github.com/michaelfeil/infinity).

2. Additional context or comments

Here is an example of how to send a request to an infinity rerank model:

curl -X 'POST' \
  'http://localhost:7997/rerank' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "query": "like",
  "documents": [
    "like", "love", "hate"
  ],
  "return_documents": false,
  "model": "BAAI/bge-reranker-v2-m3"
}'
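
For anyone wiring this up from Python, the same request could be sketched with the standard library alone. This is only an illustration of the curl call above, not how an integration would have to look: the helper names `build_rerank_request` and `rerank` are hypothetical, the URL and model name are copied from the curl example, and the response is returned as decoded JSON without assuming anything about its fields.

```python
import json
import urllib.request

# Endpoint copied from the curl example above; adjust for your deployment.
INFINITY_URL = "http://localhost:7997/rerank"


def build_rerank_request(query, documents, model,
                         return_documents=False, url=INFINITY_URL):
    """Build the POST request equivalent to the curl command above.

    Hypothetical helper: the name and signature are illustrative only.
    """
    payload = {
        "query": query,
        "documents": list(documents),
        "return_documents": return_documents,
        "model": model,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "accept": "application/json",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def rerank(query, documents, model, timeout=10.0, **kwargs):
    """Send the request and return the decoded JSON body unchanged.

    No assumption is made here about the fields in the response.
    """
    request = build_rerank_request(query, documents, model, **kwargs)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (needs an infinity server listening on localhost:7997):
# rerank("like", ["like", "love", "hate"], "BAAI/bge-reranker-v2-m3")
```

Building the request separately from sending it keeps the payload easy to inspect without a running server.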

3. Can you help us with this feature?

  • I am interested in contributing to this feature.
@michaelfeil

I second this. This could potentially be done best with a new module similar to text-embeddings-inference.

@Yawen-1010 self-assigned this Oct 22, 2024