juliuslipp committed · Commit d41dac6
Parent(s): 456b7cf

Update README.md
README.md CHANGED
@@ -2740,8 +2740,31 @@ console.log(similarities); // [0.7919578577247139, 0.6369278664248345, 0.1651201
### Using API

-You
+You can use the Model via our API as follows.
+
+```python
+from mixedbread_ai.client import MixedbreadAI
+from sklearn.metrics.pairwise import cosine_similarity
+import os
+
+mxbai = MixedbreadAI(api_key="{MIXEDBREAD_API_KEY}")
+
+english_sentences = [
+    'What is the capital of Australia?',
+    'Canberra is the capital of Australia.'
+]
+
+res = mxbai.embeddings(
+    input=english_sentences,
+    model="mixedbread-ai/mxbai-embed-large-v1"
+)
+embeddings = [entry.embedding for entry in res.data]
+
+similarities = cosine_similarity([embeddings[0]], [embeddings[1]])
+print(similarities)
+```
+
+The API comes with native INT8 and binary quantization support!

## Evaluation
As of March 2024, our model achieves SOTA performance for BERT-large sized models on the [MTEB](https://huggingface.co/spaces/mteb/leaderboard). It outperforms commercial models like OpenAI's text-embedding-3-large and matches the performance of models 20x its size, such as [echo-mistral-7b](https://huggingface.co/jspringer/echo-mistral-7b-instruct-lasttoken). Our model was trained with no overlap with the MTEB data, which indicates that it generalizes well across several domains, tasks, and text lengths. We are aware of some limitations with this model, which will be fixed in v2.
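
The added section notes that the API supports native INT8 and binary quantization. As a rough client-side illustration of the same idea (not the API's native quantization path, and assuming the `embeddings` list from the snippet above), the `quantize_embeddings` helper from sentence-transformers can quantize the returned float vectors:

```python
import numpy as np
from sentence_transformers.quantization import quantize_embeddings

# `embeddings` is the list of float vectors returned by the API call above
# (assumption: mxbai-embed-large-v1 returns 1024-dimensional float embeddings).
float_embeddings = np.array(embeddings)

# Scalar (int8) quantization: each float dimension is mapped to one signed byte.
# With only two vectors, the calibration ranges are estimated from them directly.
int8_embeddings = quantize_embeddings(float_embeddings, precision="int8")

# Binary quantization: each dimension becomes one bit, packed into bytes,
# so a 1024-dim embedding shrinks to 128 bytes.
binary_embeddings = quantize_embeddings(float_embeddings, precision="ubinary")

print(int8_embeddings.shape, int8_embeddings.dtype)      # e.g. (2, 1024) int8
print(binary_embeddings.shape, binary_embeddings.dtype)  # e.g. (2, 128) uint8
```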
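
For context on the Evaluation claim, here is a minimal sketch of scoring the model on a single MTEB task with the `mteb` package; the task choice and output folder are illustrative and not part of the original README.

```python
from mteb import MTEB
from sentence_transformers import SentenceTransformer

# Load the model locally and run one MTEB task as a spot check.
model = SentenceTransformer("mixedbread-ai/mxbai-embed-large-v1")
evaluation = MTEB(tasks=["Banking77Classification"])
results = evaluation.run(model, output_folder="results/mxbai-embed-large-v1")
print(results)
```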