Update README.md
Browse files
README.md
CHANGED
@@ -1,12 +1,23 @@
|
|
1 |
---
|
2 |
library_name: transformers
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
---
|
5 |
|
6 |
|
7 |
-
# Sarvam-
|
8 |
|
9 |
-
Sarvam-
|
10 |
|
11 |
The model was trained with [NVIDIA NeMo™ Framework](https://github.com/NVIDIA/NeMo) on the Yotta Shakti Cloud using HGX H100 systems.
|
12 |
|
@@ -51,8 +62,8 @@ The model was trained with [NVIDIA NeMo™ Framework](https://github.com/NVIDIA
|
|
51 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
52 |
|
53 |
# Load model and tokenizer
|
54 |
-
model = AutoModelForCausalLM.from_pretrained("
|
55 |
-
tokenizer = AutoTokenizer.from_pretrained("
|
56 |
|
57 |
# Example usage
|
58 |
text = "कर्नाटक की राजधानी है:"
|
|
|
1 |
---
|
2 |
library_name: transformers
|
3 |
+
language:
|
4 |
+
- bn
|
5 |
+
- en
|
6 |
+
- gu
|
7 |
+
- hi
|
8 |
+
- kn
|
9 |
+
- ml
|
10 |
+
- mr
|
11 |
+
- or
|
12 |
+
- pa
|
13 |
+
- ta
|
14 |
+
- te
|
15 |
---
|
16 |
|
17 |
|
18 |
+
# Sarvam-1
|
19 |
|
20 |
+
Sarvam-1 is a 2-billion parameter language model specifically optimized for Indian languages. It provides best in-class performance in 10 Indic languages (bn, gu, hi, kn, ml, mr, or, pa, ta, te) when compared with popular models like Gemma-2-2B and Llama-3.2-3B. It is also competitive against the much larger models like Llama-3.1-8B in these languages. More details can be found in our [release blog](https://www.sarvam.ai/blogs/sarvam-1).
|
21 |
|
22 |
The model was trained with [NVIDIA NeMo™ Framework](https://github.com/NVIDIA/NeMo) on the Yotta Shakti Cloud using HGX H100 systems.
|
23 |
|
|
|
62 |
from transformers import AutoModelForCausalLM, AutoTokenizer
|
63 |
|
64 |
# Load model and tokenizer
|
65 |
+
model = AutoModelForCausalLM.from_pretrained("sarvamai/sarvam-1")
|
66 |
+
tokenizer = AutoTokenizer.from_pretrained("sarvamai/sarvam-1")
|
67 |
|
68 |
# Example usage
|
69 |
text = "कर्नाटक की राजधानी है:"
|