nvidia
/

nemo-megatron-gpt-1.3B

Text2Text Generation

Model card Files Files and versions Community

okuchaiev commited on Sep 14, 2022

Commit

96ac9a1

·

1 Parent(s): 3edf4fd

Update README.md

Files changed (1) hide show

README.md +33 -0

README.md CHANGED Viewed

@@ -59,6 +59,39 @@ git checkout v1.11.0
 python megatron_gpt_eval.py gpt_model_file=nemo_gpt5B_fp16.nemo server=True tensor_model_parallel_size=1 trainer.devices=1
 ```
 ## Training Data

 python megatron_gpt_eval.py gpt_model_file=nemo_gpt5B_fp16.nemo server=True tensor_model_parallel_size=1 trainer.devices=1
 ```
+### Step 3: Send prompts to you model!
+```python
+import json
+import requests
+port_num = 5555
+headers = {"Content-Type": "application/json"}
+def request_data(data):
+    resp = requests.put('http://localhost:{}/generate'.format(port_num),
+                        data=json.dumps(data),
+                        headers=headers)
+    sentences = resp.json()['sentences']
+    return sentences
+data = {
+    "sentences": ["Tell me an interesting fact about space travel."]*1,
+    "tokens_to_generate": 50,
+    "temperature": 1.0,
+    "add_BOS": True,
+    "top_k": 0,
+    "top_p": 0.9,
+    "greedy": False,
+    "all_probs": False,
+    "repetition_penalty": 1.2,
+    "min_tokens_to_generate": 2,
+}
+sentences = request_data(data)
+print(sentences)
+```
 ## Training Data