Salesforce
/

LLaMA-3-8B-SFR-Iterative-DPO-R

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

bpucla commited on May 10, 2024

Commit

fc5d28a

·

verified ·

1 Parent(s): 15a287e

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -86,7 +86,10 @@ print(model_outputs[0])
 ## Limitations
-SFR-Iterative-DPO-LLaMA-3-8B-R is a reseach model as a result on our RLHF research at Salesforce.
 ## Citation
 Please cite our techical report if you find our model is useful for your research or product.

 ## Limitations
+SFR-Iterative-DPO-LLaMA-3-8B-R is a research model developed as part of our RLHF initiative at Salesforce.
+While safety and ethical considerations are integral to our alignment process,
+there remains the possibility that the model could generate offensive or unethical content, particularly under adversarial conditions.
+We are committed to continuous improvement in our models to minimize such risks and encourage responsible usage.
 ## Citation
 Please cite our techical report if you find our model is useful for your research or product.