MetaAligner committed on
Commit 669de4e · verified · 1 Parent(s): 5fd21a0

Update README.md

Files changed (1)
  1. README.md +5 -5
README.md CHANGED
@@ -54,8 +54,8 @@ all_aspects = {'instruction_following': 'Instruction following: the response sho
  'repeat': 'Repetition: the response should avoid repetitive statements of one point.',
  'len': 'Length: the response should be concise and avoid redundant content.'}
  aspects = [instruction_following', 'honesty', 'truthfulness', 'helpfulness']
- question = ""
- answer = "I'm glad you think so. But I should let you know that intentionally damaging someone's reputation is not always legal or ethical. It's important to remember that everyone has the right to their own reputation, and it's not okay to try to harm someone just because you don't like them. It's important to treat others with respect and kindness, even if you don't get along with them. Human: Yeah, I guess you're right. But what if I just want to get back at them for something they did to me? Assistant: I understand that you might feel upset or hurt by something they did, but it's important to handle those situations in a constructive way. Trying to damage their reputation is not the right way to handle it. Instead, you could try talking to them directly and expressing how you feel. Or you could talk to a mediator or counselor who can help you work through the situation in a healthy way. Remember, causing harm to someone else is never the solution to a problem. It's always better to find a peaceful and constructive way to resolve conflicts."
+ question = "What is the square root of banana?"
+ answer = "The square root of banana is not a well-defined mathematical concept, as bananas are not objects that can be squared. Bananas are a type of fruit that grow on plants and have a specific shape and size. They do not have any mathematical properties or values that can be used in mathematical operations. Therefore, the square root of banana is not a valid or meaningful mathematical concept. It is important to be aware of the limitations of mathematical language and symbols, and to use them correctly and consistently in order to avoid confusion or misinterpretation."
  ```
 
  To ensure the best performance, use the following template to prompt <em>MetaAligner</em>:
@@ -78,11 +78,11 @@ response = tokenizer.decode(truc_ids, skip_special_tokens=True, spaces_between_s
  print(response)
  ```
 
- One inference of MetaAligner-HH-RLHF-1.1B on the above codes has the following response:
+ One inference of MetaAligner-UltraFeedback-1.1B on the above codes has the following response:
  ```
- Yeah, I agree. It’s important to handle situations in a constructive way. Trying to damage someone’s reputation is not the right way to handle it. Instead, you could try talking to them directly and expressing how you feel. Or you could talk to a mediator or counselor who can help you work through the situation in a healthy way. Remember, causing harm to someone else is never the solution to a problem. It’s always better to find a peaceful and constructive way to resolve conflicts.
+ The square root of a number is the reciprocal of that number. In this case, the square root of a banana is not a valid mathematical concept. Bananas are not a mathematical quantity, and therefore, there is no square root of a banana.
  ```
 
  ## License
 
- MetaAligner-HH-RLHF-1.1B is licensed under MIT. For more details, please see the MIT file.
+ MetaAligner-UltraFeedback-1.1B is licensed under MIT. For more details, please see the MIT file.
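
Review note: the unchanged `aspects` context line in this diff is missing the opening quote on `instruction_following`, so the README snippet would raise a `SyntaxError` if copied as-is. The sketch below shows the updated inputs with that quote restored, plus a small sanity check that every selected aspect has a description. It is only an illustration under stated assumptions: the full `all_aspects` dict is truncated in the diff, so every entry except `repeat` and `len` is a hypothetical placeholder, and `answer` is shortened to its first sentence.

```python
# Hypothetical stand-in for the README's `all_aspects` dict. Only 'repeat' and
# 'len' are copied from the diff; the other descriptions are placeholders
# because the diff truncates or omits them.
all_aspects = {
    'instruction_following': '<description truncated in the diff>',
    'honesty': '<not shown in the diff>',
    'truthfulness': '<not shown in the diff>',
    'helpfulness': '<not shown in the diff>',
    'repeat': 'Repetition: the response should avoid repetitive statements of one point.',
    'len': 'Length: the response should be concise and avoid redundant content.',
}

# Opening quote restored on 'instruction_following'.
aspects = ['instruction_following', 'honesty', 'truthfulness', 'helpfulness']

# New example inputs from this commit (answer shortened to its first sentence).
question = "What is the square root of banana?"
answer = ("The square root of banana is not a well-defined mathematical concept, "
          "as bananas are not objects that can be squared.")

# Fail early if an aspect name has no matching description.
missing = [name for name in aspects if name not in all_aspects]
if missing:
    raise KeyError(f"Unknown aspects: {missing}")

# Descriptions for the selected aspects, in the order given by `aspects`.
selected = [all_aspects[name] for name in aspects]
```

The membership check is an addition for illustration and is not part of the README; it simply surfaces a misspelled aspect name before the prompt is built.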