Updated prompt
common.py
CHANGED
@@ -47,11 +47,14 @@ EVAL_DESCRIPTION = """
 - Examples (Optional)
 """
 
-DEFAULT_EVAL_PROMPT = """You are assessing a chat bot response to a user's input based on how well it follows the user's instructions. Your evaluation should consider
+DEFAULT_EVAL_PROMPT = """You are assessing a chat bot response to a user's input based on how well it follows the user's instructions. Your evaluation should consider factors such as the helpfulness, relevance, accuracy, depth, creativity, and level of detail of the response. Do not allow the length of the response to influence your evaluation. Be objective as possible and give a brief explanation for your score.
 
 Score:
-
-
+Score 1: The response ignores or misinterprets instructions, providing irrelevant or inaccurate content that fails to address the request.
+Score 2: The response follows instructions partially but misses key elements, lacking depth or precision while containing minor inaccuracies.
+Score 3: The response follows main instructions adequately, providing correct and relevant information with reasonable depth.
+Score 4: The response follows instructions thoroughly with strong attention to detail, offering accurate, well-developed content that thoughtfully addresses needs.
+Score 5: The response demonstrates exceptional instruction following with precise, comprehensive content that shows both insight and perfect alignment with the request.
 
 [User Query]: {{input}}
 
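A minimal sketch of how a template like DEFAULT_EVAL_PROMPT might be filled in before it is sent to a judge model. The helper name `render_prompt`, the shortened `TEMPLATE`, and the regex-based substitution are assumptions made for illustration. The diff shows only the `{{input}}` placeholder and does not show how the rest of the code renders it.

```python
import re

# Shortened, hypothetical stand-in for DEFAULT_EVAL_PROMPT. Only the
# {{input}} placeholder is taken from the diff; the rubric lines are abridged.
TEMPLATE = """Score:
Score 1: The response ignores or misinterprets instructions.
Score 5: The response demonstrates exceptional instruction following.

[User Query]: {{input}}
"""

# Matches {{name}} where name is a simple identifier.
PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")


def render_prompt(template: str, values: dict[str, str]) -> str:
    """Replace each {{name}} placeholder with values[name].

    Raises KeyError if the template names a placeholder that has no value,
    so a missing field fails loudly instead of reaching the judge model.
    """
    def substitute(match: re.Match) -> str:
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder {name!r}")
        return values[name]

    # A function replacement avoids backslash-escape processing of the value.
    return PLACEHOLDER.sub(substitute, template)


if __name__ == "__main__":
    print(render_prompt(TEMPLATE, {"input": "Summarize this article in two sentences."}))
```

Raising on an unknown placeholder is one possible design choice; a caller that prefers to leave unfilled placeholders in place could return `match.group(0)` instead.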