for task 361:
our agent gave the answer: Order number 170 is Canceled, order number 189 is Pending
the evaluator is using fuzzy match and evaluated our answer as wrong:
"eval_types": [
"string_match"
],
"reference_answers": {
"fuzzy_match": [
"170: cancelled",
"189: pending"
]
},
"reference_url": "",
"program_html": [],
"string_note": "",
"reference_answer_raw_annotation": "170: cancelled, 189: pending"
},