Evaluator

Evaluate any Gooey.AI Workflow output against a dataset of inputs and "golden" (expert-created) desired answers. Score every row of a CSV, Google Sheet, or Excel file with any LLM-as-Judge instruction prompt, then average the scores in any column to generate automated evaluations.
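As a rough illustration of the score-then-average flow described above, here is a minimal Python sketch. It is not Gooey.AI's implementation or API: the column names (`question`, `output`, `golden_answer`), the `evaluate` helper, and `exact_match_judge` are all hypothetical, and the judge is a simple string comparison standing in for an LLM-as-Judge call so the example runs offline.

```python
import csv
import io
from statistics import mean
from typing import Callable

Row = dict[str, str]


def exact_match_judge(row: Row) -> float:
    """Hypothetical stand-in for an LLM judge.

    Returns 1.0 when the workflow output matches the golden answer
    (case-insensitive, whitespace-trimmed), otherwise 0.0.
    """
    got = row["output"].strip().lower()
    want = row["golden_answer"].strip().lower()
    return 1.0 if got == want else 0.0


def evaluate(csv_text: str, judges: dict[str, Callable[[Row], float]]):
    """Score every row with every judge, then average each score column."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    scored = []
    for row in rows:
        scored_row = dict(row)
        for name, judge in judges.items():
            # Each judge adds one new score column to the row.
            scored_row[name] = judge(row)
        scored.append(scored_row)
    # Aggregate: mean of each score column across all rows.
    aggregates = {name: mean(r[name] for r in scored) for name in judges}
    return scored, aggregates


# Assumed sample data; the column names are illustrative only.
SAMPLE = """question,output,golden_answer
What is 2+2?,4,4
Capital of France?,Lyon,Paris
"""

scored_rows, aggregates = evaluate(SAMPLE, {"exact_match": exact_match_judge})
print(aggregates)
```

In a real setup, the judge function would send the row and an evaluation prompt to an LLM and parse a numeric score from its reply; the aggregation step stays the same.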

Input Data Spreadsheet

Evaluation Prompts

Aggregations

Run cost = 9 credits

With each run, you agree to Gooey.AI's terms & privacy policy.



Aggregate: Mean

