gemini-flash
rag_instruct_benchmark_tester.jsonl
text → text
is_correct
Are the answers equivalent? Answer "true" or "false".
Answer 1: {answer}
Answer 2: {prediction} Nov 7, 2024, 11:40 PM UTC
Nov 7, 2024, 11:40 PM UTC
5 row sample
198 tokens$ 0.0000
5 rows processed, 198 tokens used ($0.0000)
Estimated cost for all 200 rows: $0.0002Sample Results completed
8 columns, 1-5 of 200 rows