gemini-flash
rag_instruct_benchmark_tester.jsonl
text → text
Unknown/Gemini 1.5 Flash

is_correct
Are the answers equivalent? Answer "true" or "false". All lowercase. Answer 1: {answer} Answer 2: {prediction}
Nov 7, 2024, 11:40 PM UTC
Nov 7, 2024, 11:40 PM UTC
5 row sample
213 tokens$ 0.0000
5 rows processed, 213 tokens used ($0.0000)
Estimated cost for all 200 rows: $0.0002Sample Results completed
8 columns, 1-5 of 200 rows