Add comprehensive test dataset with 7 domain-specific Wikipedia documents (climate, finance, medical, sports) and corresponding test cases in JSON format. Total of 2292 lines of test data across 8 files for RAG quality evaluation and end-to-end testing infrastructure. |
||
|---|---|---|
| .. | ||
| sample_documents | ||
| wiki_documents | ||
| __init__.py | ||
| compare_results.py | ||
| download_wikipedia.py | ||
| e2e_test_harness.py | ||
| eval_rag_quality.py | ||
| ingest_test_docs.py | ||
| populate_test_data.sh | ||
| README_EVALUASTION_RAGAS.md | ||
| sample_dataset.json | ||
| wiki_test_dataset.json | ||