LightRAG/lightrag/evaluation
clssck dd1413f3eb test(lightrag,examples): add prompt accuracy and quality tests
Add comprehensive test suites for prompt evaluation:
- test_prompt_accuracy.py: 365 lines testing prompt extraction accuracy
- test_prompt_quality_deep.py: 672 lines for deep quality analysis
- Refactor prompt.py to consolidate optimized variants (removed prompt_optimized.py)
- Apply ruff formatting and type hints across 30 files
- Update pyrightconfig.json for static type checking
- Modernize reproduce scripts and examples with improved type annotations
- Sync uv.lock dependencies
2025-12-05 16:39:52 +01:00
..
sample_documents feat(evaluation): Add sample documents for reproducible RAGAS testing 2025-11-03 13:28:46 +01:00
wiki_documents chore(lightrag): add wikipedia test dataset for evaluation 2025-11-30 20:14:52 +01:00
__init__.py test(lightrag,examples): add prompt accuracy and quality tests 2025-12-05 16:39:52 +01:00
compare_results.py test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00
download_wikipedia.py test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00
e2e_test_harness.py test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00
eval_rag_quality.py test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00
ingest_test_docs.py test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00
populate_test_data.sh chore(docker-compose, lightrag): optimize test infrastructure and add evaluation tools 2025-11-29 10:39:20 +01:00
README_EVALUASTION_RAGAS.md Update LLM cache migration docs and improve UX prompts 2025-11-08 23:48:19 +08:00
sample_dataset.json Update evaluation defaults and expand sample dataset 2025-11-04 22:17:17 +08:00
wiki_test_dataset.json test(lightrag,examples,api): comprehensive ruff formatting and type hints 2025-12-05 15:17:06 +01:00