cognee/evals/eval_framework
lxobr 4b7c21d7d8
feat: retrieve golden contexts [COG-1364] (#579)
<!-- .github/pull_request_template.md -->

## Description
<!-- Provide a clear description of the changes in this PR -->
• Added load_golden_context parameter to BaseBenchmarkAdapter's abstract
load_corpus method, establishing a common interface for retrieving
supporting evidence
• Refactored HotpotQAAdapter with a modular design: introduced
_get_metadata_field_name method to handle dataset-specific fields
(making it extensible for child classes), implemented get golden context
functionality.
• Refactored TwoWikiMultihopAdapter to inherit from HotpotQAAdapter,
overriding only the necessary methods while reusing parent's
functionality
• Added golden context support to MusiqueQAAdapter with their
decomposition-based format
## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Introduced an option to include additional context during corpus
loading, enhancing the quality and flexibility of generated QA pairs.
- **Refactor**
- Streamlined and modularized the processing workflow across different
adapters for improved consistency and maintainability.
- Updated metadata extraction to refine the display of contextual
information.
- Shifted focus in the `TwoWikiMultihopAdapter` from corpus loading to
context extraction.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-02-27 13:25:47 +01:00
..
answer_generation feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00
benchmark_adapters feat: retrieve golden contexts [COG-1364] (#579) 2025-02-27 13:25:47 +01:00
corpus_builder feat: add experimental cognify pipeline [COG-1293] (#541) 2025-02-25 16:14:27 +01:00
evaluation feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00
__init__.py feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00
eval_config.py feat: add experimental cognify pipeline [COG-1293] (#541) 2025-02-25 16:14:27 +01:00
metrics_dashboard.py feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00
run_eval.py feat: Cognee evaluation framework development (#498) 2025-02-11 16:31:54 +01:00