cognee/cognee
alekszievr 7b5bd7897f
Feat: evaluate retrieved context against golden context [cog-1481] (#619)
<!-- .github/pull_request_template.md -->

## Description
- Compare retrieved context to golden context using deepeval's
summarization metric
- Display relevant fields to each metric on metrics dashboard

Example output:

![image](https://github.com/user-attachments/assets/9facf716-b2ab-4573-bfdf-7b343d2a57c5)


## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin


<!-- This is an auto-generated comment: release notes by coderabbit.ai
-->
## Summary by CodeRabbit

- **New Features**
- Enhanced context handling in answer generation and corpus building to
include extended details.
- Introduced a new context coverage metric for deeper evaluation
insights.
- Upgraded the evaluation dashboard with dynamic presentation of metric
details.
- Added a new parameter to support loading golden context in corpus
loading methods.

- **Bug Fixes**
- Improved clarity in how answers are structured and appended in the
answer generation process.
<!-- end of auto-generated comment: release notes by coderabbit.ai -->
2025-03-10 15:27:48 +01:00
..
api Feature/cog 1312 integrating evaluation framework into dreamify (#562) 2025-03-03 19:55:47 +01:00
eval_framework Feat: evaluate retrieved context against golden context [cog-1481] (#619) 2025-03-10 15:27:48 +01:00
exceptions ruff format 2025-01-05 19:09:08 +01:00
infrastructure fix: add proper node labels (#607) 2025-03-06 13:30:13 +01:00
modules feat: COG-1523 add top_k in run_question_answering (#625) 2025-03-10 10:55:31 +01:00
shared Change data models for gemini (#600) 2025-03-04 14:09:28 +01:00
tasks feat: add regex entity extractor (#605) 2025-03-06 12:13:59 +01:00
tests Feat: evaluate retrieved context against golden context [cog-1481] (#619) 2025-03-10 15:27:48 +01:00
__init__.py Change data models for gemini (#600) 2025-03-04 14:09:28 +01:00
base_config.py Update base_config.py 2025-01-16 13:19:43 +01:00
fetch_secret.py ruff format 2025-01-05 19:09:08 +01:00
low_level.py fix: custom model pipeline (#508) 2025-02-08 02:00:15 +01:00
pipelines.py
root_dir.py ruff format 2025-01-05 19:09:08 +01:00