<!-- .github/pull_request_template.md -->

This PR contains the evaluation framework development for cognee.

## DCO Affirmation

I affirm that all code in every commit of this pull request conforms to the terms of the Topoteretes Developer Certificate of Origin.

<!-- This is an auto-generated comment: release notes by coderabbit.ai -->

## Summary by CodeRabbit

- **New Features**
  - Expanded evaluation framework now integrates asynchronous corpus building, question answering, and performance evaluation with adaptive benchmarks for improved metrics (correctness, exact match, and F1 score).
- **Infrastructure**
  - Added database integration for persistent storage of questions, answers, and metrics.
  - Launched an interactive metrics dashboard featuring advanced visualizations.
  - Introduced an automated testing workflow for continuous quality assurance.
- **Documentation**
  - Updated guidelines for generating concise, clear answers.

<!-- end of auto-generated comment: release notes by coderabbit.ai -->
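For context, the exact match and token-level F1 metrics named above are conventionally computed along the following lines for QA evaluation. This is a minimal sketch of the standard definitions, not code from this PR; the function names and the simple whitespace/lowercase normalization are assumptions.

```python
from collections import Counter


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings match exactly, else 0.0 (hypothetical helper)."""
    return float(prediction.strip().lower() == reference.strip().lower())


def token_f1(prediction: str, reference: str) -> float:
    """Token-level F1: harmonic mean of precision and recall over shared tokens."""
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()
    # Multiset intersection counts how many tokens the two answers share.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```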
The PR introduces an abstract adapter interface that evaluation backends implement:

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, List


class BaseEvalAdapter(ABC):
    """Abstract interface for backends that score answered questions."""

    @abstractmethod
    async def evaluate_answers(
        self, data: List[Dict[str, Any]], evaluator_metrics: List[str]
    ) -> List[Dict[str, Any]]:
        pass
```
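A concrete backend would subclass this and return one scored record per input item. The sketch below is illustrative only: `DirectEvalAdapter`, the `"EM"`/`"f1"` metric names, and the `answer`/`golden_answer` field names are assumptions, and it reuses the hypothetical `exact_match` and `token_f1` helpers from the sketch above.

```python
class DirectEvalAdapter(BaseEvalAdapter):
    """Hypothetical adapter that scores answers locally with simple string metrics."""

    async def evaluate_answers(
        self, data: List[Dict[str, Any]], evaluator_metrics: List[str]
    ) -> List[Dict[str, Any]]:
        metric_fns = {"EM": exact_match, "f1": token_f1}
        results = []
        for item in data:
            # Compute only the requested metrics that this adapter knows about.
            scores = {
                name: metric_fns[name](item["answer"], item["golden_answer"])
                for name in evaluator_metrics
                if name in metric_fns
            }
            results.append({**item, "metrics": scores})
        return results
```

Since `evaluate_answers` is a coroutine, a caller would invoke it with something like `asyncio.run(DirectEvalAdapter().evaluate_answers(items, ["EM", "f1"]))`; the async signature leaves room for adapters that call out to an LLM or a remote scoring service.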