cognee/cognee/api
Vasilije c7d0f64cb1
fix: Refactor web parsing (#1575)

## Description

This PR iterates on #1552:

1. Refactors `preferred_loaders` from dicts to a list whose entries are
either strings (the name of the loader) or dicts (`{loader_name: {arg1: val1}}`),
e.g. `[{"loader_name_one": {"arg1": "val1"}}, "loader_name_two"]`
2. Adds default extraction rules for HTML parsing
3. Adds unit tests that cover the changes, plus a unit test for Tavily
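
The mixed list format from item 1 could be consumed roughly as follows. This is only a sketch: `normalize_preferred_loaders` is a hypothetical helper name, not taken from the cognee codebase, and the real handling of `preferred_loaders` may differ.

```python
from typing import Any, Dict, List, Union

# One entry is either a bare loader name or a single-key dict of loader args.
LoaderSpec = Union[str, Dict[str, Dict[str, Any]]]


def normalize_preferred_loaders(
    preferred_loaders: List[LoaderSpec],
) -> Dict[str, Dict[str, Any]]:
    """Hypothetical helper: map each loader name to its keyword arguments.

    Bare string entries get an empty argument dict; dict entries
    contribute their `{loader_name: {arg: value}}` pairs as-is.
    """
    normalized: Dict[str, Dict[str, Any]] = {}
    for entry in preferred_loaders:
        if isinstance(entry, str):
            normalized[entry] = {}
        elif isinstance(entry, dict):
            for loader_name, loader_args in entry.items():
                # Copy so later mutation does not affect the caller's dict.
                normalized[loader_name] = dict(loader_args or {})
        else:
            raise TypeError(
                f"preferred_loaders entries must be str or dict, got {type(entry).__name__}"
            )
    return normalized


# The example list from the description above.
example = [{"loader_name_one": {"arg1": "val1"}}, "loader_name_two"]
normalized = normalize_preferred_loaders(example)
```

Normalizing to a single name-to-arguments mapping up front would let downstream code treat both entry styles the same way while preserving the order in which loaders were listed.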

## Type of Change
- [ ] Bug fix (non-breaking change that fixes an issue)
- [ ] New feature (non-breaking change that adds functionality)
- [ ] Breaking change (fix or feature that would cause existing
functionality to change)
- [ ] Documentation update
- [x] Code refactoring
- [ ] Performance improvement
- [ ] Other (please specify):

## Pre-submission Checklist
- [ ] **I have tested my changes thoroughly before submitting this PR**
- [ ] **This PR contains minimal changes necessary to address the
issue/feature**
- [ ] My code follows the project's coding standards and style
guidelines
- [ ] I have added tests that prove my fix is effective or that my
feature works
- [ ] I have added necessary documentation (if applicable)
- [ ] All new and existing tests pass
- [ ] I have searched existing PRs to ensure this change hasn't been
submitted already
- [ ] I have linked any relevant issues in the description
- [ ] My commits have clear and descriptive messages

## DCO Affirmation
I affirm that all code in every commit of this pull request conforms to
the terms of the Topoteretes Developer Certificate of Origin.
2025-10-22 19:09:19 +02:00
..
v1 fix: Refactor web parsing (#1575) 2025-10-22 19:09:19 +02:00
.env.example Added Flexible CORS Configuration via Environment Variable: added flexibility to CORS (#1109) 2025-07-19 20:48:52 +02:00
__init__.py
client.py feat: Add update endpoint to Cognee 2025-09-26 12:44:05 +02:00
DTO.py
health.py refactor: Keep code DRY in health endpoint 2025-10-10 12:59:28 +02:00