Commit graph

192 commits

Author SHA1 Message Date
yangdx
6a29b5daa0 Update Docker deployment comments for LLM and embedding hosts 2025-10-23 13:53:26 +08:00
yangdx
c92ab8373a Fix linting 2025-10-22 15:00:18 +08:00
yangdx
88a45523e2 Increase default max file paths from 30 to 100 and improve documentation
- Bump DEFAULT_MAX_FILE_PATHS to 100
- Add clarifying comment about display
2025-10-21 17:33:00 +08:00
yangdx
3ad616be4f Change default source IDs limit method from KEEP to FIFO 2025-10-21 16:12:11 +08:00
yangdx
e0fd31a60d Fix logging message formatting 2025-10-20 22:09:09 +08:00
yangdx
a9fec26798 Add file path limit configuration for entities and relations
• Add MAX_FILE_PATHS env variable
• Implement file path count limiting
• Support KEEP/FIFO strategies
• Add truncation placeholder
• Remove old build_file_path function
2025-10-20 20:12:53 +08:00
yangdx
dc62c78f98 Add entity/relation chunk tracking with configurable source ID limits
- Add entity_chunks & relation_chunks storage
- Implement KEEP/FIFO limit strategies
- Update env.example with new settings
- Add migration for chunk tracking data
- Support all KV storage
2025-10-20 15:24:15 +08:00
yangdx
9f49e56a44 Merge branch 'main' into feat-entity-size-caps 2025-10-17 15:59:44 +08:00
yangdx
daeca17f38 Change default docker image to offline version
• Add lite verion docker image with tiktoken cache
• Update docs and build scripts
2025-10-16 16:52:01 +08:00
DivinesLight
c06522b927 Get max source Id config from .env and lightRAG init 2025-10-15 18:24:38 +05:00
haseebuchiha
d52c3377b4 Import from env and use default if none and removed useless import 2025-10-14 16:14:03 +05:00
yangdx
d0ae7a67a0 Fix typos and grammar in env.example configuration comments 2025-10-10 11:56:19 +08:00
yangdx
bd535e3e7a Add PostgreSQL connection retry configuration options
- Add retry environment variables
- Fix asyncpg import in retry tests
2025-10-10 03:06:21 +08:00
kevinnkansah
22a7b482c5 fix: renamed PostGreSQL options env variable and allowed LRU cache to be an optional env variable 2025-10-06 11:56:09 +02:00
kevinnkansah
108cdbe133 feat: add options for PostGres connection 2025-10-05 23:29:04 +02:00
yangdx
9dd1790b5c Add "Creature" entity type and reorganize type mappings
- Add Creature to default entity types
- Map animals/beings to creature type
2025-09-23 21:58:33 +08:00
yangdx
55176a24b4 Update env.example 2025-09-22 17:49:22 +08:00
yangdx
e14cee69a3 Fix Neo4j typo and add fulltext search with performance optimizations
- Fix NEO4J_DATABASE typo in env.example
- Add fulltext index for entity searches
- Implement get_popular_labels method
- Add search_labels with fuzzy matching
- Simplify B-Tree index creation logic
2025-09-20 12:37:13 +08:00
yangdx
e644a3e02f Adjust OpenAI temperature default and add mitigation guidance 2025-09-17 02:56:05 +08:00
yangdx
4a97e9f469 Update env.example 2025-09-17 02:31:41 +08:00
yangdx
3b18c33770 Update env.example 2025-09-15 05:02:45 +08:00
Daniel.y
ad0a533d21
Merge pull request #2100 from danielaskdd/return-data-only
Feature: Add /query/data API Endpoint for Raw Retrieval Data
2025-09-15 04:07:21 +08:00
yangdx
5311083f43 Rename "Process" entity type to "Method" across all components 2025-09-14 02:30:05 +08:00
yangdx
7060cf17f0 Add Process and Data entity types to LLM extraction system
• Add Process and Data to default types
• Update env.example configuration
• Add translations for new entities
• Support 5 languages (en/zh/fr/ar/tw)
2025-09-14 01:14:47 +08:00
yangdx
2686fc526e Change entity type from CreativeWork to Content and update delimiter
• Replace CreativeWork with Content type
• Improve LLM output error messages
• Update prompt for binary relationships
• Fix delimiter corruption examples
2025-09-14 00:55:15 +08:00
yangdx
41cdeaeaad Add Concept and NaturalObject to default entity types 2025-09-13 15:37:11 +08:00
yangdx
f7aa108cc2 Update env.example 2025-09-13 11:27:02 +08:00
Francois van Schalkwyk
7075b3bec6
Update env.example
Fixed spelling typo.
2025-09-11 14:24:19 +01:00
yangdx
87f1b47218 Update env.examples 2025-09-11 15:50:16 +08:00
yangdx
4a21b7f53f Update OpenAI API config docs for max_tokens and max_completion_tokens
• Clarify max_tokens vs max_completion_tokens
• Add Gemini exception note
• Update parameter descriptions
• Add new completion tokens option
2025-09-10 16:23:10 +08:00
Daniel.y
298037d8f7
Merge pull request #2076 from danielaskdd/prompt-refactor
refactor: Optimize Entity Extraction for Small Parameter LLMs with Enhanced Prompt Caching
2025-09-08 15:40:13 +08:00
yangdx
d218f15a62 Refactor entity extraction with system prompts and output limits
- Add system/user prompt separation
- Set max tokens for endless output fix
- Improve extraction error logging
- Update cache type from extract to summary
2025-09-08 15:20:45 +08:00
Shlomi
8d7ef07bbf
fix env file example 2025-09-07 15:22:24 +03:00
yangdx
64bbe7233b Update env.example 2025-09-06 01:24:12 +08:00
yangdx
2db7e4a3e8 Update env.example 2025-09-05 17:13:29 +08:00
yangdx
c903b14849 Bump AIP version to 0214 and update env.example 2025-09-04 12:04:50 +08:00
yangdx
78abb397bf Reorder entity types and add Document type to extraction 2025-09-03 12:44:40 +08:00
yangdx
c86f863fa4 feat: optimize entity extraction for smaller LLMs
Simplify entity relationship extraction process to improve compatibility
and performance with smaller, less capable language models.

Changes:
- Remove iterative gleaning loop with LLM-based continuation decisions
- Simplify to single gleaning pass when entity_extract_max_gleaning > 0
- Streamline entity extraction prompts with clearer instructions
- Add explicit completion delimiter signals in all examples
2025-09-03 10:33:01 +08:00
yangdx
9d81cd724a Fix typo: change "Equiment" to "Equipment" in entity types 2025-09-02 03:19:31 +08:00
yangdx
c8c59c38b0 Fix entity types configuration to support JSON list parsing
- Add JSON parsing for list env vars
- Update entity types example format
- Add list type support to get_env_value
2025-09-01 00:14:57 +08:00
yangdx
57fe1403c3 Update default entity types in env.example configuration 2025-08-31 22:33:34 +08:00
yangdx
d9aa021682 Update env.example 2025-08-30 11:02:53 +08:00
Pedro Fernandes Steimbruch
8430e1a051 fix: adjust the EMBEDDING_BINDING_HOST for openai in the env.example 2025-08-29 09:48:42 -03:00
yangdx
d39afcb831 Add temperature guidance for Qwen3 models in env example 2025-08-29 15:13:52 +08:00
yangdx
925e631a9a refac: Add robust time out handling for LLM request 2025-08-29 13:50:35 +08:00
yangdx
ac2db35160 Update env.example 2025-08-29 10:18:12 +08:00
Sandmeyer
1cd27dc048
docs(config): fix typo in .env comments 2025-08-28 20:23:51 +08:00
yangdx
0be4f0144b Merge branch 'entityTypesServerSupport' 2025-08-27 12:23:58 +08:00
yangdx
ff0a18e08c Unify SUMMARY_LANGUANGE and ENTITY_TYPES implementation method 2025-08-27 12:23:22 +08:00
yangdx
cb0a035076 Update env.example 2025-08-27 11:12:52 +08:00