From d78d73e6666c297f1bd2cd961a112ed654368dc4 Mon Sep 17 00:00:00 2001 From: Daniel Chalef <131175+danielchalef@users.noreply.github.com> Date: Tue, 7 Oct 2025 09:19:26 -0700 Subject: [PATCH] Add comprehensive issue triage documentation MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Created three documentation files to organize and prioritize 100+ open GitHub issues: - ISSUE_TRIAGE.md: Full categorization into 16 clusters by component and theme - DUPLICATES.md: Identified 6 confirmed duplicates and 7 potential duplicate clusters - PRIORITY_ROADMAP.md: Prioritized critical issues (P0-P3) with immediate action plan Key findings: - P0 issues: MCP protocol deprecation, database name config, duplicate detection - Component breakdown: core (25+), database (20+), mcp (12+), llm (12+), search (8+) - Estimated 15-25 issues can be consolidated through deduplication 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude --- DUPLICATES.md | 156 +++++++++++++++++++ ISSUE_TRIAGE.md | 361 ++++++++++++++++++++++++++++++++++++++++++++ PRIORITY_ROADMAP.md | 283 ++++++++++++++++++++++++++++++++++ 3 files changed, 800 insertions(+) create mode 100644 DUPLICATES.md create mode 100644 ISSUE_TRIAGE.md create mode 100644 PRIORITY_ROADMAP.md diff --git a/DUPLICATES.md b/DUPLICATES.md new file mode 100644 index 00000000..026f8329 --- /dev/null +++ b/DUPLICATES.md @@ -0,0 +1,156 @@ +# Duplicate GitHub Issues + +**Date**: 2025-10-07 + +## Confirmed Duplicates (Already Marked) + +### #963 - Duplicate Entities in Neo4j +- **Status**: Marked as duplicate +- **Original Issue**: #875 - Duplicate Entities in Neo4j with custom db name +- **Labels**: bug, duplicate +- **Action**: Can be closed with reference to #875 +- **Notes**: Same core issue - deduplication not working properly + +--- + +### #941 - TaskGroup Errors +- **Title**: ERROR:unhandled errors in a TaskGroup (1 sub-exception) +- **Status**: Marked as duplicate +- **Labels**: duplicate +- **Notes**: Reporter mentioned issue was previously reported in #353 +- **Action**: Close with reference to original issue + +--- + +### #920 - Timezone-Naive/Aware Datetime Comparison +- **Title**: [BUG] edge_operations.py is unable to compare a timezone-naive datetime with a timezone-aware one +- **Status**: Marked as duplicate +- **Labels**: bug, duplicate +- **Original Issue**: Likely resolved by commit 73015e9 "Fix datetime comparison errors by normalizing to UTC" +- **Action**: Close with reference to recent fix. Ask reporter to verify on latest version. + +--- + +### #867 - MCP with GPT-oss Models +- **Title**: MPC does not work with GPT-oss:20 or 120B +- **Status**: Marked as duplicate +- **Duplicate Of**: #831 - [BUG] GPT-oss:20 and 120 +- **Labels**: bug, duplicate +- **Action**: Close with reference to #831 + +--- + +### #801 - Empty Fulltext Search Results +- **Title**: [BUG] empty-result bug in episode_fulltext_search +- **Status**: Marked as duplicate +- **Labels**: bug, duplicate +- **Original Issue**: Likely #810 - Empty group_id handling issues +- **Action**: Close with reference to #810 or related search issue + +--- + +### #787 - Rate Limit with SEMAPHORE_LIMIT +- **Title**: [BUG] Got rate limit even SEMAPHORE_LIMIT=1 in mcp server +- **Status**: Marked as duplicate +- **Labels**: bug, duplicate +- **Action**: Close with reference to original rate limiting issue + +--- + +## Potential Duplicates (Require Investigation) + +### Database Configuration Issues (Likely Related) + +**#851**, **#798**, **#715** - All relate to database name handling +- **#851** - Search only connects to 'neo4j' db +- **#798** - Database name not passed through Graphiti object +- **#715** - Feature request to configure Neo4j database name + +**Analysis**: These may be describing the same underlying issue. #715 appears to be the feature request, while #851 and #798 are bugs from the missing feature. + +**Recommendation**: Fix the root cause and consolidate. Keep #715 as the tracking issue if implementing multi-DB support properly. + +--- + +### Bulk Upload Failures + +**#882**, **#879**, **#871**, **#658** - Bulk upload failures +- **#882** - IndexError during node resolution +- **#879** - ValidationError 'duplicates' field missing +- **#871** - Invalid JSON and index errors +- **#658** - "Bulk ingestion not possible" + +**Analysis**: All appear to be bulk upload failures, potentially from same root cause in validation/schema handling. + +**Recommendation**: Investigate if these are manifestations of same bug. If so, consolidate into single issue. + +--- + +### FalkorDB Query Errors + +**#815**, **#757**, **#731** - FalkorDB query errors +- **#815** - falkordb query error +- **#757** - quickstart_falkordb example query error +- **#731** - Malformed Cypher query on episode insertion + +**Analysis**: May be same underlying issue with FalkorDB Cypher query generation. + +**Recommendation**: Investigate if root cause is same. Could consolidate. + +--- + +### MCP Server + Custom LLM Providers + +**#565**, **#945** - OPENAI_BASE_URL issues +- **#565** - Cross-encoder ignores OPENAI_BASE_URL +- **#945** - Custom OPENAI_BASE_URL causes NaN embeddings + +**Analysis**: Both relate to custom OpenAI-compatible endpoints not being respected in MCP server. + +**Recommendation**: Likely same root issue - configuration not properly passed through MCP components. + +--- + +### Ollama Compatibility + +**#868**, **#831** - Ollama issues +- **#868** - Cannot get minimal example to work with Ollama +- **#831** - GPT-oss:20 and 120 (Ollama models) + +**Analysis**: Both relate to Ollama model compatibility issues. + +--- + +### Search Result Issues + +**#534**, **#801**, **#810** - Search returning empty/no results +- **#534** - retrieve_episodes always returns no results +- **#801** - episode_fulltext_search empty results (marked duplicate) +- **#810** - Empty group_id handled inconsistently + +**Analysis**: Likely all related to search filtering/group_id handling issues. + +--- + +### Documentation/Setup Questions (Can Be Consolidated) + +**#517**, **#530** - OpenRouter setup questions +- **#517** - How to setup with OpenRouter and Voyage +- **#530** - Does this work with Cursor AI + OpenRouter ChatGPT + +**Analysis**: Same topic - using OpenRouter as provider. Could close with documentation reference. + +--- + +## Summary + +- **Confirmed Duplicates (Already Marked)**: 6 issues ready to close +- **Potential Duplicate Clusters**: 7 clusters (20+ issues) requiring investigation +- **Estimated Consolidation**: Could reduce issue count by 15-25 through deduplication + +## Recommended Actions + +1. **Immediate**: Close the 6 confirmed duplicates with appropriate references +2. **Investigation**: Review potential duplicate clusters to confirm root causes +3. **Consolidation**: Create tracking issues for clusters where multiple issues stem from same bug +4. **Documentation**: Several "questions" can be closed once docs are updated diff --git a/ISSUE_TRIAGE.md b/ISSUE_TRIAGE.md new file mode 100644 index 00000000..3cb3a88e --- /dev/null +++ b/ISSUE_TRIAGE.md @@ -0,0 +1,361 @@ +# GitHub Issues Triage & Categorization + +**Date**: 2025-10-07 +**Total Open Issues**: 100+ +**Analysis Scope**: Issues #451 - #972 + +## Issue Clusters + +### 1. Database Driver Support + +#### FalkorDB Issues (6 issues) +- **#972** - [BUG] Vector type mismatch in cosine distance operations + - *Component*: `component:database` + - *Status*: Open + +- **#815** - [BUG] falkordb query + - *Component*: `component:database` + - *Status*: Open + +- **#757** - [BUG] run quickstart_falkordb example, get query error + - *Component*: `component:database` + - *Status*: Open + +- **#749** - [BUG] Official Docker Image doesn't support FalkorDB + - *Component*: `component:database`, `component:mcp` + - *Status*: Open + +- **#731** - Episode insertion fails due to malformed Cypher query + - *Component*: `component:database` + - *Status*: Open + +- **#719** - FalkorDB support in MCP-Server + - *Component*: `component:mcp`, `component:database` + - *Status*: Open + +#### New Database Support Requests (6 issues) +- **#947** - Apache AGE Graph DB support +- **#933** - Support RDF +- **#781** - More graph database support (NebulaGraph mentioned) +- **#779** - Postgres with pgvector support +- **#644** - AWS Neptune driver +- **#643** - Kuzu driver support (may already be implemented?) +- **#642** - MemGraph driver + +**Analysis**: FalkorDB has multiple critical bugs affecting basic operations. Consider deprecating or fixing comprehensively. High demand for alternative backends suggests need for driver abstraction layer. + +--- + +### 2. MCP Server Issues (9 issues) + +- **#923** - [BUG] Switch from SSE (deprecated) to Streamable HTTP + - *Priority*: HIGH - using deprecated protocol + - *Component*: `component:mcp` + +- **#945** - [BUG] Custom OPENAI_BASE_URL causes NaN embeddings + - *Component*: `component:mcp`, `component:llm` + +- **#848** - [BUG] clear_graph tool fails silently (async session bug) + - *Component*: `component:mcp` + +- **#840** - [BUG] Failed to validate request (initialization timing) + - *Component*: `component:mcp` + +- **#723** - DEFAULT_MAX_TOKENS 8192 too restrictive + - *Component*: `component:mcp` + +- **#578** - MCP server with Gemini model + - *Component*: `component:mcp`, `component:llm` + +- **#565** - Cross-encoder ignores OPENAI_BASE_URL + - *Component*: `component:mcp`, `component:llm` + +- **#509** - MCP server add_nodes tool call doesn't work + - *Component*: `component:mcp` + +**Duplicates in this cluster**: +- **#867** + **#831** - GPT-oss:20 and 120B models (duplicate) +- **#787** - Rate limit even with SEMAPHORE_LIMIT=1 (marked duplicate) + +**Analysis**: MCP server has protocol deprecation issue (#923) and multiple configuration/provider issues. Should be high priority for users relying on MCP integration. + +--- + +### 3. LLM Provider Compatibility (8 issues) + +- **#902** - [BUG] OpenAI internal call broken (reasoning.effort param) + - *Component*: `component:llm` + +- **#878** - [BUG] GPT-5 temperature parameter unsupported + - *Component*: `component:llm` + +- **#912** - [BUG] Pydantic validation error with deepseek-r1:7b + - *Component*: `component:llm` + +- **#791** - [BUG] Small model setting defaults to gpt-4.1-nano + - *Component*: `component:llm` + +- **#790** - [BUG] Failed to parse structured response with Gemini + - *Component*: `component:llm` + +- **#868** - [BUG] Cannot work with Ollama + - *Component*: `component:llm` + +- **#763** - [BUG] LLMConfig.max_tokens not respected + - *Component*: `component:llm` + +- **#760** - [BUG] Hallucinations with default models + - *Component*: `component:llm` + +**Additional LLM Provider Requests**: +- **#907** - VSC Copilot models +- **#751** - Alibaba Cloud AI model +- **#724** - Gemini with Google Cloud Credentials +- **#459** - Amazon Bedrock support + +**Analysis**: Structured output compatibility is a recurring theme. Many issues stem from providers not supporting OpenAI's structured output format. Documentation should clearly state which providers are fully compatible. + +--- + +### 4. Duplicate Entities (3 issues) + +- **#963** - [BUG] Duplicate entities in Neo4j (marked duplicate) + - *Component*: `component:core` + - *Status*: Duplicate (see #875) + +- **#875** - [BUG] Duplicate entities with custom db name + - *Component*: `component:core`, `component:database` + - *Priority*: HIGH - core deduplication functionality broken + +- **#774** - Same Chinese text extracted as different facts + - *Component*: `component:core` + +**Analysis**: Core deduplication functionality appears broken, especially with custom database names. Critical for production use. + +--- + +### 5. Bulk Upload Issues (3 issues) + +- **#882** - [BUG] IndexError during node resolution + - *Component*: `component:bulk` + +- **#879** - ValidationError 'duplicates' field missing + - *Component*: `component:bulk` + +- **#871** - Invalid JSON and index errors + - *Component*: `component:bulk` + +**Related**: +- **#747** - Add progress reporting to bulk upload +- **#658** - Bulk ingestion not possible (may be duplicate) + +**Analysis**: Bulk operations are fundamentally broken. Appears to be schema/validation issues in the bulk processing pipeline. + +--- + +### 6. Search Issues (6 issues) + +- **#810** - [BUG] Empty group_id handled inconsistently in search + - *Component*: `component:search` + +- **#801** - [BUG] episode_fulltext_search empty results (marked duplicate) + - *Component*: `component:search` + +- **#838** - Allow searching across all groups when group_ids is None + - *Component*: `component:search` + +- **#772** - [BUG] BFS max_depth parameter ignored + - *Component*: `component:search` + +- **#789** - [BUG] BFS returns duplicate edges with swapped source/target + - *Component*: `component:search` + +- **#777** - [BUG] MMR reranker RuntimeWarning, no results + - *Component*: `component:search` + +**Related**: +- **#488** - edge_search_filter_query_constructor creating incorrect query +- **#534** - retrieve_episodes always returns no results + +**Analysis**: Search functionality has multiple bugs in filtering, BFS traversal, and reranking. Group ID handling is particularly problematic. + +--- + +### 7. Database Configuration (4 issues) + +- **#851** - [BUG] Search only connects to 'neo4j' db, add_episode works with any name + - *Component*: `component:database` + - *Priority*: HIGH - inconsistent behavior + +- **#798** - [BUG] Database name not passed through Graphiti object + - *Component*: `component:database` + - *Priority*: HIGH - related to #851 + +- **#715** - [Feature] Configure Neo4j database name (multi-DB support) + - *Component*: `component:database` + +**Analysis**: Database name configuration is broken. Hardcoded 'neo4j' default causes issues. Related to commit mentioned in CLAUDE.md about hardcoded database names. + +--- + +### 8. Datetime/Timezone Issues (4 issues) + +- **#920** - [BUG] edge_operations.py timezone-naive/aware comparison (marked duplicate) + - *Component*: `component:core` + - *Status*: Duplicate (possibly fixed in commit 73015e9) + +- **#893** - [BUG] Kuzu driver valid_at datetime format + - *Component*: `component:database` + +- **#606** - Add support for datetime fields in custom entities + - *Component*: `component:core` + +**Analysis**: Recent commit 73015e9 "Fix datetime comparison errors by normalizing to UTC" may have addressed some of these. Needs verification. + +--- + +### 9. API/Server Issues (4 issues) + +- **#961** - [Feature Request] Improve Episodes API (UUID, GET by ID, metadata) + - *Component*: `component:server` + +- **#921** - SDK client like zep-cloud/zep-python + - *Component*: `component:server` + +- **#566** - /messages endpoint doesn't persist episodes + - *Component*: `component:server` + +- **#904** - docker-compose.yml env vars override .env + - *Component*: `component:server`, `component:mcp` + +**Analysis**: API functionality gaps and configuration issues. Episode persistence bug is critical. + +--- + +### 10. Error Handling & Validation (3 issues) + +- **#941** - TaskGroup errors (marked duplicate) + - *Component*: `component:core` + +- **#937** - ArgumentError with empty query entities + - *Component*: `component:core` + +- **#951** - Incorrect try import for AsyncOpenSearch + - *Component*: `component:database` + +**Analysis**: Input validation needs improvement to handle edge cases like empty strings. + +--- + +### 11. Feature Requests - Core Functionality (10 issues) + +- **#935** - Create episodes based on DOM structure +- **#934** - Flag contradictions on merging facts +- **#925** - Monitor LLM conversations +- **#905** - ColbertV2 embeddings with Fastembed +- **#864** - How to forget knowledge +- **#819** - Count token usage +- **#669** - Metadata on chunks for RAG +- **#465** - Support ignoring non-custom entities +- **#467** - LLM inference expenses are high + +**Analysis**: Feature requests range from observability (#925, #819) to advanced RAG features (#669, #905). Token cost reduction (#467) is recurring concern. + +--- + +### 12. Documentation/Questions (9 issues) + +- **#913** - How to update data in graph +- **#909** - Reproduce LongMemEval results +- **#869** - Portuguese: Change OpenAI model +- **#853** - Document VCS/extension setup +- **#828** - Get a flowchart +- **#701** - Chinese video tutorial (informational) +- **#484** - OpenAIGenericClient documentation +- **#530** - Cursor AI + OpenRouter setup +- **#517** - OpenRouter and Voyage setup + +*Component*: `component:docs` + +**Analysis**: Many questions indicate documentation gaps, especially around custom LLM provider setup and basic operations. + +--- + +### 13. Code Quality/Refactoring (5 issues) + +- **#836** - [BUG] update_communities broken + - *Component*: `component:core` + +- **#811** - Refactor: hoist EXTRACT_EDGES_MAX_TOKENS constant + - *Component*: `component:core` + +- **#681** - Refactor: no internal ghost variables + - *Component*: `component:core` + +- **#451** - Minor type bug in bulk_utils + - *Component*: `component:bulk` + +- **#717** - Adopt uv workspace for monorepo + - *Component*: `component:core`, `component:server`, `component:mcp` + +**Analysis**: Technical debt items. #717 (uv workspace) would improve developer experience for monorepo. + +--- + +### 14. Cloud/Alternative Providers (2 issues) + +- **#739** - MCP server based on zep-cloud + - *Component*: `component:mcp` + +- **#538** - Azure Cosmos DB version (informational) + +--- + +### 15. Embeddings/Reranking (3 issues) + +- **#728** - Voyage embedder installation issue + - *Component*: `component:core` + +- **#485** - Better Embedder error information + - *Component*: `component:core` + +- **#543** - Hard-coded model in OpenAIRerankerClient + - *Component*: `component:core` + +--- + +### 16. Miscellaneous Bugs (5 issues) + +- **#800** - Blank disconnected nodes in sample project + - *Component*: `component:mcp` + +- **#686** - Relationships not extracted automatically + - *Component*: `component:core` + +- **#687** - Unable to add episodes with OpenAI agents SDK + - *Component*: `component:core` + +- **#587** - Node type not being set + - *Component*: `component:core` + +--- + +## Summary Statistics + +- **Total Clustered Issues**: 100+ +- **Major Clusters**: 16 +- **Confirmed Duplicates**: 6 (marked) +- **High Priority Bugs**: ~15-20 +- **Feature Requests**: ~20 +- **Documentation Gaps**: ~10 + +## Component Breakdown + +- `component:core` - 25+ issues +- `component:database` - 20+ issues (includes all DB drivers) +- `component:mcp` - 12+ issues +- `component:llm` - 12+ issues +- `component:search` - 8+ issues +- `component:bulk` - 5+ issues +- `component:server` - 4+ issues +- `component:docs` - 10+ issues diff --git a/PRIORITY_ROADMAP.md b/PRIORITY_ROADMAP.md new file mode 100644 index 00000000..0c97bdf0 --- /dev/null +++ b/PRIORITY_ROADMAP.md @@ -0,0 +1,283 @@ +# Priority Roadmap - GitHub Issues + +**Date**: 2025-10-07 +**Purpose**: Identify and prioritize critical issues requiring immediate attention + +--- + +## P0 - Critical (Fix Immediately) + +### 1. MCP Server Protocol Deprecation +**Issue #923** - Switch from SSE to Streamable HTTP + +- **Impact**: Using deprecated MCP protocol that will be removed +- **Affected Users**: All MCP server users +- **Risk**: Protocol will stop working when support is removed +- **Effort**: Medium +- **Component**: `component:mcp` + +**Action**: Migrate to new protocol ASAP + +--- + +### 2. Database Name Configuration Broken +**Issues #851, #798** - Database name not passed/respected + +- **Impact**: Cannot use non-default Neo4j databases for search operations +- **Affected Users**: Multi-database deployments, production environments +- **Root Cause**: Hardcoded 'neo4j' database name in search operations +- **Symptoms**: + - Search only works with default 'neo4j' database + - add_episode works with any database name (inconsistent) +- **Effort**: Low-Medium +- **Component**: `component:database`, `component:core` + +**Action**: Fix database name propagation through entire codebase + +--- + +### 3. Duplicate Entity Detection Broken +**Issue #875** (duplicate: #963) + +- **Impact**: Core deduplication functionality not working +- **Affected Users**: All users, especially with custom database names +- **Symptoms**: Same entities created multiple times instead of being merged +- **User Reports**: Tested with multiple LLM models, issue persists +- **Effort**: Medium-High +- **Component**: `component:core` + +**Action**: Debug entity resolution and deduplication logic + +--- + +## P1 - High Priority (Fix This Sprint) + +### 4. Bulk Upload Completely Broken +**Issues #882, #879, #871** - Multiple bulk upload failures + +- **Impact**: Bulk operations unusable +- **Affected Users**: Anyone trying to import large datasets +- **Symptoms**: + - IndexError during node resolution (#882) + - ValidationError for missing 'duplicates' field (#879) + - Invalid JSON errors (#871) +- **Effort**: Medium +- **Component**: `component:bulk` + +**Action**: Fix validation schema and error handling in bulk pipeline + +--- + +### 5. Datetime Comparison Issues +**Issues #920, #893, #606** + +- **Impact**: Crashes when comparing dates +- **Status**: May be partially fixed by commit 73015e9 +- **Affected Users**: All users with temporal queries +- **Effort**: Low (if recent fix is complete) +- **Component**: `component:core`, `component:database` + +**Action**: Verify recent datetime UTC normalization fix covers all cases + +--- + +### 6. Search Group ID Handling +**Issues #810, #838, #801** - Inconsistent group_id behavior + +- **Impact**: Search returns empty results or behaves inconsistently +- **Affected Users**: Multi-tenant deployments using group_ids +- **Symptoms**: + - Empty group_id handled differently in fulltext vs similarity search + - Cannot search across all groups +- **Effort**: Low-Medium +- **Component**: `component:search` + +**Action**: Standardize group_id handling across search methods + +--- + +### 7. BFS Search Bugs +**Issues #772, #789** - BFS traversal broken + +- **Impact**: Graph traversal doesn't work as expected +- **Symptoms**: + - max_depth parameter completely ignored (#772) + - Duplicate edges with swapped source/target (#789) +- **Effort**: Medium +- **Component**: `component:search` + +**Action**: Fix BFS implementation + +--- + +## P2 - Medium Priority (Address Soon) + +### 8. MCP Server Configuration Issues +**Issues #945, #840, #848, #565** - Various MCP bugs + +- **#945**: Custom OPENAI_BASE_URL causes NaN embeddings +- **#840**: "Failed to validate request" initialization timing +- **#848**: clear_graph fails silently (async session bug) +- **#565**: Cross-encoder ignores OPENAI_BASE_URL + +**Impact**: MCP server unreliable with custom configurations +**Effort**: Low-Medium per issue +**Component**: `component:mcp` + +--- + +### 9. LLM Provider Compatibility +**Issues #878, #902, #912, #791** - Provider-specific bugs + +- **#902**: OpenAI reasoning.effort parameter breaks API +- **#878**: GPT-5 doesn't support temperature parameter +- **#791**: Small model setting ignored, always defaults to gpt-4.1-nano +- **#912**: Pydantic validation errors with non-OpenAI models + +**Impact**: Specific LLM models/providers broken +**Effort**: Low per issue +**Component**: `component:llm` + +**Strategy**: Add provider capability detection and parameter filtering + +--- + +### 10. FalkorDB Driver Issues +**Issues #972, #815, #757, #731, #749** - FalkorDB broken + +- **Impact**: FalkorDB backend largely non-functional +- **Effort**: High (multiple issues) +- **Component**: `component:database` + +**Decision Point**: Fix comprehensively or deprecate? Appears unmaintained. + +--- + +### 11. Error Handling Improvements +**Issues #937, #951** - Edge case crashes + +- **#937**: Empty query strings cause ArgumentError +- **#951**: Incorrect import fallback for AsyncOpenSearch + +**Impact**: Crashes on edge cases +**Effort**: Low per issue +**Component**: `component:core` + +--- + +### 12. API Server Issues +**Issue #566** - /messages endpoint doesn't persist episodes + +- **Impact**: Core API functionality broken +- **Effort**: Medium +- **Component**: `component:server` + +**Action**: Debug why episodes aren't being persisted + +--- + +### 13. Code Quality Issues +**Issues #836, #811, #681, #451** - Technical debt + +- **#836**: update_communities broken (tuple unpacking error) +- **#811**: Hoist hardcoded token constant +- **#681**: Remove ghost variables +- **#451**: Type bug in bulk_utils + +**Impact**: Medium (functionality broken in #836, others are maintenance) +**Effort**: Low per issue + +--- + +## P3 - Lower Priority / Feature Requests + +### 14. Feature Enhancements +- **#961** - Improve Episodes API (UUID, GET by ID, metadata) +- **#935** - Create episodes based on DOM +- **#934** - Flag contradictions on merging facts +- **#925** - Monitor LLM conversations +- **#819** - Count token usage +- **#747** - Progress reporting for bulk upload +- **#669** - Metadata on chunks for RAG +- **#465** - Support ignoring non-custom entities + +### 15. New Provider Support +- **#907** - VSC Copilot models +- **#905** - ColbertV2 embeddings +- **#751** - Alibaba Cloud AI +- **#724** - Gemini with GCP credentials +- **#459** - Amazon Bedrock +- **#739** - zep-cloud based MCP + +### 16. Documentation Improvements +- **#913** - How to update data in graph +- **#853** - VCS/extension setup +- **#828** - Flowchart documentation +- **#484** - OpenAIGenericClient documentation +- All "How to" questions + +### 17. Alternative Database Support +- **#947** - Apache AGE +- **#933** - RDF support +- **#781** - NebulaGraph +- **#779** - Postgres/pgvector +- **#644** - AWS Neptune +- **#643** - Kuzu +- **#642** - MemGraph + +--- + +## Immediate Action Plan (Next 2 Weeks) + +### Week 1 +1. **Fix #923** - MCP protocol migration (CRITICAL) +2. **Fix #851/#798** - Database name configuration +3. **Fix #875** - Duplicate entity detection +4. **Verify #920** - Datetime issues resolved by recent commit + +### Week 2 +5. **Fix #882/#879/#871** - Bulk upload pipeline +6. **Fix #810** - Group ID handling in search +7. **Fix #772/#789** - BFS search issues +8. **Close duplicates** - Process 6+ confirmed duplicates + +### Ongoing +- **Investigate duplicate clusters** - Consolidate related issues +- **Document workarounds** - For issues that can't be fixed immediately +- **Provider compatibility matrix** - Document which LLMs work fully + +--- + +## Impact Summary + +**Critical Issues Blocking Production Use**: +- Database name configuration (multi-DB deployments) +- Duplicate entity detection (core functionality) +- Bulk upload failures (data ingestion at scale) +- MCP protocol deprecation (future compatibility) + +**High-Impact Bugs Affecting Many Users**: +- Search group_id handling +- BFS traversal issues +- Datetime comparison problems +- Various MCP server configuration bugs + +**Lower Impact**: +- Specific LLM provider incompatibilities (workarounds available) +- FalkorDB issues (alternative: Neo4j) +- Feature requests and enhancements + +--- + +## Component Labels to Apply + +Use these labels when triaging the issues above: + +- `component:core` - Core library +- `component:mcp` - MCP server +- `component:database` - Database drivers +- `component:search` - Search functionality +- `component:bulk` - Bulk operations +- `component:server` - FastAPI server +- `component:llm` - LLM providers +- `component:docs` - Documentation