Add comprehensive TypeScript migration documentation for LightRAG (#1)

* Initial plan

* Initialize reverse documentation directory and create analysis plan

Co-authored-by: raphaelmansuy <1003084+raphaelmansuy@users.noreply.github.com>

* Add executive summary and comprehensive architecture documentation

Co-authored-by: raphaelmansuy <1003084+raphaelmansuy@users.noreply.github.com>

* Add comprehensive data models and dependency migration documentation

Co-authored-by: raphaelmansuy <1003084+raphaelmansuy@users.noreply.github.com>

* Complete comprehensive TypeScript migration documentation suite

Co-authored-by: raphaelmansuy <1003084+raphaelmansuy@users.noreply.github.com>

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: raphaelmansuy <1003084+raphaelmansuy@users.noreply.github.com>


@@ -0,0 +1,290 @@
# LightRAG TypeScript Migration Documentation
## Overview
This documentation suite provides comprehensive technical analysis and migration guidance for reimplementing LightRAG from Python to TypeScript/Node.js. The documentation is designed for senior developers and architects who need to build a production-ready TypeScript version of LightRAG while maintaining functional parity with the original implementation.
## Documentation Structure
### 1. [Executive Summary](./01-executive-summary.md) (16KB)
High-level overview of the LightRAG system, its capabilities, architecture, and key migration challenges.
**Contents:**
- System overview and core capabilities
- Architecture at a glance with Mermaid diagram
- Key technical characteristics
- Migration challenges and recommended solutions
- TypeScript technology stack recommendations
- Success metrics and next steps
**Target Audience:** Decision makers, project managers, senior architects
### 2. [Architecture Documentation](./02-architecture-documentation.md) (33KB)
Detailed system architecture with comprehensive diagrams showing component interactions, data flows, and design patterns.
**Contents:**
- 5-layer system architecture
- Component interaction patterns
- Document indexing data flow (with sequence diagram)
- Query processing data flow (with sequence diagram)
- Storage layer architecture
- Concurrency and state management patterns
- TypeScript migration considerations for each pattern
**Key Diagrams:**
- System architecture (5 layers)
- Component interactions
- Indexing sequence diagram
- Query sequence diagram
- Storage layer structure
- Concurrency control patterns
**Target Audience:** System architects, technical leads
### 3. [Data Models and Schemas](./03-data-models-and-schemas.md) (27KB)
Complete type system documentation with Python and TypeScript definitions side-by-side.
**Contents:**
- Core data models (TextChunk, Entity, Relationship, DocStatus)
- Storage schema definitions (KV, Vector, Graph, DocStatus)
- Query and response models
- Configuration models
- Python to TypeScript type mapping guide
- Validation and serialization strategies
**Key Features:**
- Every data structure documented with field descriptions
- TypeScript type definitions provided
- Validation rules and constraints
- Storage patterns explained
- Runtime validation examples with Zod
**Target Audience:** Developers, data engineers
### 4. [Dependency Migration Guide](./04-dependency-migration-guide.md) (27KB)
Comprehensive mapping of Python packages to Node.js/npm equivalents with complexity assessment.
**Contents:**
- Core dependencies mapping (40+ packages)
- Storage driver equivalents (PostgreSQL, MongoDB, Redis, Neo4j, etc.)
- LLM and embedding provider equivalents (OpenAI, Anthropic, Ollama)
- API and web framework alternatives (FastAPI → Fastify)
- Utility library equivalents
- Async/await pattern differences
- Migration complexity assessment (Low/Medium/High)
- Version compatibility matrix
**Key Tables:**
- Python package → npm package mapping
- Migration complexity per category
- Recommended versions with notes
- Code comparison examples
**Target Audience:** Developers, technical leads
### 5. [TypeScript Project Structure and Migration Roadmap](./05-typescript-project-structure-and-roadmap.md) (29KB)
Complete project organization, configuration, and phase-by-phase implementation plan.
**Contents:**
- Recommended directory structure
- Module organization patterns
- Complete configuration files (package.json, tsconfig.json, etc.)
- Build and development workflow
- Testing strategy (unit, integration, E2E)
- 14-week phase-by-phase migration roadmap
- CI/CD pipeline configuration
- Docker and deployment setup
**Key Sections:**
- Detailed file structure with example code
- Configuration for all build tools
- Testing examples with Vitest
- Phase-by-phase roadmap with deliverables
- Docker and Kubernetes configuration
- GitHub Actions CI/CD
**Target Audience:** Developers, DevOps engineers
## Quick Start Guide
### For Decision Makers
1. Read [Executive Summary](./01-executive-summary.md) for high-level overview
2. Review migration challenges and technology stack recommendations
3. Check estimated timeline in [Migration Roadmap](./05-typescript-project-structure-and-roadmap.md#phase-by-phase-migration-roadmap)
### For Architects
1. Read [Architecture Documentation](./02-architecture-documentation.md) to understand system design
2. Study component interaction patterns and data flows
3. Review [Data Models](./03-data-models-and-schemas.md) for data architecture
4. Check [TypeScript Project Structure](./05-typescript-project-structure-and-roadmap.md) for implementation approach
### For Developers
1. Start with [Data Models and Schemas](./03-data-models-and-schemas.md) to understand types
2. Review [Dependency Migration Guide](./04-dependency-migration-guide.md) for library equivalents
3. Study [Project Structure](./05-typescript-project-structure-and-roadmap.md) for code organization
4. Follow phase-by-phase roadmap for implementation sequence
## Key Insights
### System Architecture
- **5-layer architecture**: Presentation → API Gateway → Business Logic → Integration → Infrastructure
- **4 storage types**: KV (cache/chunks), Vector (embeddings), Graph (entities/relations), DocStatus (pipeline state)
- **6 query modes**: local, global, hybrid, mix, naive, bypass
- **Async-first design**: Semaphore-based rate limiting, keyed locks, task queues
### Migration Feasibility
- **Overall Complexity**: Medium (12-14 weeks with a small team)
- **High-Risk Areas**: Vector search (FAISS alternatives), NetworkX (use graphology)
- **Low-Risk Areas**: PostgreSQL, MongoDB, Redis, Neo4j, OpenAI, API layer
- **Recommended Stack**: Node.js 20 LTS, TypeScript 5.3+, Fastify, Zod, pnpm
### Technology Choices
**Storage**:
- PostgreSQL: `pg` + optional `drizzle-orm` for type safety
- MongoDB: Official `mongodb` driver
- Redis: `ioredis` for best TypeScript support
- Neo4j: Official `neo4j-driver`
- Graph: `graphology` (NetworkX equivalent)
- Vector: Qdrant, Milvus, or PostgreSQL with pgvector
**LLM Integration**:
- OpenAI: Official `openai` SDK (v4+)
- Anthropic: `@anthropic-ai/sdk`
- Ollama: Official `ollama` package
- Tokenization: `@dqbd/tiktoken` (WASM port)
**Web Framework**:
- API: `fastify` (FastAPI equivalent)
- Validation: `zod` (Pydantic equivalent)
- Authentication: `@fastify/jwt`
- Documentation: `@fastify/swagger`
**Utilities**:
- Async control: `p-limit`, `p-queue`, `p-retry`
- Logging: `pino` (fast, structured)
- Testing: `vitest` (fast, TypeScript-native)
- Build: `tsup` (fast bundler)
## Implementation Roadmap Summary
### Phase 1-2: Foundation & Storage (Weeks 1-5)
- Set up project structure and tooling
- Implement storage abstractions and PostgreSQL reference implementation
- Add alternative storage backends (MongoDB, Redis, File-based)
- **Deliverable**: Working storage layer with tests
### Phase 3-4: LLM & Core Engine (Weeks 6-8)
- Integrate LLM providers (OpenAI, Anthropic, Ollama)
- Implement document processing pipeline (chunking, extraction, merging)
- Add vector embedding and indexing
- **Deliverable**: Complete document ingestion pipeline
### Phase 5: Query Engine (Weeks 9-10)
- Implement all 6 query modes
- Add token budget management
- Integrate reranking
- **Deliverable**: Complete query functionality
### Phase 6: API Layer (Week 11)
- Build REST API with Fastify
- Add authentication and authorization
- Implement streaming responses
- **Deliverable**: Production API
### Phase 7-8: Testing & Production (Weeks 12-14)
- Comprehensive testing (unit, integration, E2E)
- Performance optimization
- Production hardening (monitoring, logging, deployment)
- **Deliverable**: Production-ready system
## Documentation Statistics
- **Total Documentation**: ~140KB across 5 major documents
- **Mermaid Diagrams**: 6 comprehensive architecture diagrams
- **Code Examples**: 100+ Python/TypeScript comparison snippets
- **Dependency Mapping**: 40+ Python packages → npm equivalents
- **Type Definitions**: Complete TypeScript types for all data structures
- **Configuration Files**: 10+ complete config examples
## Success Criteria
A successful TypeScript migration achieves:
1. **Functional Parity**: All query modes, storage backends, and LLM providers working identically
2. **API Compatibility**: Existing WebUI works without modification
3. **Performance**: Comparable or better throughput and latency
4. **Type Safety**: Full TypeScript coverage, minimal use of `any`
5. **Test Coverage**: >80% code coverage with comprehensive tests
6. **Production Ready**: Handles errors gracefully, provides observability, scales horizontally
7. **Documentation**: Complete API docs, deployment guides, migration notes
## Additional Resources
### Python Repository
- **URL**: https://github.com/HKUDS/LightRAG
- **Paper**: EMNLP 2025 - "LightRAG: Simple and Fast Retrieval-Augmented Generation"
- **License**: MIT
### Related Documentation
- **README.md**: Repository root documentation
- **README-zh.md**: Chinese version of documentation
- **API Documentation**: `lightrag/api/README.md`
- **Examples**: `examples/` directory with usage samples
### Existing TypeScript Reference
The repository already includes a TypeScript WebUI (`lightrag_webui/`) that provides:
- TypeScript type definitions for API responses
- API client implementation patterns
- Data model usage examples
- Component patterns that can be referenced
## Getting Help
### Common Questions
**Q: Can I use alternative storage backends not mentioned?**
A: Yes, as long as they implement the storage interfaces. The documentation provides patterns for implementing custom storage backends.
**Q: Do I need to implement all query modes?**
A: For a complete migration, yes. However, you can prioritize modes based on your use case (mix and naive are most commonly used).
**Q: Can I use a different web framework than Fastify?**
A: Yes, Express is a viable alternative. Fastify is recommended for performance and TypeScript support, but the core logic is framework-agnostic.
**Q: How do I handle FAISS migration?**
A: Use Qdrant or Milvus for production, or hnswlib-node for a local alternative. See the dependency guide for detailed comparison.
**Q: Is the 14-week timeline realistic?**
A: Yes, with a team of 2-3 experienced TypeScript developers working full-time. Adjust based on your team size and experience.
### Contact
For questions about this documentation or the migration:
- Open an issue in the LightRAG repository
- Refer to the original paper for algorithm details
- Check existing examples in the `examples/` directory
## Maintenance and Updates
This documentation reflects the LightRAG codebase as of the repository snapshot date. Key version references:
- **Python Version**: 3.10+
- **Node.js Version**: 20 LTS recommended (18+ minimum)
- **TypeScript Version**: 5.3+ recommended
When updating this documentation:
1. Keep Python and TypeScript examples in sync
2. Update version numbers in dependency tables
3. Validate code examples against latest library versions
4. Update Mermaid diagrams if architecture changes
5. Maintain consistent styling and formatting
## License
This documentation inherits the MIT license from the LightRAG project. Feel free to use, modify, and distribute as needed for your TypeScript implementation.
---
**Last Updated**: 2025
**Documentation Version**: 1.0
**Target LightRAG Version**: Latest (main branch)


@@ -0,0 +1,271 @@
# Executive Summary: LightRAG TypeScript Migration
## Overview
LightRAG (Light Retrieval-Augmented Generation) is a sophisticated, graph-based RAG system implemented in Python that combines knowledge graph construction with vector retrieval to deliver contextually rich question-answering capabilities. This document provides comprehensive technical analysis to enable a production-ready TypeScript/Node.js reimplementation.
**Repository**: [HKUDS/LightRAG](https://github.com/HKUDS/LightRAG)
**Paper**: EMNLP 2025 - "LightRAG: Simple and Fast Retrieval-Augmented Generation"
**Current Implementation**: Python 3.10+, ~58 Python files, ~500KB of core code
**License**: MIT
## System Capabilities
LightRAG delivers a comprehensive RAG solution with the following core capabilities:
### Document Processing Pipeline
The system implements a multi-stage document processing pipeline that transforms raw documents into a queryable knowledge graph. Documents are ingested, split into semantic chunks, processed through LLM-based entity and relationship extraction, and merged into a unified knowledge graph with vector embeddings for retrieval. The pipeline supports multiple file formats (PDF, DOCX, PPTX, CSV, TXT) and handles batch processing with status tracking and error recovery.
### Knowledge Graph Construction
At the heart of LightRAG is an automated knowledge graph construction system that extracts entities and relationships from text using large language models. The system identifies entity types (Person, Organization, Location, Event, Concept, etc.), establishes relationships between entities, and maintains entity descriptions and relationship metadata. Graph merging algorithms consolidate duplicate entities and relationships, while maintaining source attribution and citation tracking.
### Multi-Modal Retrieval Strategies
LightRAG implements six distinct query modes that balance specificity against coverage:
- **Local Mode**: Focuses on entity-centric retrieval using low-level keywords to find specific, context-dependent information
- **Global Mode**: Emphasizes relationship-centric retrieval using high-level keywords for broader, interconnected insights
- **Hybrid Mode**: Combines local and global results using round-robin merging for balanced coverage
- **Mix Mode**: Integrates knowledge graph data with vector-retrieved document chunks for comprehensive context
- **Naive Mode**: Pure vector retrieval without knowledge graph integration for simple similarity search
- **Bypass Mode**: Direct LLM query without retrieval for general questions
### Flexible Storage Architecture
The system supports multiple storage backends through a pluggable architecture:
- **Key-Value Storage**: JSON files, PostgreSQL, MongoDB, Redis (for LLM cache, chunks, and documents)
- **Vector Storage**: NanoVectorDB, FAISS, Milvus, Qdrant, PostgreSQL with pgvector (for embeddings)
- **Graph Storage**: NetworkX, Neo4j, Memgraph, PostgreSQL (for entity-relationship graphs)
- **Document Status Storage**: JSON files, PostgreSQL, MongoDB (for pipeline tracking)
### Production Features
The system includes enterprise-ready features for production deployment:
- RESTful API with FastAPI and OpenAPI documentation
- WebUI for document management, graph visualization, and querying
- Authentication and authorization (JWT-based)
- Streaming responses for real-time user feedback
- Ollama-compatible API for integration with AI chatbots
- Workspace isolation for multi-tenant deployments
- Pipeline status tracking and monitoring
- Configurable rate limiting and concurrency control
- Error handling and retry mechanisms
- Citation and source attribution support
## Architecture at a Glance
```mermaid
graph TB
subgraph "Client Layer"
WebUI["WebUI<br/>(TypeScript/React)"]
API["REST API Client"]
end
subgraph "API Layer"
FastAPI["FastAPI Server<br/>(lightrag_server.py)"]
Routes["Route Handlers<br/>Query/Document/Graph"]
Auth["Authentication<br/>(JWT)"]
end
subgraph "Core LightRAG Engine"
LightRAG["LightRAG Class<br/>(lightrag.py)"]
Operations["Operations Layer<br/>(operate.py)"]
Utils["Utilities<br/>(utils.py)"]
end
subgraph "Processing Pipeline"
Chunking["Text Chunking<br/>(Token-based)"]
Extraction["Entity Extraction<br/>(LLM-based)"]
Merging["Graph Merging<br/>(Deduplication)"]
Indexing["Vector Indexing<br/>(Embeddings)"]
end
subgraph "Query Pipeline"
KeywordExtract["Keyword Extraction<br/>(High/Low Level)"]
GraphRetrieval["Graph Retrieval<br/>(Entities/Relations)"]
VectorRetrieval["Vector Retrieval<br/>(Chunks)"]
ContextBuild["Context Building<br/>(Token Budget)"]
LLMGen["LLM Generation<br/>(Response)"]
end
subgraph "LLM Integration"
LLMProvider["LLM Provider<br/>(OpenAI, Ollama, etc.)"]
EmbedProvider["Embedding Provider<br/>(text-embedding-3-small)"]
end
subgraph "Storage Layer"
KVStorage["KV Storage<br/>(Cache/Chunks/Docs)"]
VectorStorage["Vector Storage<br/>(Embeddings)"]
GraphStorage["Graph Storage<br/>(Entities/Relations)"]
DocStatus["Doc Status Storage<br/>(Pipeline State)"]
end
WebUI --> FastAPI
API --> FastAPI
FastAPI --> Routes
Routes --> Auth
Routes --> LightRAG
LightRAG --> Operations
LightRAG --> Utils
LightRAG --> Chunking
Chunking --> Extraction
Extraction --> Merging
Merging --> Indexing
LightRAG --> KeywordExtract
KeywordExtract --> GraphRetrieval
KeywordExtract --> VectorRetrieval
GraphRetrieval --> ContextBuild
VectorRetrieval --> ContextBuild
ContextBuild --> LLMGen
Operations --> LLMProvider
Operations --> EmbedProvider
LightRAG --> KVStorage
LightRAG --> VectorStorage
LightRAG --> GraphStorage
LightRAG --> DocStatus
style WebUI fill:#E6F3FF
style FastAPI fill:#FFE6E6
style LightRAG fill:#E6FFE6
style LLMProvider fill:#FFF5E6
style KVStorage fill:#FFE6E6
style VectorStorage fill:#FFE6E6
style GraphStorage fill:#FFE6E6
style DocStatus fill:#FFE6E6
```
## Key Technical Characteristics
### Async-First Architecture
The entire system is built on Python's asyncio, with extensive use of async/await patterns, semaphores for rate limiting, and task queues for concurrent processing. This design enables efficient handling of I/O-bound operations and supports high concurrency for embedding generation and LLM calls.
### Storage Abstraction Pattern
LightRAG implements a clean abstraction layer over storage backends through base classes (BaseKVStorage, BaseVectorStorage, BaseGraphStorage, DocStatusStorage). This pattern enables seamless switching between different storage implementations without modifying core logic, supporting everything from in-memory JSON files to enterprise databases like PostgreSQL and Neo4j.
### Pipeline-Based Processing
Document ingestion follows a multi-stage pipeline pattern: enqueue → validate → chunk → extract → merge → index. Each stage is idempotent and resumable, with comprehensive status tracking enabling fault tolerance and progress monitoring. Documents flow through the pipeline with track IDs for monitoring and debugging.
### Token Budget Management
The system implements sophisticated token budget management for query contexts, allocating tokens across entities, relationships, and chunks while respecting LLM context window limits. This unified token control system ensures optimal use of available context space and prevents token overflow errors.
### Modular LLM Integration
LLM and embedding providers are abstracted behind function interfaces, supporting multiple providers (OpenAI, Ollama, Anthropic, AWS Bedrock, Azure OpenAI, Hugging Face, and more) with consistent error handling, retry logic, and rate limiting across all providers.
## Key Migration Challenges
### Challenge 1: Monolithic File Structure
**Issue**: Core logic is concentrated in large files (lightrag.py: 141KB, operate.py: 164KB, utils.py: 106KB) with high cyclomatic complexity.
**Impact**: Direct translation would create unmaintainable TypeScript code.
**Strategy**: Refactor into smaller, focused modules following single responsibility principle. Break down large classes into composition patterns. Leverage TypeScript's module system for better organization.
### Challenge 2: Python-Specific Language Features
**Issue**: Heavy use of Python dataclasses, decorators (@final, @dataclass), type hints (TypedDict, Literal, overload), and metaprogramming patterns.
**Impact**: These features don't have direct TypeScript equivalents.
**Strategy**: Use TypeScript classes with decorators from libraries like class-validator and class-transformer. Leverage TypeScript's type system for Literal types and union types. Replace overload decorators with function overloading syntax.
### Challenge 3: Async/Await Pattern Differences
**Issue**: Python's asyncio model differs from Node.js event loop, particularly in semaphore usage, task cancellation, and exception handling in concurrent operations.
**Impact**: Concurrency patterns require redesign for Node.js runtime.
**Strategy**: Use p-limit for semaphore-like behavior, AbortController for cancellation, and Promise.allSettled for concurrent operations with individual error handling. Leverage async iterators for streaming.
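As a concrete illustration of these patterns, the sketch below combines `AbortController` and `Promise.allSettled`; `callLLM` is a hypothetical provider call that honors an abort signal, not part of any real SDK.
```typescript
// Hypothetical LLM call that respects cancellation via AbortSignal.
declare function callLLM(prompt: string, signal: AbortSignal): Promise<string>;

async function extractConcurrently(prompts: string[]): Promise<string[]> {
  // AbortController replaces asyncio task cancellation: aborting the
  // controller signals every in-flight call sharing this signal.
  const controller = new AbortController();
  const timeout = setTimeout(() => controller.abort(), 60_000);

  try {
    // Promise.allSettled mirrors asyncio.gather(return_exceptions=True):
    // each task reports success or failure without cancelling its siblings.
    const settled = await Promise.allSettled(
      prompts.map((p) => callLLM(p, controller.signal)),
    );
    return settled
      .filter((r): r is PromiseFulfilledResult<string> => r.status === 'fulfilled')
      .map((r) => r.value);
  } finally {
    clearTimeout(timeout);
  }
}
```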
### Challenge 4: Storage Driver Ecosystem
**Issue**: Python has mature drivers for PostgreSQL (asyncpg), MongoDB (motor), Neo4j (neo4j-driver), Redis (redis-py), while Node.js alternatives have different APIs and capabilities.
**Impact**: Storage layer requires careful driver selection and adapter implementation.
**Strategy**: Use node-postgres for PostgreSQL, mongodb driver for MongoDB, neo4j-driver-lite for Neo4j, ioredis for Redis. Create consistent adapter layer to abstract driver differences.
### Challenge 5: Embedding and Tokenization Libraries
**Issue**: tiktoken (OpenAI's tokenizer) and sentence-transformers have limited or no Node.js support.
**Impact**: Need alternative approaches for tokenization and local embeddings.
**Strategy**: Use @dqbd/tiktoken (WASM port) for tokenization, or js-tiktoken as alternative. For embeddings, use OpenAI API, Hugging Face Inference API, or ONNX Runtime for local model inference.
### Challenge 6: Complex State Management
**Issue**: Python uses global dictionaries and namespace-based state management (global_config, pipeline_status, keyed locks) with multiprocessing considerations.
**Impact**: State management in Node.js requires different patterns.
**Strategy**: Use class-based state management with dependency injection. Implement singleton pattern for shared state. Use Redis or similar for distributed state in multi-process deployments.
## Recommended TypeScript Technology Stack
### Runtime and Core
- **Runtime**: Node.js 20 LTS (for latest async features and stability)
- **Language**: TypeScript 5.3+ (for latest type system features)
- **Build Tool**: esbuild or swc (for fast builds)
- **Package Manager**: pnpm (for efficient dependency management)
### Web Framework
- **API Framework**: Fastify or Express with TypeScript
- **Validation**: Zod or class-validator
- **OpenAPI**: @fastify/swagger or tsoa
### Storage Drivers
- **PostgreSQL**: pg with @types/pg, or Drizzle ORM for type-safe queries
- **MongoDB**: mongodb driver with TypeScript support
- **Neo4j**: neo4j-driver with TypeScript bindings
- **Redis**: ioredis (best TypeScript support)
- **Vector**: @pinecone-database/pinecone, @qdrant/js-client-rest, or pg with pgvector
### LLM and Embeddings
- **OpenAI**: openai (official SDK)
- **Anthropic**: @anthropic-ai/sdk
- **Generic LLM**: langchain or custom adapters
- **Tokenization**: @dqbd/tiktoken or js-tiktoken
- **Embeddings**: OpenAI API, or @xenova/transformers for local
### Utilities
- **Async Control**: p-limit, p-queue, bottleneck
- **Logging**: pino or winston
- **Configuration**: dotenv, convict
- **Testing**: vitest (fast, TypeScript-native)
- **Hashing**: crypto (built-in), or js-md5
- **JSON Repair**: jsonrepair
## Migration Approach Recommendation
### Phase 1: Core Abstractions (Weeks 1-2)
Establish foundational abstractions: storage interfaces, base classes, type definitions, and configuration management. This creates the contract layer that all other components will depend on. Implement basic in-memory storage to enable early testing.
### Phase 2: Storage Layer (Weeks 3-5)
Implement storage adapters for primary backends (PostgreSQL, NetworkX-equivalent using graphology, NanoVectorDB-equivalent). Focus on KV and Vector storage first, then Graph storage, finally Doc Status storage. Each storage type should pass identical test suites regardless of backend.
### Phase 3: LLM Integration (Weeks 4-6, parallel)
Build LLM and embedding provider adapters, starting with OpenAI as reference implementation. Implement retry logic, rate limiting, and error handling. Create abstract interfaces that other providers can implement. Add streaming support for responses.
### Phase 4: Core Engine (Weeks 6-8)
Implement the LightRAG core engine: chunking, entity extraction, graph merging, and indexing pipeline. This requires integrating storage, LLM, and utility layers. Focus on making the pipeline idempotent and resumable with comprehensive state tracking.
### Phase 5: Query Pipeline (Weeks 8-10)
Build the query engine with all six retrieval modes. Implement keyword extraction, graph retrieval, vector retrieval, context building with token budgets, and response generation. Add support for conversation history and streaming responses.
### Phase 6: API Layer (Weeks 10-11)
Develop RESTful API with Fastify or Express, implementing all endpoints from the Python version. Add authentication, authorization, request validation, and OpenAPI documentation. Ensure API compatibility with existing WebUI.
### Phase 7: Testing and Optimization (Weeks 11-13)
Comprehensive testing including unit tests, integration tests, and end-to-end tests. Performance testing and optimization, particularly for concurrent operations. Load testing for production readiness. Documentation updates.
### Phase 8: Production Hardening (Weeks 13-14)
Add monitoring, logging, error tracking, health checks, and deployment configurations. Implement graceful shutdown, connection pooling, and resource cleanup. Create Docker images and Kubernetes configurations.
## Success Metrics
A successful TypeScript implementation should achieve:
1. **Functional Parity**: All query modes, storage backends, and LLM providers working identically to Python version
2. **API Compatibility**: Existing WebUI works without modification against TypeScript API
3. **Performance**: Comparable or better throughput and latency for document ingestion and query operations
4. **Type Safety**: Full TypeScript type coverage with no `any` types in core logic
5. **Test Coverage**: >80% code coverage with unit and integration tests
6. **Production Ready**: Handles errors gracefully, provides observability, scales horizontally
7. **Documentation**: Complete API documentation, deployment guides, and migration notes
## Next Steps
This executive summary provides the foundation for detailed technical documentation. The following documents dive deeper into:
- **Architecture Documentation**: Detailed system design with comprehensive diagrams
- **Data Models and Schemas**: Complete type definitions for all data structures
- **Storage Layer Specification**: In-depth analysis of each storage implementation
- **LLM Integration Guide**: Provider-specific integration patterns
- **API Reference**: Complete endpoint documentation with TypeScript types
- **Implementation Roadmap**: Detailed phase-by-phase migration guide
Each subsequent document builds on this foundation, providing the specific technical details needed to implement a production-ready TypeScript version of LightRAG.


@@ -0,0 +1,568 @@
# Architecture Documentation: LightRAG System Design
## Table of Contents
1. [System Architecture Overview](#system-architecture-overview)
2. [Component Interaction Architecture](#component-interaction-architecture)
3. [Document Indexing Data Flow](#document-indexing-data-flow)
4. [Query Processing Data Flow](#query-processing-data-flow)
5. [Storage Layer Architecture](#storage-layer-architecture)
6. [Concurrency and State Management](#concurrency-and-state-management)
## System Architecture Overview
LightRAG follows a layered architecture pattern with clear separation of concerns. The system is structured into five primary layers, each with specific responsibilities and well-defined interfaces.
```mermaid
graph TB
subgraph "Presentation Layer"
A1["WebUI (TypeScript)<br/>React + Vite"]
A2["API Clients<br/>REST/OpenAPI"]
end
subgraph "API Gateway Layer"
B1["FastAPI Server<br/>lightrag_server.py"]
B2["Authentication<br/>JWT + API Keys"]
B3["Request Validation<br/>Pydantic Models"]
B4["Route Handlers<br/>Query/Document/Graph"]
end
subgraph "Business Logic Layer"
C1["LightRAG Core<br/>lightrag.py"]
C2["Operations Module<br/>operate.py"]
C3["Utilities<br/>utils.py + utils_graph.py"]
C4["Prompt Templates<br/>prompt.py"]
end
subgraph "Integration Layer"
D1["LLM Providers<br/>OpenAI, Ollama, etc."]
D2["Embedding Providers<br/>text-embedding-*"]
D3["Storage Adapters<br/>KV/Vector/Graph/Status"]
end
subgraph "Infrastructure Layer"
E1["PostgreSQL<br/>Relational + Vector"]
E2["Neo4j/Memgraph<br/>Graph Database"]
E3["Redis/MongoDB<br/>Cache + NoSQL"]
E4["File System<br/>JSON + FAISS"]
end
A1 --> B1
A2 --> B1
B1 --> B2
B2 --> B3
B3 --> B4
B4 --> C1
C1 --> C2
C1 --> C3
C2 --> C4
C1 --> D1
C1 --> D2
C1 --> D3
D3 --> E1
D3 --> E2
D3 --> E3
D3 --> E4
style A1 fill:#E6F3FF
style B1 fill:#FFE6E6
style C1 fill:#E6FFE6
style D1 fill:#FFF5E6
style E1 fill:#FFE6F5
```
### Layer Responsibilities
**Presentation Layer**: Handles user interactions through a React-based WebUI and provides REST API client capabilities. Responsible for rendering data, handling user input, and managing client-side state. Written in TypeScript with React, this layer communicates with the API Gateway exclusively through HTTP/HTTPS.
**API Gateway Layer**: Manages all external communication with the system. Implements authentication and authorization using JWT tokens and API keys, validates incoming requests using Pydantic models, handles rate limiting, and routes requests to appropriate handlers. Built with FastAPI, it provides automatic OpenAPI documentation and request/response validation.
**Business Logic Layer**: Contains the core intelligence of LightRAG. The LightRAG class orchestrates all operations, managing document processing pipelines, query execution, and storage coordination. The Operations module handles entity extraction, graph merging, and retrieval algorithms. Utilities provide helper functions for text processing, tokenization, hashing, and caching. Prompt templates define structured prompts for LLM interactions.
**Integration Layer**: Abstracts external dependencies through consistent interfaces. LLM provider adapters normalize different API formats (OpenAI, Anthropic, Ollama, etc.) into a common interface. Embedding provider adapters handle various embedding services. Storage adapters implement the abstract storage interfaces (BaseKVStorage, BaseVectorStorage, BaseGraphStorage, DocStatusStorage) for different backends.
**Infrastructure Layer**: Provides the foundational data persistence and retrieval capabilities. Supports multiple database systems including PostgreSQL (with pgvector for vector storage), Neo4j and Memgraph (for graph storage), Redis and MongoDB (for caching and document storage), and file-based storage (JSON files, FAISS indexes) for development and small deployments.
## Component Interaction Architecture
This diagram illustrates how major components interact during typical operations, showing both document indexing and query execution flows.
```mermaid
graph LR
subgraph "External Systems"
LLM["LLM Service<br/>(OpenAI/Ollama)"]
Embed["Embedding Service"]
end
subgraph "Core Components"
RAG["LightRAG<br/>Core Engine"]
OPS["Operations<br/>Module"]
PIPE["Pipeline<br/>Manager"]
end
subgraph "Storage System"
KV["KV Storage<br/>Chunks/Cache"]
VEC["Vector Storage<br/>Embeddings"]
GRAPH["Graph Storage<br/>Entities/Relations"]
STATUS["Status Storage<br/>Pipeline State"]
end
subgraph "Processing Components"
CHUNK["Chunking<br/>Engine"]
EXTRACT["Entity<br/>Extraction"]
MERGE["Graph<br/>Merging"]
QUERY["Query<br/>Engine"]
end
RAG --> PIPE
PIPE --> CHUNK
CHUNK --> EXTRACT
EXTRACT --> OPS
OPS --> LLM
OPS --> Embed
OPS --> MERGE
MERGE --> VEC
MERGE --> GRAPH
CHUNK --> KV
PIPE --> STATUS
RAG --> QUERY
QUERY --> OPS
QUERY --> VEC
QUERY --> GRAPH
QUERY --> KV
QUERY --> LLM
style RAG fill:#E6FFE6
style LLM fill:#FFF5E6
style KV fill:#FFE6E6
style VEC fill:#FFE6E6
style GRAPH fill:#FFE6E6
style STATUS fill:#FFE6E6
```
### Component Interaction Patterns
**Document Ingestion Pattern**: Client submits documents to the API Gateway, which authenticates and validates the request before passing it to the LightRAG core. The core initializes a pipeline instance with a unique track ID, stores the document in KV storage, and updates its status in Status storage. The Pipeline Manager coordinates the chunking, extraction, merging, and indexing stages, maintaining progress information throughout.
**Entity Extraction Pattern**: The Operations module receives text chunks from the Pipeline Manager and constructs prompts using templates from prompt.py. These prompts are sent to the configured LLM service, which returns structured entity and relationship data. The Operations module parses the response, normalizes entity names, and prepares data for graph merging.
**Graph Merging Pattern**: When new entities and relationships are extracted, the Merge component compares them against existing graph data. For matching entities (based on name similarity), it consolidates descriptions, merges metadata, and updates source references. For relationships, it deduplicates based on source-target pairs and aggregates weights. The merged data is then stored in Graph storage and vector representations are computed and stored in Vector storage.
**Query Execution Pattern**: The Query Engine receives a user query and determines the appropriate retrieval strategy based on the query mode. It extracts high-level and low-level keywords using the LLM, retrieves relevant entities and relationships from Graph storage, fetches related chunks from Vector storage, builds a context respecting token budgets, and finally generates a response using the LLM with the assembled context.
## Document Indexing Data Flow
This sequence diagram details the complete flow of document processing from ingestion to indexing.
```mermaid
sequenceDiagram
participant Client
participant API as API Server
participant Core as LightRAG Core
participant Pipeline
participant Chunking
participant Extraction
participant Merging
participant KV as KV Storage
participant Vec as Vector Storage
participant Graph as Graph Storage
participant Status as Status Storage
participant LLM as LLM Service
participant Embed as Embedding Service
Client->>API: POST /documents/upload
API->>API: Authenticate & Validate
API->>Core: ainsert(document, file_path)
Note over Core: Initialization Phase
Core->>Core: Generate track_id
Core->>Status: Create doc status (PENDING)
Core->>KV: Store document content
Core->>Pipeline: apipeline_process_enqueue_documents()
Note over Pipeline: Chunking Phase
Pipeline->>Status: Update status (CHUNKING)
Pipeline->>Chunking: chunking_by_token_size()
Chunking->>Chunking: Tokenize & split by overlap
Chunking-->>Pipeline: chunks[]
Pipeline->>KV: Store chunks with metadata
Note over Pipeline: Extraction Phase
Pipeline->>Status: Update status (EXTRACTING)
loop For each chunk
Pipeline->>Extraction: extract_entities(chunk)
Extraction->>LLM: Generate entities/relations
LLM-->>Extraction: Structured output
Extraction->>Extraction: Parse & normalize
Extraction-->>Pipeline: entities[], relations[]
end
Note over Pipeline: Merging Phase
Pipeline->>Status: Update status (MERGING)
Pipeline->>Merging: merge_nodes_and_edges()
par Parallel Entity Processing
loop For each entity
Merging->>Graph: Check if entity exists
alt Entity exists
Merging->>LLM: Summarize descriptions
LLM-->>Merging: Merged description
Merging->>Graph: Update entity
else New entity
Merging->>Graph: Insert entity
end
Merging->>Embed: Generate embedding
Embed-->>Merging: embedding vector
Merging->>Vec: Store entity embedding
end
and Parallel Relationship Processing
loop For each relationship
Merging->>Graph: Check if relation exists
alt Relation exists
Merging->>Graph: Update weight & metadata
else New relation
Merging->>Graph: Insert relation
end
Merging->>Embed: Generate embedding
Embed-->>Merging: embedding vector
Merging->>Vec: Store relation embedding
end
end
Note over Pipeline: Indexing Phase
Pipeline->>Status: Update status (INDEXING)
Pipeline->>Vec: Index chunk embeddings
Pipeline->>Graph: Build graph indices
Pipeline->>KV: Commit cache
Note over Pipeline: Completion
Pipeline->>Status: Update status (COMPLETED)
Pipeline-->>Core: Success
Core-->>API: track_id, status
API-->>Client: 200 OK {track_id}
Note over Client: Client can poll /status/{track_id}
```
### Indexing Phase Details
**Document Reception and Validation**: The API server receives the document, validates the file format and size, authenticates the request, and generates a unique track ID for monitoring. The document content is immediately stored in KV storage with metadata including file path, upload timestamp, and original filename.
**Chunking Strategy**: Documents are split into overlapping chunks using a token-based approach. The system tokenizes the entire document using tiktoken, creates chunks of configurable size (default 1200 tokens), adds overlap between consecutive chunks (default 100 tokens) to preserve context, and stores each chunk with position metadata and references to the source document.
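A minimal TypeScript sketch of this chunking strategy, assuming a tiktoken-style tokenizer with `encode`/`decode` methods (the `Tokenizer` interface here is illustrative, not a real library type):
```typescript
interface Tokenizer {
  encode(text: string): number[];
  decode(tokens: number[]): string;
}

interface Chunk {
  tokens: number;
  content: string;
  chunk_order_index: number;
}

function chunkByTokenSize(
  tokenizer: Tokenizer,
  content: string,
  maxTokens = 1200, // default chunk size described above
  overlap = 100,    // tokens shared between consecutive chunks
): Chunk[] {
  const tokens = tokenizer.encode(content);
  const chunks: Chunk[] = [];
  // Stepping by (maxTokens - overlap) reproduces the overlapping windows.
  const step = maxTokens - overlap;

  for (let start = 0, index = 0; start < tokens.length; start += step, index++) {
    const slice = tokens.slice(start, start + maxTokens);
    chunks.push({
      tokens: slice.length,
      content: tokenizer.decode(slice),
      chunk_order_index: index,
    });
    if (start + maxTokens >= tokens.length) break; // final chunk emitted
  }
  return chunks;
}
```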
**Entity and Relationship Extraction**: For each chunk, the system constructs a specialized prompt that instructs the LLM to identify entities with specific types and relationships between them. The LLM returns structured output in a specific format (entity|name|type|description for entities, relation|source|target|keywords|description for relationships). The system parses this output, normalizes entity names using case-insensitive matching, and validates the structure before proceeding.
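A hedged parsing sketch for the pipe-delimited entity records described above; the exact delimiters and field order should be verified against `prompt.py` before reuse:
```typescript
interface ExtractedEntity {
  name: string;
  type: string;
  description: string;
}

function parseEntityRecord(line: string): ExtractedEntity | null {
  const fields = line.split('|').map((f) => f.trim());
  if (fields.length !== 4 || fields[0] !== 'entity') return null;
  const [, name, type, description] = fields;
  if (!name || !description) return null; // reject malformed LLM output
  // Normalize names so case-insensitive duplicates merge later.
  return { name: toTitleCase(name), type, description };
}

function toTitleCase(s: string): string {
  return s.replace(/\w\S*/g, (w) => w[0].toUpperCase() + w.slice(1).toLowerCase());
}
```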
**Graph Construction and Merging**: New entities are compared against existing entities in the graph using fuzzy matching. When duplicates are found, descriptions are merged using either LLM-based summarization (for complex cases) or simple concatenation (for simple cases). Relationships are deduplicated based on source-target pairs, with weights aggregated when duplicates are found. All graph modifications are protected by keyed locks to ensure consistency in concurrent operations.
**Vector Embedding Generation**: Entity descriptions, relationship descriptions, and chunk content are sent to the embedding service in batches for efficient processing. Embeddings are generated using the configured model (e.g., text-embedding-3-small for OpenAI), and vectors are stored in Vector storage with metadata linking back to their source entities, relationships, or chunks. The system uses semaphores to limit concurrent embedding requests and prevent rate limit errors.
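The batching-plus-semaphore pattern might look like the following sketch, where `embedBatch` stands in for a provider call (a hypothetical name) and the concurrency cap plays the role of `embedding_func_max_async`:
```typescript
import pLimit from 'p-limit';

// Hypothetical provider call returning one embedding per input text.
declare function embedBatch(texts: string[]): Promise<number[][]>;

async function embedAll(
  texts: string[],
  batchSize = 32,
  maxConcurrent = 8, // analogue of embedding_func_max_async
): Promise<number[][]> {
  const limit = pLimit(maxConcurrent);
  const batches: string[][] = [];
  for (let i = 0; i < texts.length; i += batchSize) {
    batches.push(texts.slice(i, i + batchSize));
  }
  // Batches run concurrently, but never more than maxConcurrent at once.
  const results = await Promise.all(batches.map((b) => limit(() => embedBatch(b))));
  return results.flat();
}
```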
**Status Tracking Throughout**: Every stage updates the document status in Status storage, recording the current phase, progress percentage, error messages if any, and timing information. This enables clients to poll for progress and provides diagnostic information for debugging failed indexing operations.
## Query Processing Data Flow
This sequence diagram illustrates the retrieval and response generation process for different query modes.
```mermaid
sequenceDiagram
participant Client
participant API as API Server
participant Core as LightRAG Core
participant Query as Query Engine
participant KW as Keyword Extractor
participant Graph as Graph Storage
participant Vec as Vector Storage
participant KV as KV Storage
participant Context as Context Builder
participant LLM as LLM Service
participant Rerank as Rerank Service
Client->>API: POST /query (query, mode, params)
API->>API: Authenticate & Validate
API->>Core: aquery(query, QueryParam)
Core->>Query: Execute query
Note over Query: Keyword Extraction Phase
Query->>KW: Extract keywords
KW->>LLM: Generate high/low level keywords
LLM-->>KW: {hl_keywords[], ll_keywords[]}
KW-->>Query: keywords
alt Mode: local (Entity-centric)
Note over Query: Local Mode - Focus on Entities
Query->>Vec: Query entity vectors (ll_keywords)
Vec-->>Query: top_k entity_ids[]
Query->>Graph: Get entities by IDs
Graph-->>Query: entities[]
Query->>Graph: Get connected relations
Graph-->>Query: relations[]
else Mode: global (Relationship-centric)
Note over Query: Global Mode - Focus on Relations
Query->>Vec: Query relation vectors (hl_keywords)
Vec-->>Query: top_k relation_ids[]
Query->>Graph: Get relations by IDs
Graph-->>Query: relations[]
Query->>Graph: Get connected entities
Graph-->>Query: entities[]
else Mode: hybrid
Note over Query: Hybrid Mode - Combined
par Parallel Retrieval
Query->>Vec: Query entity vectors
Vec-->>Query: entity_ids[]
and
Query->>Vec: Query relation vectors
Vec-->>Query: relation_ids[]
end
Query->>Graph: Get entities and relations
Graph-->>Query: entities[], relations[]
Query->>Query: Merge with round-robin
else Mode: mix
Note over Query: Mix Mode - KG + Chunks
par Parallel Retrieval
Query->>Vec: Query entity vectors
Vec-->>Query: entity_ids[]
Query->>Graph: Get entities
Graph-->>Query: entities[]
and
Query->>Vec: Query chunk vectors
Vec-->>Query: chunk_ids[]
end
else Mode: naive
Note over Query: Naive Mode - Pure Vector
Query->>Vec: Query chunk vectors only
Vec-->>Query: top_k chunk_ids[]
end
Note over Query: Chunk Retrieval Phase
alt Mode != bypass
Query->>Query: Get related chunks from entities/relations
Query->>KV: Get chunks by IDs
KV-->>Query: chunks[]
opt Rerank enabled
Query->>Rerank: Rerank chunks
Rerank-->>Query: reranked_chunks[]
end
end
Note over Query: Context Building Phase
Query->>Context: Build context with token budget
Context->>Context: Allocate tokens (entities/relations/chunks)
Context->>Context: Truncate to fit budget
Context->>Context: Format entities/relations/chunks
Context-->>Query: context_string, references[]
Note over Query: Response Generation Phase
Query->>Query: Build prompt with context
opt Include conversation history
Query->>Query: Add history messages
end
alt Stream enabled
Query->>LLM: Stream generate (prompt, context)
loop Streaming chunks
LLM-->>Query: chunk
Query-->>API: chunk
API-->>Client: SSE chunk
end
else Stream disabled
Query->>LLM: Generate (prompt, context)
LLM-->>Query: response
Query-->>Core: response, references
Core-->>API: {response, references, metadata}
API-->>Client: 200 OK {response}
end
```
### Query Processing Phase Details
**Keyword Extraction Phase**: The system sends the user query to the LLM with a specialized prompt that asks for high-level keywords (abstract concepts and themes) and low-level keywords (specific entities and terms). The LLM returns structured JSON with both keyword types, which guide the subsequent retrieval strategy. This two-level keyword approach enables the system to retrieve both broad contextual information and specific detailed facts.
**Mode-Specific Retrieval Strategies**:
*Local Mode* focuses on entity-centric retrieval by querying the vector storage using low-level keywords to find the most relevant entities, retrieving full entity details including descriptions and metadata, and then fetching all relationships connected to those entities. This mode is optimal for questions about specific entities or localized information.
*Global Mode* emphasizes relationship-centric retrieval by querying vector storage using high-level keywords to find relevant relationships, retrieving relationship details including keywords and descriptions, and then fetching the entities connected by those relationships. This mode excels at questions about connections, patterns, and higher-level concepts.
*Hybrid Mode* combines both approaches by running local and global retrieval in parallel and then merging results using a round-robin strategy to balance entity and relationship information. This provides comprehensive coverage for complex queries that require both types of information.
*Mix Mode* integrates knowledge graph retrieval with direct chunk retrieval by querying entity vectors to get graph-based context, simultaneously querying chunk vectors for relevant document sections, and combining both types of results. This mode provides the most complete context by including both structured knowledge and raw document content.
*Naive Mode* performs pure vector similarity search without using the knowledge graph, simply retrieving the most similar chunks based on embedding distance. This mode is fastest and works well for simple similarity-based retrieval without needing entity or relationship context.
*Bypass Mode* skips retrieval entirely and sends the query directly to the LLM, useful for general questions that don't require specific document context or when testing the LLM's base knowledge.
**Context Building with Token Budgets**: The system implements a sophisticated token budget management system that allocates a maximum number of tokens across different context components. It allocates tokens to entity descriptions (default 6000 tokens), relationship descriptions (default 8000 tokens), and chunk content (remaining budget, with a cap defined by chunk_top_k). The system truncates each component to fit within its budget using the tokenizer, prioritizing higher-ranked items when truncation is necessary, and ensures the total context doesn't exceed the max_total_tokens limit (default 30000 tokens).
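A simplified sketch of this budget allocation, using the default limits quoted above (the `Tokenizer` interface again stands in for a tiktoken-style tokenizer):
```typescript
interface Tokenizer {
  encode(text: string): number[];
  decode(tokens: number[]): string;
}

function truncateToTokens(tokenizer: Tokenizer, text: string, budget: number): string {
  const tokens = tokenizer.encode(text);
  return tokens.length <= budget ? text : tokenizer.decode(tokens.slice(0, budget));
}

function buildContext(
  tokenizer: Tokenizer,
  entities: string,
  relations: string,
  chunks: string,
  maxTotalTokens = 30_000,
): string {
  // Per-component budgets from the defaults described above.
  const entityPart = truncateToTokens(tokenizer, entities, 6_000);
  const relationPart = truncateToTokens(tokenizer, relations, 8_000);
  // Chunks receive whatever budget remains after entities and relations.
  const used =
    tokenizer.encode(entityPart).length + tokenizer.encode(relationPart).length;
  const chunkPart = truncateToTokens(tokenizer, chunks, Math.max(0, maxTotalTokens - used));
  return [entityPart, relationPart, chunkPart].join('\n\n');
}
```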
**Reranking for Improved Relevance**: When enabled, the reranking phase takes retrieved chunks and reranks them using a specialized reranking model (like Cohere rerank or Jina rerank). This cross-encoder approach provides more accurate relevance scoring than pure vector similarity, especially for semantic matching. Chunks below the minimum rerank score threshold are filtered out, and only the top-k chunks after reranking are included in the final context.
**Response Generation with Streaming**: For streaming responses, the system establishes a connection to the LLM with stream=True, receives response tokens incrementally, and immediately forwards them to the client via Server-Sent Events (SSE). This provides real-time feedback to users and reduces perceived latency. For non-streaming responses, the system waits for the complete LLM response before returning it to the client along with metadata about entities, relationships, and chunks used in the context.
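A hedged Fastify sketch of the streaming path; `llmStream` is a hypothetical async iterable of tokens (the OpenAI v4 SDK returns a comparable iterable when `stream: true`):
```typescript
import Fastify from 'fastify';

// Hypothetical streaming LLM call yielding response tokens.
declare function llmStream(prompt: string): AsyncIterable<string>;

const app = Fastify();

app.post('/query/stream', async (request, reply) => {
  reply.hijack(); // take over the raw response so Fastify stops managing it
  reply.raw.writeHead(200, {
    'Content-Type': 'text/event-stream',
    'Cache-Control': 'no-cache',
    Connection: 'keep-alive',
  });

  // Forward each LLM token to the client as an SSE data frame.
  const { query } = request.body as { query: string };
  for await (const token of llmStream(query)) {
    reply.raw.write(`data: ${JSON.stringify({ token })}\n\n`);
  }
  reply.raw.end();
});
```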
## Storage Layer Architecture
The storage layer implements a plugin architecture with abstract base classes defining the contract for each storage type.
```mermaid
graph TB
subgraph "Abstract Interfaces"
BASE["StorageNameSpace<br/>(Base Class)"]
KV_BASE["BaseKVStorage"]
VEC_BASE["BaseVectorStorage"]
GRAPH_BASE["BaseGraphStorage"]
STATUS_BASE["DocStatusStorage"]
end
subgraph "KV Storage Implementations"
JSON_KV["JsonKVStorage<br/>(File-based)"]
PG_KV["PGKVStorage<br/>(PostgreSQL)"]
MONGO_KV["MongoKVStorage<br/>(MongoDB)"]
REDIS_KV["RedisKVStorage<br/>(Redis)"]
end
subgraph "Vector Storage Implementations"
NANO["NanoVectorDBStorage<br/>(In-memory)"]
FAISS["FaissVectorDBStorage<br/>(FAISS)"]
PG_VEC["PGVectorStorage<br/>(pgvector)"]
MILVUS["MilvusVectorStorage<br/>(Milvus)"]
QDRANT["QdrantVectorStorage<br/>(Qdrant)"]
end
subgraph "Graph Storage Implementations"
NX["NetworkXStorage<br/>(NetworkX)"]
NEO4J["Neo4jStorage<br/>(Neo4j)"]
MEMGRAPH["MemgraphStorage<br/>(Memgraph)"]
PG_GRAPH["PGGraphStorage<br/>(PostgreSQL)"]
end
subgraph "Doc Status Implementations"
JSON_STATUS["JsonDocStatusStorage<br/>(File-based)"]
PG_STATUS["PGDocStatusStorage<br/>(PostgreSQL)"]
MONGO_STATUS["MongoDocStatusStorage<br/>(MongoDB)"]
end
BASE --> KV_BASE
BASE --> VEC_BASE
BASE --> GRAPH_BASE
BASE --> STATUS_BASE
KV_BASE --> JSON_KV
KV_BASE --> PG_KV
KV_BASE --> MONGO_KV
KV_BASE --> REDIS_KV
VEC_BASE --> NANO
VEC_BASE --> FAISS
VEC_BASE --> PG_VEC
VEC_BASE --> MILVUS
VEC_BASE --> QDRANT
GRAPH_BASE --> NX
GRAPH_BASE --> NEO4J
GRAPH_BASE --> MEMGRAPH
GRAPH_BASE --> PG_GRAPH
STATUS_BASE --> JSON_STATUS
STATUS_BASE --> PG_STATUS
STATUS_BASE --> MONGO_STATUS
style BASE fill:#E6FFE6
style JSON_KV fill:#E6F3FF
style NANO fill:#FFE6E6
style NX fill:#FFF5E6
style JSON_STATUS fill:#FFE6F5
```
### Storage Interface Contracts
**BaseKVStorage Interface**: Key-value storage manages cached data, text chunks, and full documents. Core methods include get_by_id(id) for retrieving a single value, get_by_ids(ids) for batch retrieval, filter_keys(keys) to check which keys don't exist, upsert(data) for inserting or updating entries, delete(ids) for removing entries, and index_done_callback() for persisting changes to disk. This interface supports both in-memory implementations with persistence and direct database implementations.
**BaseVectorStorage Interface**: Vector storage handles embeddings for entities, relationships, and chunks. Core methods include query(query, top_k, query_embedding) for similarity search, upsert(data) for storing vectors with metadata, delete(ids) for removing vectors, delete_entity(entity_name) for removing entity-related vectors, delete_entity_relation(entity_name) for removing relationship vectors, get_by_id(id) and get_by_ids(ids) for retrieving full vector data, and get_vectors_by_ids(ids) for efficient vector-only retrieval. All implementations must support cosine similarity search and metadata filtering.
**BaseGraphStorage Interface**: Graph storage maintains the entity-relationship graph structure. Core methods include has_node(node_id) and has_edge(source, target) for existence checks, node_degree(node_id) and edge_degree(src, tgt) for connectivity metrics, get_node(node_id) and upsert_node(node_id, data) for node operations, upsert_edge(source, target, data) for relationship operations, get_knowledge_graph(node_label, max_depth, max_nodes) for graph traversal and export, and delete operations for nodes and edges. The interface treats all relationships as undirected unless explicitly specified.
**DocStatusStorage Interface**: Document status storage tracks processing pipeline state for each document. Core methods include upsert(status) for updating document status, get_by_id(doc_id) for retrieving status, get_by_ids(doc_ids) for batch retrieval, filter_ids(doc_ids) for checking existence, delete(doc_ids) for cleanup, get_by_status(status) for finding documents in a specific state, and count_by_status() for pipeline metrics. This enables comprehensive monitoring and recovery of document processing operations.
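One plausible TypeScript rendering of the KV contract above, with method names camelCased from the Python interface and the generic parameter added as an assumption:
```typescript
interface BaseKVStorage<T = Record<string, unknown>> {
  getById(id: string): Promise<T | null>;
  getByIds(ids: string[]): Promise<(T | null)[]>;
  /** Returns the subset of `keys` that do NOT yet exist in storage. */
  filterKeys(keys: Set<string>): Promise<Set<string>>;
  upsert(data: Record<string, T>): Promise<void>;
  delete(ids: string[]): Promise<void>;
  /** Persists in-memory changes; a no-op for direct database backends. */
  indexDoneCallback(): Promise<void>;
}
```
The other three contracts translate the same way, with graph methods such as `hasNode` or `upsertEdge` keeping the semantics documented above.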
### Storage Implementation Patterns
**File-Based Storage (JSON)**: Simple implementations store data in JSON files with in-memory caching for performance. All modifications are held in memory until index_done_callback() triggers a write to disk. These implementations are suitable for development, small deployments, and single-process scenarios. They provide atomic writes using temporary files and rename operations, handle concurrent access through file locking, and support workspace isolation through directory structure.
**PostgreSQL Storage**: Comprehensive implementations that leverage PostgreSQL's capabilities including JSON columns for flexible metadata, pgvector extension for vector similarity search, advisory locks for distributed coordination, connection pooling for performance, and transaction support for consistency. PostgreSQL implementations can handle all four storage types in a single database, simplifying deployment and backup. They support multi-tenant deployments through schema-based isolation and provide excellent performance for mixed workloads.
**Specialized Vector Databases**: Dedicated vector storage implementations like FAISS, Milvus, and Qdrant provide optimized vector similarity search with features like approximate nearest neighbor (ANN) search, GPU acceleration for large-scale similarity search, advanced indexing strategies (IVF, HNSW), and high-performance batch operations. These are recommended for deployments with large document sets (>1M chunks) or high query throughput requirements.
**Graph Databases (Neo4j/Memgraph)**: Specialized graph implementations optimize graph traversal and pattern matching with native graph storage and indexing, Cypher query language for complex graph queries, visualization capabilities for knowledge graph exploration, and optimized algorithms for shortest path, centrality, and community detection. These are ideal for use cases requiring complex graph analytics and when graph visualization is a primary feature.
## Concurrency and State Management
LightRAG implements sophisticated concurrency control to handle parallel document processing and query execution.
```mermaid
graph TB
subgraph "Concurrency Control"
SEM1["Semaphore<br/>LLM Calls<br/>(max_async)"]
SEM2["Semaphore<br/>Embeddings<br/>(embedding_func_max_async)"]
SEM3["Semaphore<br/>Graph Merging<br/>(graph_max_async)"]
LOCK1["Keyed Locks<br/>Entity Processing"]
LOCK2["Keyed Locks<br/>Relation Processing"]
LOCK3["Pipeline Status Lock"]
end
subgraph "State Management"
GLOBAL["Global Config<br/>workspace, paths, settings"]
PIPELINE["Pipeline Status<br/>current job, progress"]
NAMESPACE["Namespace Data<br/>storage instances, locks"]
end
subgraph "Task Coordination"
QUEUE["Task Queue<br/>Document Processing"]
PRIORITY["Priority Limiter<br/>Async Function Calls"]
TRACK["Track ID System<br/>Monitoring & Logging"]
end
SEM1 --> PRIORITY
SEM2 --> PRIORITY
SEM3 --> PRIORITY
LOCK1 --> NAMESPACE
LOCK2 --> NAMESPACE
LOCK3 --> PIPELINE
QUEUE --> TRACK
PRIORITY --> TRACK
GLOBAL --> NAMESPACE
PIPELINE --> TRACK
style SEM1 fill:#FFE6E6
style GLOBAL fill:#E6FFE6
style QUEUE fill:#E6F3FF
```
### Concurrency Patterns
**Semaphore-Based Rate Limiting**: The system uses asyncio semaphores to limit concurrent operations and prevent overwhelming external services or exhausting resources. Different semaphores control different types of operations: LLM calls are limited by max_async (default 4), embedding function calls by embedding_func_max_async (default 8), and graph merging operations by graph_max_async (calculated as llm_model_max_async * 2). These semaphores ensure respectful API usage and prevent rate limit errors.
**Keyed Locks for Data Consistency**: When processing entities and relationships concurrently, the system uses keyed locks to ensure that multiple processes don't modify the same entity or relationship simultaneously. Each entity or relationship gets a unique lock based on its identifier, preventing race conditions during graph merging while still allowing parallel processing of different entities. This pattern enables high concurrency without sacrificing data consistency.
**Pipeline Status Tracking**: A shared pipeline status object tracks the current state of document processing including the active job name, number of documents being processed, current batch number, latest status message, and history of messages for debugging. This status is protected by an async lock and can be queried by clients to monitor progress. The status persists across the entire pipeline and provides visibility into long-running operations.
**Workspace Isolation**: The workspace concept provides multi-tenant isolation by prefixing all storage namespaces with a workspace identifier. Different workspaces maintain completely separate data including separate storage instances, independent configuration, isolated locks and semaphores, and separate pipeline status. This enables running multiple LightRAG instances in the same infrastructure without interference.
### TypeScript Migration Considerations for Concurrency
**Semaphore Implementation**: Node.js doesn't have built-in semaphores, but the pattern can be implemented using the p-limit library, which provides similar functionality with a cleaner API. Example: `const limiter = pLimit(4); await limiter(() => callLLM())`.
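Expanding that one-liner into a fuller sketch (with `callLLM` again hypothetical, and the limit of 4 mirroring the Python `max_async` default):
```typescript
import pLimit from 'p-limit';

declare function callLLM(prompt: string): Promise<string>;

const llmLimiter = pLimit(4); // at most 4 concurrent LLM calls

async function extractEntities(chunks: string[]): Promise<string[]> {
  // All tasks are scheduled immediately, but the limiter releases them to
  // callLLM at most four at a time, like an asyncio.Semaphore(4).
  return Promise.all(chunks.map((chunk) => llmLimiter(() => callLLM(chunk))));
}
```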
**Keyed Locks**: For single-process deployments, a Map<string, Promise> can implement keyed locks. For multi-process deployments, consider using Redis with the Redlock algorithm or a dedicated lock service. The key insight is ensuring that operations on the same entity/relationship are serialized while different entities can be processed in parallel.
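A single-process keyed-lock sketch along those lines, chaining a promise per key; it is not a substitute for Redlock in multi-process deployments:
```typescript
class KeyedLock {
  private tails = new Map<string, Promise<void>>();

  async withLock<T>(key: string, fn: () => Promise<T>): Promise<T> {
    const previous = this.tails.get(key) ?? Promise.resolve();
    let release!: () => void;
    const current = new Promise<void>((resolve) => { release = resolve; });
    this.tails.set(key, current);
    await previous;      // wait for earlier holders of this key
    try {
      return await fn(); // critical section: one holder per key at a time
    } finally {
      release();         // let the next waiter on this key proceed
      // Drop the entry once no one else is queued behind us.
      if (this.tails.get(key) === current) this.tails.delete(key);
    }
  }
}
```
Operations on the same key serialize, while different keys proceed in parallel, matching the entity/relationship merging requirement.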
**Shared State Management**: Python's global dictionaries need to be replaced with class-based state management in TypeScript. For multi-process deployments, shared state should be externalized to Redis or a similar store. For single-process deployments, singleton classes can manage state with proper TypeScript visibility controls.
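A minimal sketch of singleton-style state with workspace-prefixed namespaces (the class name, `:` separator, and key layout are illustrative assumptions):
```typescript
// Minimal sketch: singleton state manager with workspace-prefixed namespaces.
class SharedState {
  private static instance: SharedState;
  private namespaces = new Map<string, unknown>();

  private constructor(private workspace: string) {}

  static init(workspace = ''): SharedState {
    if (!SharedState.instance) SharedState.instance = new SharedState(workspace);
    return SharedState.instance;
  }

  // Workspace prefix keeps tenants isolated within one process.
  private key(namespace: string): string {
    return this.workspace ? `${this.workspace}:${namespace}` : namespace;
  }

  get<T>(namespace: string, create: () => T): T {
    const k = this.key(namespace);
    if (!this.namespaces.has(k)) this.namespaces.set(k, create());
    return this.namespaces.get(k) as T;
  }
}

const state = SharedState.init('tenant_a');
const locks = state.get('locks', () => new Map<string, Promise<unknown>>());
```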
**Pipeline Status Updates**: Real-time status updates can be implemented using EventEmitter in Node.js for in-process communication, or Redis Pub/Sub for multi-process scenarios. WebSocket connections can provide real-time updates to clients without polling.
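A minimal in-process sketch using Node's EventEmitter (field names are assumptions mirroring the pipeline status object described earlier):
```typescript
import { EventEmitter } from 'node:events';

// Field names are illustrative, mirroring the status fields described above.
interface PipelineStatus {
  job_name: string;
  docs: number;
  batch: number;
  latest_message: string;
  history_messages: string[];
}

class PipelineStatusTracker extends EventEmitter {
  private status: PipelineStatus = {
    job_name: '', docs: 0, batch: 0, latest_message: '', history_messages: [],
  };

  update(patch: Partial<PipelineStatus>, message?: string): void {
    Object.assign(this.status, patch);
    if (message) {
      this.status.latest_message = message;
      this.status.history_messages.push(message);
    }
    // Notify subscribers (e.g., a WebSocket bridge for client updates).
    this.emit('status', this.snapshot());
  }

  snapshot(): PipelineStatus {
    return { ...this.status, history_messages: [...this.status.history_messages] };
  }
}

const tracker = new PipelineStatusTracker();
tracker.on('status', (s) => console.log(`[${s.job_name}] ${s.latest_message}`));
tracker.update({ job_name: 'indexing', docs: 12 }, 'Chunking 12 documents');
```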
## Summary
This architecture documentation provides a comprehensive view of LightRAG's design, from high-level layer organization to detailed component interactions and concurrency patterns. The system's modular design, with clear interfaces and abstractions, makes it well-suited for migration to TypeScript. The key architectural principles—layered separation of concerns, plugin-based storage abstraction, async-first concurrency, and comprehensive state tracking—translate well to TypeScript and Node.js idioms.
The subsequent documentation sections build on this architectural foundation, providing detailed specifications for data models, storage implementations, LLM integrations, and API contracts. Together, these documents form a complete blueprint for implementing a production-ready TypeScript version of LightRAG.

View file

@ -0,0 +1,890 @@
# Data Models and Schemas: LightRAG Type System
## Table of Contents
1. [Core Data Models](#core-data-models)
2. [Storage Schema Definitions](#storage-schema-definitions)
3. [Query and Response Models](#query-and-response-models)
4. [Configuration Models](#configuration-models)
5. [TypeScript Type Mapping](#typescript-type-mapping)
## Core Data Models
### Text Chunk Schema
Text chunks are the fundamental unit of document processing in LightRAG. Documents are split into overlapping chunks that preserve context while fitting within token limits.
**Python Definition** (`lightrag/base.py:75-79`):
```python
class TextChunkSchema(TypedDict):
tokens: int
content: str
full_doc_id: str
chunk_order_index: int
```
**TypeScript Definition**:
```typescript
interface TextChunkSchema {
tokens: number;
content: string;
full_doc_id: string;
chunk_order_index: number;
}
```
**Field Descriptions**:
- `tokens`: Number of tokens in the chunk according to the configured tokenizer (e.g., tiktoken for GPT models)
- `content`: The actual text content of the chunk, UTF-8 encoded
- `full_doc_id`: MD5 hash of the complete document, used as a foreign key to link chunks to their source document
- `chunk_order_index`: Zero-based index indicating the chunk's position in the original document sequence
**Storage Pattern**: Chunks are stored in KV storage with keys following the pattern `{full_doc_id}_{chunk_order_index}`. The chunk content is also embedded and stored in Vector storage for similarity search.
**Validation Rules**:
- `tokens` must be > 0 and typically < 2048
- `content` must not be empty
- `full_doc_id` must be a valid MD5 hash (32 hexadecimal characters)
- `chunk_order_index` must be >= 0
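A short sketch of these ID conventions using Node's built-in `crypto` (the actual LightRAG IDs may carry additional prefixes; only the hashing and key pattern described above are shown):
```typescript
import { createHash } from 'node:crypto';

// full_doc_id is the MD5 of the document content (32 hex chars).
function computeDocId(content: string): string {
  return createHash('md5').update(content, 'utf8').digest('hex');
}

// Chunk keys follow `{full_doc_id}_{chunk_order_index}`.
function chunkKey(fullDocId: string, chunkOrderIndex: number): string {
  return `${fullDocId}_${chunkOrderIndex}`;
}

const docId = computeDocId('Example document text');
chunkKey(docId, 0); // e.g. "ad27…_0"
```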
### Entity Schema
Entities represent key concepts, people, organizations, locations, and other named entities extracted from documents.
**Python Definition** (Implicit in `lightrag/operate.py`):
```python
entity_data = {
"entity_name": str, # Normalized entity name (title case)
"entity_type": str, # One of DEFAULT_ENTITY_TYPES
"description": str, # Consolidated description
"source_id": str, # Chunk IDs joined by GRAPH_FIELD_SEP
"file_path": str, # File paths joined by GRAPH_FIELD_SEP
"created_at": str, # ISO 8601 timestamp
"updated_at": str, # ISO 8601 timestamp
}
```
**TypeScript Definition**:
```typescript
interface EntityData {
entity_name: string;
entity_type: EntityType;
description: string;
source_id: string; // Pipe-separated chunk IDs
file_path: string; // Pipe-separated file paths
created_at: string; // ISO 8601
updated_at: string; // ISO 8601
}
type EntityType =
| "Person"
| "Creature"
| "Organization"
| "Location"
| "Event"
| "Concept"
| "Method"
| "Content"
| "Data"
| "Artifact"
| "NaturalObject"
| "Other";
```
**Field Descriptions**:
- `entity_name`: Normalized name using title case for consistency (e.g., "John Smith", "OpenAI")
- `entity_type`: Classification of the entity using predefined types from `DEFAULT_ENTITY_TYPES`
- `description`: Rich text description of the entity's attributes, activities, and context. May be merged from multiple sources using LLM summarization
- `source_id`: Pipe-separated (`<SEP>`) list of chunk IDs where this entity was mentioned, enabling citation tracking
- `file_path`: Pipe-separated list of source file paths for traceability
- `created_at`: ISO 8601 timestamp when the entity was first created
- `updated_at`: ISO 8601 timestamp when the entity was last modified
**Storage Locations**:
1. Graph Storage: Entity as a node with `entity_name` as the node ID
2. Vector Storage: Entity description embedding with metadata
3. Full Entities KV Storage: Complete entity data for retrieval
**Normalization Rules**:
- Entity names are case-insensitive for matching but stored in title case
- Multiple mentions of the same entity (fuzzy matched) are merged
- Descriptions are consolidated using LLM summarization when they exceed token limits
- `source_id` and `file_path` are deduplicated when merging
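A minimal sketch of these merge rules (the `mergeEntities` helper is illustrative; real merging may invoke LLM summarization once descriptions exceed token limits):
```typescript
// Separator used for pipe-joined fields, per the schema above.
const GRAPH_FIELD_SEP = '<SEP>';

// Dedupe source_id/file_path, keep the earliest created_at, and
// concatenate differing descriptions for later summarization.
function mergeEntities(a: EntityData, b: EntityData): EntityData {
  const dedupe = (x: string, y: string) =>
    [...new Set([...x.split(GRAPH_FIELD_SEP), ...y.split(GRAPH_FIELD_SEP)])]
      .join(GRAPH_FIELD_SEP);
  return {
    ...a,
    description: a.description === b.description
      ? a.description
      : `${a.description}${GRAPH_FIELD_SEP}${b.description}`,
    source_id: dedupe(a.source_id, b.source_id),
    file_path: dedupe(a.file_path, b.file_path),
    created_at: a.created_at < b.created_at ? a.created_at : b.created_at,
    updated_at: new Date().toISOString(),
  };
}
```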
### Relationship Schema
Relationships represent connections between entities, forming the edges of the knowledge graph.
**Python Definition** (Implicit in `lightrag/operate.py`):
```python
relationship_data = {
"src_id": str, # Source entity name
"tgt_id": str, # Target entity name
"description": str, # Relationship description
"keywords": str, # Comma-separated keywords
"weight": float, # Relationship strength (0-1)
"source_id": str, # Chunk IDs joined by GRAPH_FIELD_SEP
"file_path": str, # File paths joined by GRAPH_FIELD_SEP
"created_at": str, # ISO 8601 timestamp
"updated_at": str, # ISO 8601 timestamp
}
```
**TypeScript Definition**:
```typescript
interface RelationshipData {
src_id: string;
tgt_id: string;
description: string;
keywords: string; // Comma-separated
weight: number; // 0.0 to 1.0
source_id: string; // Pipe-separated chunk IDs
file_path: string; // Pipe-separated file paths
created_at: string; // ISO 8601
updated_at: string; // ISO 8601
}
```
**Field Descriptions**:
- `src_id`: Name of the source entity (must match an existing entity)
- `tgt_id`: Name of the target entity (must match an existing entity)
- `description`: Explanation of how and why the entities are related
- `keywords`: High-level keywords summarizing the relationship nature (e.g., "collaboration, project, research")
- `weight`: Numeric weight indicating relationship strength, aggregated when merging duplicates
- `source_id`: Chunk IDs where this relationship was mentioned
- `file_path`: Source file paths for citation
- `created_at`: Creation timestamp
- `updated_at`: Last modification timestamp
**Storage Locations**:
1. Graph Storage: Edge between source and target nodes
2. Vector Storage: Relationship description embedding with metadata
3. Full Relations KV Storage: Complete relationship data
**Validation Rules**:
- Relationships are treated as undirected (bidirectional)
- `src_id` and `tgt_id` must reference existing entities
- `weight` for a single extracted relationship falls between 0.0 and 1.0
- Duplicate relationships (same src_id, tgt_id pair) are merged with weights summed, so a merged weight can exceed 1.0
### Document Processing Status
Tracks the processing state of documents through the ingestion pipeline.
**Python Definition** (`lightrag/base.py:679-724`):
```python
@dataclass
class DocProcessingStatus:
content_summary: str
content_length: int
file_path: str
status: DocStatus
created_at: str
updated_at: str
track_id: str | None = None
chunks_count: int | None = None
chunks_list: list[str] | None = field(default_factory=list)
entities_count: int | None = None
relations_count: int | None = None
batch_number: int | None = None
error_message: str | None = None
```
**TypeScript Definition**:
```typescript
enum DocStatus {
PENDING = "PENDING",
CHUNKING = "CHUNKING",
EXTRACTING = "EXTRACTING",
MERGING = "MERGING",
INDEXING = "INDEXING",
COMPLETED = "COMPLETED",
FAILED = "FAILED"
}
interface DocProcessingStatus {
content_summary: string;
content_length: number;
file_path: string;
status: DocStatus;
created_at: string; // ISO 8601
updated_at: string; // ISO 8601
track_id?: string;
chunks_count?: number;
chunks_list?: string[];
entities_count?: number;
relations_count?: number;
batch_number?: number;
error_message?: string;
}
```
**Field Descriptions**:
- `content_summary`: First 100 characters of document for preview
- `content_length`: Total character length of the document
- `file_path`: Original file path or identifier
- `status`: Current processing stage (see DocStatus enum)
- `created_at`: Document submission timestamp
- `updated_at`: Last status update timestamp
- `track_id`: Optional tracking ID for batch monitoring (shared across multiple documents)
- `chunks_count`: Number of chunks created during splitting
- `chunks_list`: Array of chunk IDs for reference
- `entities_count`: Number of entities extracted
- `relations_count`: Number of relationships extracted
- `batch_number`: Batch identifier for processing order
- `error_message`: Error details if status is FAILED
**State Transitions**:
```
PENDING → CHUNKING → EXTRACTING → MERGING → INDEXING → COMPLETED
   │          │           │           │          │
   └──────────┴───────────┴───────────┴──────────┴──→ FAILED
```
Any stage can transition to FAILED on error; `error_message` is then populated with diagnostic information.
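A minimal sketch of these transitions as a TypeScript lookup table (whether FAILED documents can be retried back to PENDING is not specified here, so the sketch treats FAILED as terminal):
```typescript
// Allowed transitions derived from the pipeline order above.
const NEXT: Record<DocStatus, DocStatus[]> = {
  [DocStatus.PENDING]:    [DocStatus.CHUNKING, DocStatus.FAILED],
  [DocStatus.CHUNKING]:   [DocStatus.EXTRACTING, DocStatus.FAILED],
  [DocStatus.EXTRACTING]: [DocStatus.MERGING, DocStatus.FAILED],
  [DocStatus.MERGING]:    [DocStatus.INDEXING, DocStatus.FAILED],
  [DocStatus.INDEXING]:   [DocStatus.COMPLETED, DocStatus.FAILED],
  [DocStatus.COMPLETED]:  [],
  [DocStatus.FAILED]:     [],
};

function canTransition(from: DocStatus, to: DocStatus): boolean {
  return NEXT[from].includes(to);
}
```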
## Storage Schema Definitions
### KV Storage Schema
KV storage handles three types of data: LLM response cache, text chunks, and full documents.
#### LLM Cache Entry
**Key Format**: `cache:{hash(prompt+model+params)}`
**Value Schema**:
```typescript
interface LLMCacheEntry {
return_message: string;
embedding_dim: number;
model: string;
timestamp: string;
}
```
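A sketch of the key derivation (which parameters enter the hash, and the hash algorithm, are assumptions for illustration):
```typescript
import { createHash } from 'node:crypto';

// Derive a cache key from the prompt, model, and call parameters.
function llmCacheKey(prompt: string, model: string, params: Record<string, unknown>): string {
  const digest = createHash('sha256')
    .update(prompt)
    .update(model)
    .update(JSON.stringify(params))
    .digest('hex');
  return `cache:${digest}`;
}
```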
#### Text Chunk Entry
**Key Format**: `{full_doc_id}_{chunk_order_index}`
**Value Schema**:
```typescript
interface ChunkEntry extends TextChunkSchema {
// Additional metadata can be stored
file_path?: string;
created_at?: string;
}
```
#### Full Document Entry
**Key Format**: Document ID (MD5 hash of content)
**Value Schema**:
```typescript
interface FullDocEntry {
content: string;
file_path?: string;
created_at?: string;
metadata?: Record<string, any>;
}
```
### Vector Storage Schema
Vector storage maintains embeddings for entities, relationships, and chunks.
**Entry Schema**:
```typescript
interface VectorEntry {
id: string; // Unique identifier
vector: number[]; // Embedding vector (e.g., 1536 dimensions for OpenAI)
metadata: {
content: string; // Original text that was embedded
type: "entity" | "relation" | "chunk";
entity_name?: string; // For entities
source_id?: string; // Chunk IDs where this appears
file_path?: string; // Source file paths
[key: string]: any; // Additional metadata
};
}
```
**Index Requirements**:
- Cosine similarity search support
- Efficient top-k retrieval (ANN algorithms recommended for large datasets)
- Metadata filtering capabilities
- Batch upsert and deletion support
**Storage Size Estimates**:
- Entity vectors: ~6KB each (1536 floats × 4 bytes)
- Relationship vectors: ~6KB each
- Chunk vectors: ~6KB each
- 10,000 documents ≈ 100,000 chunks ≈ 600MB vector storage
### Graph Storage Schema
Graph storage maintains the entity-relationship graph structure.
#### Node Schema
**Node ID**: Entity name (normalized, title case)
**Node Properties**:
```typescript
interface GraphNode {
entity_name: string; // Node ID
entity_type: string; // Entity classification
description: string; // Entity description
source_id: string; // Pipe-separated chunk IDs
file_path: string; // Pipe-separated file paths
created_at: string; // ISO 8601
updated_at: string; // ISO 8601
}
```
#### Edge Schema
**Edge ID**: Combination of source and target node IDs (undirected)
**Edge Properties**:
```typescript
interface GraphEdge {
src_id: string; // Source entity name
tgt_id: string; // Target entity name
description: string; // Relationship description
keywords: string; // Comma-separated
weight: number; // 0.0 to 1.0
source_id: string; // Pipe-separated chunk IDs
file_path: string; // Pipe-separated file paths
created_at: string; // ISO 8601
updated_at: string; // ISO 8601
}
```
**Graph Constraints**:
- Undirected edges: (A, B) and (B, A) represent the same relationship
- No self-loops: src_id ≠ tgt_id
- Unique edge constraint: Only one edge per (src_id, tgt_id) pair
- Node must exist before creating edges
#### Query Capabilities Required
- Node existence check: `has_node(node_id)`
- Edge existence check: `has_edge(src_id, tgt_id)`
- Degree calculation: `node_degree(node_id)`, `edge_degree(src_id, tgt_id)`
- Node retrieval: `get_node(node_id)`, `get_nodes_batch(node_ids[])`
- Edge retrieval: `get_edge(src_id, tgt_id)`, `get_edges_batch(pairs[])`
- Neighborhood query: `get_node_edges(node_id)`
- Graph traversal: `get_knowledge_graph(start_node, max_depth, max_nodes)`
- Label listing: `get_all_labels()`
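Expressed as a TypeScript contract, these capabilities might look like the following sketch (method names mirror the list above; the concrete interface may differ):
```typescript
// Sketch of a graph storage contract covering the required capabilities.
interface IGraphStorage {
  hasNode(nodeId: string): Promise<boolean>;
  hasEdge(srcId: string, tgtId: string): Promise<boolean>;
  nodeDegree(nodeId: string): Promise<number>;
  edgeDegree(srcId: string, tgtId: string): Promise<number>;
  getNode(nodeId: string): Promise<GraphNode | null>;
  getNodesBatch(nodeIds: string[]): Promise<(GraphNode | null)[]>;
  getEdge(srcId: string, tgtId: string): Promise<GraphEdge | null>;
  getEdgesBatch(pairs: [string, string][]): Promise<(GraphEdge | null)[]>;
  getNodeEdges(nodeId: string): Promise<GraphEdge[]>;
  getKnowledgeGraph(
    startNode: string,
    maxDepth: number,
    maxNodes: number
  ): Promise<{ nodes: GraphNode[]; edges: GraphEdge[] }>;
  getAllLabels(): Promise<string[]>;
}
```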
### Document Status Storage Schema
Document status storage is a specialized KV storage for tracking pipeline state.
**Key Format**: Document ID (MD5 hash)
**Value Schema**: `DocProcessingStatus` (see above)
**Required Capabilities**:
- Get by ID: `get_by_id(doc_id)`
- Get by IDs: `get_by_ids(doc_ids[])`
- Get by status: `get_by_status(status)` → all documents in that state
- Get by track ID: `get_by_track_id(track_id)` → all documents in that batch
- Status counts: `get_status_counts()` → count of documents in each state
- Upsert: `upsert(doc_id, status_data)`
- Delete: `delete(doc_ids[])`
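A corresponding TypeScript sketch (names mirror the list above; the concrete interface may differ):
```typescript
// Sketch of the document status storage contract.
interface IDocStatusStorage {
  getById(docId: string): Promise<DocProcessingStatus | null>;
  getByIds(docIds: string[]): Promise<(DocProcessingStatus | null)[]>;
  getByStatus(status: DocStatus): Promise<DocProcessingStatus[]>;
  getByTrackId(trackId: string): Promise<DocProcessingStatus[]>;
  getStatusCounts(): Promise<Record<DocStatus, number>>;
  upsert(docId: string, status: DocProcessingStatus): Promise<void>;
  delete(docIds: string[]): Promise<void>;
}
```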
## Query and Response Models
### Query Parameter Model
Comprehensive configuration for query execution.
**Python Definition** (`lightrag/base.py:86-171`):
```python
@dataclass
class QueryParam:
mode: Literal["local", "global", "hybrid", "naive", "mix", "bypass"] = "mix"
only_need_context: bool = False
only_need_prompt: bool = False
response_type: str = "Multiple Paragraphs"
stream: bool = False
top_k: int = 40
chunk_top_k: int = 20
max_entity_tokens: int = 6000
max_relation_tokens: int = 8000
max_total_tokens: int = 30000
hl_keywords: list[str] = field(default_factory=list)
ll_keywords: list[str] = field(default_factory=list)
conversation_history: list[dict[str, str]] = field(default_factory=list)
history_turns: int = 0
model_func: Callable[..., object] | None = None
user_prompt: str | None = None
enable_rerank: bool = True
include_references: bool = False
```
**TypeScript Definition**:
```typescript
type QueryMode = "local" | "global" | "hybrid" | "naive" | "mix" | "bypass";
interface ConversationMessage {
role: "user" | "assistant" | "system";
content: string;
}
interface QueryParam {
mode?: QueryMode;
only_need_context?: boolean;
only_need_prompt?: boolean;
response_type?: string;
stream?: boolean;
top_k?: number;
chunk_top_k?: number;
max_entity_tokens?: number;
max_relation_tokens?: number;
max_total_tokens?: number;
hl_keywords?: string[];
ll_keywords?: string[];
conversation_history?: ConversationMessage[];
history_turns?: number;
model_func?: (...args: any[]) => Promise<any>;
user_prompt?: string;
enable_rerank?: boolean;
include_references?: boolean;
}
```
**Field Descriptions**:
**Retrieval Configuration**:
- `mode`: Query strategy (see Query Processing documentation)
- `top_k`: Number of entities (local) or relations (global) to retrieve
- `chunk_top_k`: Number of text chunks to keep after reranking
**Token Budget**:
- `max_entity_tokens`: Token budget for entity descriptions in context
- `max_relation_tokens`: Token budget for relationship descriptions
- `max_total_tokens`: Total context budget including system prompt
**Keyword Guidance**:
- `hl_keywords`: High-level keywords for global retrieval (themes, concepts)
- `ll_keywords`: Low-level keywords for local retrieval (specific terms)
**Conversation Context**:
- `conversation_history`: Previous messages for multi-turn dialogue
- `history_turns`: Number of conversation turns to include (deprecated, all history sent)
**Response Configuration**:
- `response_type`: Desired format ("Multiple Paragraphs", "Single Paragraph", "Bullet Points", etc.)
- `stream`: Enable streaming responses via SSE
- `user_prompt`: Additional instructions to inject into the LLM prompt
- `enable_rerank`: Use reranking model for chunk relevance scoring
- `include_references`: Include citation information in response
**Debug Options**:
- `only_need_context`: Return retrieved context without LLM generation
- `only_need_prompt`: Return the constructed prompt without generation
### Query Result Model
Unified response structure for all query types.
**Python Definition** (`lightrag/base.py:778-820`):
```python
@dataclass
class QueryResult:
content: Optional[str] = None
response_iterator: Optional[AsyncIterator[str]] = None
raw_data: Optional[Dict[str, Any]] = None
is_streaming: bool = False
```
**TypeScript Definition**:
```typescript
interface QueryResult {
content?: string;
response_iterator?: AsyncIterableIterator<string>;
raw_data?: QueryRawData;
is_streaming: boolean;
}
interface QueryRawData {
response: string;
references?: ReferenceEntry[];
entities?: EntityData[];
relationships?: RelationshipData[];
chunks?: ChunkData[];
processing_info?: ProcessingInfo;
}
interface ReferenceEntry {
reference_id: string;
file_path: string;
}
interface ChunkData {
content: string;
tokens: number;
source_id: string;
file_path: string;
}
interface ProcessingInfo {
mode: QueryMode;
keyword_extraction: {
high_level_keywords: string[];
low_level_keywords: string[];
};
retrieval_stats: {
entities_retrieved: number;
relationships_retrieved: number;
chunks_retrieved: number;
chunks_after_rerank?: number;
};
context_stats: {
entity_tokens: number;
relation_tokens: number;
chunk_tokens: number;
total_tokens: number;
};
token_budget: {
max_entity_tokens: number;
max_relation_tokens: number;
max_total_tokens: number;
final_entity_tokens: number;
final_relation_tokens: number;
final_chunk_tokens: number;
};
}
```
**Usage Patterns**:
For non-streaming responses:
```typescript
const result = await rag.query("What is AI?", { stream: false });
console.log(result.content); // Complete response text
console.log(result.raw_data?.references); // Citation information
```
For streaming responses:
```typescript
const result = await rag.query("What is AI?", { stream: true });
for await (const chunk of result.response_iterator!) {
process.stdout.write(chunk); // Stream to output
}
```
For context-only retrieval:
```typescript
const result = await rag.query("What is AI?", { only_need_context: true });
console.log(result.raw_data?.entities); // Retrieved entities
console.log(result.raw_data?.chunks); // Retrieved chunks
```
## Configuration Models
### LightRAG Configuration
Complete configuration for a LightRAG instance.
**Python Definition** (`lightrag/lightrag.py:116-384`):
```python
@dataclass
class LightRAG:
# Storage
working_dir: str = "./rag_storage"
kv_storage: str = "JsonKVStorage"
vector_storage: str = "NanoVectorDBStorage"
graph_storage: str = "NetworkXStorage"
doc_status_storage: str = "JsonDocStatusStorage"
workspace: str = ""
# LLM and Embedding
llm_model_func: Callable | None = None
llm_model_name: str = "gpt-4o-mini"
llm_model_max_async: int = 4
llm_model_timeout: int = 180
embedding_func: EmbeddingFunc | None = None
embedding_batch_num: int = 10
embedding_func_max_async: int = 8
default_embedding_timeout: int = 30
# Chunking
chunk_token_size: int = 1200
chunk_overlap_token_size: int = 100
tokenizer: Optional[Tokenizer] = None
tiktoken_model_name: str = "gpt-4o-mini"
# Extraction
entity_extract_max_gleaning: int = 1
entity_types: list[str] = field(default_factory=lambda: DEFAULT_ENTITY_TYPES)
force_llm_summary_on_merge: int = 8
summary_max_tokens: int = 1200
summary_language: str = "English"
# Query
top_k: int = 40
chunk_top_k: int = 20
max_entity_tokens: int = 6000
max_relation_tokens: int = 8000
max_total_tokens: int = 30000
    cosine_threshold: float = 0.2
related_chunk_number: int = 5
kg_chunk_pick_method: str = "VECTOR"
# Reranking
enable_rerank: bool = True
rerank_model_func: Callable | None = None
min_rerank_score: float = 0.0
# Concurrency
max_async: int = 4
max_parallel_insert: int = 2
# Optional
addon_params: dict[str, Any] = field(default_factory=dict)
```
**TypeScript Definition**:
```typescript
interface LightRAGConfig {
// Storage
working_dir?: string;
kv_storage?: string;
vector_storage?: string;
graph_storage?: string;
doc_status_storage?: string;
workspace?: string;
// LLM and Embedding
llm_model_func?: LLMFunction;
llm_model_name?: string;
llm_model_max_async?: number;
llm_model_timeout?: number;
embedding_func?: EmbeddingFunction;
embedding_batch_num?: number;
embedding_func_max_async?: number;
default_embedding_timeout?: number;
// Chunking
chunk_token_size?: number;
chunk_overlap_token_size?: number;
tokenizer?: Tokenizer;
tiktoken_model_name?: string;
// Extraction
entity_extract_max_gleaning?: number;
entity_types?: string[];
force_llm_summary_on_merge?: number;
summary_max_tokens?: number;
summary_language?: string;
// Query
top_k?: number;
chunk_top_k?: number;
max_entity_tokens?: number;
max_relation_tokens?: number;
max_total_tokens?: number;
cosine_threshold?: number;
related_chunk_number?: number;
kg_chunk_pick_method?: "VECTOR" | "WEIGHT";
// Reranking
enable_rerank?: boolean;
rerank_model_func?: RerankFunction;
min_rerank_score?: number;
// Concurrency
max_async?: number;
max_parallel_insert?: number;
// Optional
addon_params?: Record<string, any>;
}
type LLMFunction = (
  prompt: string,
  system_prompt?: string,
  history_messages?: ConversationMessage[],
  stream?: boolean,
  kwargs?: Record<string, any>
) => Promise<string> | AsyncIterableIterator<string>;
type EmbeddingFunction = (texts: string[]) => Promise<number[][]>;
type RerankFunction = (
query: string,
documents: string[]
) => Promise<Array<{ index: number; score: number }>>;
interface Tokenizer {
encode(text: string): number[];
decode(tokens: number[]): string;
}
```
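A short usage sketch merging a user-supplied config over defaults (default values are taken from the Python dataclass above; the `resolveConfig` helper is illustrative):
```typescript
// Defaults mirror the Python dataclass field defaults shown above.
const DEFAULT_CONFIG: LightRAGConfig = {
  working_dir: './rag_storage',
  llm_model_max_async: 4,
  embedding_func_max_async: 8,
  chunk_token_size: 1200,
  chunk_overlap_token_size: 100,
  top_k: 40,
  chunk_top_k: 20,
  max_total_tokens: 30000,
};

function resolveConfig(user: LightRAGConfig = {}): LightRAGConfig {
  return { ...DEFAULT_CONFIG, ...user };
}

const config = resolveConfig({ llm_model_name: 'gpt-4o-mini', workspace: 'tenant_a' });
```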
## TypeScript Type Mapping
### Python to TypeScript Type Conversion
| Python Type | TypeScript Type | Notes |
|------------|----------------|-------|
| `str` | `string` | Direct mapping |
| `int` | `number` | JavaScript/TypeScript uses `number` for all numerics |
| `float` | `number` | Same as `int` |
| `bool` | `boolean` | Direct mapping |
| `list[T]` | `T[]` or `Array<T>` | Both notations are valid in TypeScript |
| `dict[K, V]` | `Record<K, V>` or `Map<K, V>` | `Record` for simple objects, `Map` for dynamic keys |
| `set[T]` | `Set<T>` | Direct mapping |
| `tuple[T1, T2]` | `[T1, T2]` | TypeScript tuple syntax |
| `Literal["a", "b"]` | `"a" \| "b"` | Union of literal types |
| `Optional[T]` | `T \| undefined` or `T?` | Optional property syntax |
| `Union[T1, T2]` | `T1 \| T2` | Union type |
| `Any` | `any` | Avoid if possible, use `unknown` for type-safe any |
| `TypedDict` | `interface` | TypeScript interface |
| `@dataclass` | `class` or `interface` | Use `class` for behavior, `interface` for pure data |
| `Callable[..., T]` | `(...args: any[]) => T` | Function type |
| `AsyncIterator[T]` | `AsyncIterableIterator<T>` | Async iteration support |
### Python-Specific Features Requiring Special Handling
**Dataclasses with field defaults**:
```python
# Python
@dataclass
class Example:
name: str
items: list[str] = field(default_factory=list)
```
```typescript
// TypeScript - Option 1: Class with constructor
class Example {
name: string;
items: string[];
constructor(name: string, items: string[] = []) {
this.name = name;
this.items = items;
}
}
// TypeScript - Option 2: Interface with builder
interface Example {
name: string;
items: string[];
}
function createExample(name: string, items: string[] = []): Example {
return { name, items };
}
```
**Multiple inheritance from ABC**:
```python
# Python
@dataclass
class BaseGraphStorage(StorageNameSpace, ABC):
pass
```
```typescript
// TypeScript - Use composition over inheritance
abstract class StorageNameSpace {
abstract initialize(): Promise<void>;
}
abstract class BaseGraphStorage extends StorageNameSpace {
// Additional abstract methods
}
// Or use interfaces for pure contracts
interface IStorageNameSpace {
initialize(): Promise<void>;
}
interface IGraphStorage extends IStorageNameSpace {
// Graph-specific methods
}
```
**Overloaded functions**:
```python
# Python
@overload
def get(id: str) -> dict | None: ...
@overload
def get(ids: list[str]) -> list[dict]: ...
```
```typescript
// TypeScript - Native overload support
function get(id: string): Promise<Record<string, any> | null>;
function get(ids: string[]): Promise<Record<string, any>[]>;
function get(idOrIds: string | string[]): Promise<any> {
if (typeof idOrIds === 'string') {
// Single ID logic
} else {
// Multiple IDs logic
}
}
```
### Validation and Serialization
For runtime validation and serialization in TypeScript, consider using:
**Zod for schema validation**:
```typescript
import { z } from 'zod';
const TextChunkSchema = z.object({
tokens: z.number().positive(),
content: z.string().min(1),
full_doc_id: z.string().regex(/^[a-f0-9]{32}$/),
chunk_order_index: z.number().nonnegative(),
});
type TextChunk = z.infer<typeof TextChunkSchema>;
// Validate at runtime
const chunk = TextChunkSchema.parse(data);
```
**class-transformer for class serialization**:
```typescript
import { plainToInstance, instanceToPlain } from 'class-transformer';
import { IsString, IsNumber } from 'class-validator';
class TextChunk {
  @IsNumber()
  tokens!: number;
  @IsString()
  content!: string;
  @IsString()
  full_doc_id!: string;
  @IsNumber()
  chunk_order_index!: number;
}
// Convert plain object to class instance
const chunk = plainToInstance(TextChunk, jsonData);
// Convert class instance to plain object
const json = instanceToPlain(chunk);
```
## Summary
This comprehensive data models documentation provides:
1. **Complete type definitions** for all core data structures in both Python and TypeScript
2. **Storage schemas** detailing how data is persisted in each storage layer
3. **Query and response models** with full field descriptions and usage patterns
4. **Configuration models** for system setup and customization
5. **Type mapping guide** for Python-to-TypeScript conversion
6. **Validation strategies** using TypeScript libraries
These type definitions form the contract layer between all components of the system, ensuring type safety and consistent data structures throughout the implementation. The TypeScript definitions leverage the language's strong type system to provide compile-time safety while maintaining compatibility with the original Python design.
The next documentation sections will use these type definitions extensively when describing storage implementations, API contracts, and LLM integrations.

View file

@ -0,0 +1,894 @@
# Dependency Migration Guide: Python to TypeScript/Node.js
## Table of Contents
1. [Core Dependencies Mapping](#core-dependencies-mapping)
2. [Storage Driver Dependencies](#storage-driver-dependencies)
3. [LLM and Embedding Dependencies](#llm-and-embedding-dependencies)
4. [API and Web Framework Dependencies](#api-and-web-framework-dependencies)
5. [Utility and Helper Dependencies](#utility-and-helper-dependencies)
6. [Migration Complexity Assessment](#migration-complexity-assessment)
## Core Dependencies Mapping
### Python Standard Library → Node.js/TypeScript
| Python Package | Purpose | TypeScript/Node.js Equivalent | Migration Notes |
|----------------|---------|-------------------------------|-----------------|
| `asyncio` | Async/await runtime | Native Node.js (async/await) | Direct support, different patterns (see below) |
| `dataclasses` | Data class definitions | TypeScript classes/interfaces | Use `class` with constructor or `interface` |
| `typing` | Type hints | Native TypeScript types | Superior type system in TS |
| `functools` | Function tools (partial, lru_cache) | `lodash/partial`, custom decorators | `lodash.partial` or arrow functions |
| `collections` | Counter, defaultdict, deque | Native Map/Set, or `collections-js` | Map/Set cover most cases |
| `json` | JSON parsing | Native `JSON` | Direct support |
| `hashlib` | MD5, SHA hashing | `crypto` (built-in) | `crypto.createHash('md5')` |
| `os` | OS operations | `fs`, `path` (built-in) | Similar APIs |
| `time` | Time operations | Native `Date`, `performance.now()` | Different epoch (JS uses ms) |
| `datetime` | Date/time handling | Native `Date` or `date-fns` | `date-fns` recommended for manipulation |
| `configparser` | INI file parsing | `ini` npm package | Direct equivalent |
| `warnings` | Warning system | `console.warn()` or custom | Simpler in Node.js |
| `traceback` | Stack traces | Native Error stack | `error.stack` property |
### Core Python Packages → npm Packages
| Python Package | Purpose | npm Package | Version | Migration Complexity |
|----------------|---------|-------------|---------|---------------------|
| `aiohttp` | Async HTTP client | `axios` or `undici` | ^1.6.0 or ^6.0.0 | Low - similar APIs |
| `json_repair` | Fix malformed JSON | `json-repair` | ^0.2.0 | Low - direct port exists |
| `numpy` | Numerical arrays | `@tensorflow/tfjs` or `ndarray` | ^4.0.0 or ^1.0.0 | Medium - different paradigm |
| `pandas` | Data manipulation | `danfojs` | ^1.1.0 | Medium - similar but less mature |
| `pydantic` | Data validation | `zod` or `class-validator` | ^3.22.0 or ^0.14.0 | Low - excellent TS support |
| `python-dotenv` | Environment variables | `dotenv` | ^16.0.0 | Low - identical functionality |
| `tenacity` | Retry logic | `async-retry` or `p-retry` | ^3.0.0 or ^6.0.0 | Low - similar patterns |
| `tiktoken` | OpenAI tokenizer | `@dqbd/tiktoken` or `js-tiktoken` | ^1.0.7 or ^1.0.10 | Medium - WASM-based port |
| `pypinyin` | Chinese pinyin | `pinyin` npm package | ^3.0.0 | Low - direct equivalent |
### Async/Await Pattern Differences
**Python asyncio patterns:**
```python
import asyncio
# Semaphore
semaphore = asyncio.Semaphore(4)
async with semaphore:
await do_work()
# Gather with error handling
results = await asyncio.gather(*tasks, return_exceptions=True)
# Task cancellation
task.cancel()
await task # Raises CancelledError
# Wait with timeout
try:
result = await asyncio.wait_for(coro, timeout=30)
except asyncio.TimeoutError:
pass
```
**TypeScript/Node.js equivalents:**
```typescript
import pLimit from 'p-limit';
import pTimeout from 'p-timeout';
// Semaphore using p-limit
const limit = pLimit(4);
await limit(() => doWork());
// Promise.allSettled for error handling
const results = await Promise.allSettled(promises);
results.forEach(result => {
if (result.status === 'fulfilled') {
console.log(result.value);
} else {
console.error(result.reason);
}
});
// AbortController for cancellation
const controller = new AbortController();
fetch(url, { signal: controller.signal });
controller.abort();
// Timeout using p-timeout
try {
const result = await pTimeout(promise, { milliseconds: 30000 });
} catch (error) {
if (error.name === 'TimeoutError') {
// Handle timeout
}
}
```
**Recommended npm packages for async patterns:**
- `p-limit` (^5.0.0): Rate limiting / semaphore
- `p-queue` (^8.0.0): Priority queue for async tasks
- `p-retry` (^6.0.0): Retry with exponential backoff
- `p-timeout` (^6.0.0): Timeout for promises
- `bottleneck` (^2.19.0): Advanced rate limiting
## Storage Driver Dependencies
### PostgreSQL
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `asyncpg` | `pg` | ^8.11.0 | Most popular, excellent TypeScript support |
| | `drizzle-orm` | ^0.29.0 | Optional: Type-safe query builder |
| | `@neondatabase/serverless` | ^0.9.0 | For serverless environments |
**Migration complexity**: Low
**Recommendation**: Use `pg` with connection pooling. Consider `drizzle-orm` for type-safe queries.
```typescript
// PostgreSQL connection with pg
import { Pool } from 'pg';
const pool = new Pool({
host: process.env.PG_HOST,
port: parseInt(process.env.PG_PORT || '5432'),
database: process.env.PG_DATABASE,
user: process.env.PG_USER,
password: process.env.PG_PASSWORD,
max: 20, // Connection pool size
idleTimeoutMillis: 30000,
connectionTimeoutMillis: 2000,
});
// With Drizzle ORM for type safety
import { drizzle } from 'drizzle-orm/node-postgres';
const db = drizzle(pool);
```
### PostgreSQL pgvector Extension
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `pgvector` | `pgvector` | ^0.1.0 | Official Node.js client |
**Migration complexity**: Low
**Implementation**: Use `pgvector` npm package with `pg`:
```typescript
import pgvector from 'pgvector/pg';
// Register pgvector type
await pgvector.registerType(pool);
// Insert vector
await pool.query(
'INSERT INTO items (embedding) VALUES ($1)',
[pgvector.toSql([1.0, 2.0, 3.0])]
);
// Query by similarity
const result = await pool.query(
'SELECT * FROM items ORDER BY embedding <-> $1 LIMIT 10',
[pgvector.toSql(queryVector)]
);
```
### MongoDB
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `motor` | `mongodb` | ^6.3.0 | Official MongoDB driver |
| | `mongoose` | ^8.0.0 | Optional: ODM for schemas |
**Migration complexity**: Low
**Recommendation**: Use official `mongodb` driver. Add `mongoose` if you need schema validation.
```typescript
import { MongoClient } from 'mongodb';
const client = new MongoClient(process.env.MONGODB_URI!, {
maxPoolSize: 50,
minPoolSize: 10,
serverSelectionTimeoutMS: 5000,
});
await client.connect();
const db = client.db('lightrag');
const collection = db.collection('entities');
```
### Redis
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `redis-py` | `ioredis` | ^5.3.0 | Better TypeScript support than `redis` |
| | `redis` | ^4.6.0 | Official client, good but less TS-friendly |
**Migration complexity**: Low
**Recommendation**: Use `ioredis` for better TypeScript experience and cluster support.
```typescript
import Redis from 'ioredis';
const redis = new Redis({
host: process.env.REDIS_HOST,
port: parseInt(process.env.REDIS_PORT || '6379'),
password: process.env.REDIS_PASSWORD,
db: 0,
maxRetriesPerRequest: 3,
enableReadyCheck: true,
lazyConnect: false,
});
// Cluster support
const cluster = new Redis.Cluster([
{ host: 'node1', port: 6379 },
{ host: 'node2', port: 6379 },
]);
```
### Neo4j
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `neo4j` | `neo4j-driver` | ^5.15.0 | Official driver with TypeScript support |
**Migration complexity**: Low
**Recommendation**: Use official driver with TypeScript type definitions.
```typescript
import neo4j from 'neo4j-driver';
const driver = neo4j.driver(
process.env.NEO4J_URI!,
neo4j.auth.basic(
process.env.NEO4J_USER!,
process.env.NEO4J_PASSWORD!
),
{
maxConnectionPoolSize: 50,
connectionTimeout: 30000,
}
);
const session = driver.session({ database: 'neo4j' });
try {
const result = await session.run(
'MATCH (n:Entity {name: $name}) RETURN n',
{ name: 'John' }
);
} finally {
await session.close();
}
```
### Memgraph
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `neo4j` | `neo4j-driver` | ^5.15.0 | Compatible with Neo4j driver |
**Migration complexity**: Low
**Note**: Memgraph uses the Neo4j protocol, so the same driver works.
### NetworkX (Graph Library)
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `networkx` | `graphology` | ^0.25.0 | Modern graph library for JS |
| | `cytoscape` | ^3.28.0 | Alternative with visualization |
**Migration complexity**: Medium
**Recommendation**: Use `graphology` - most feature-complete and maintained.
```typescript
import Graph from 'graphology';
const graph = new Graph({ type: 'undirected' });
// Add nodes and edges
graph.addNode('A', { name: 'Node A', type: 'entity' });
graph.addNode('B', { name: 'Node B', type: 'entity' });
graph.addEdge('A', 'B', { weight: 0.5, description: 'related to' });
// Query
const degree = graph.degree('A');
const neighbors = graph.neighbors('A');
const hasEdge = graph.hasEdge('A', 'B');
// Serialization: graphology exports/imports a plain JSON structure
const json = graph.export();
const newGraph = Graph.from(json);
```
### FAISS (Vector Search)
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `faiss-cpu` | **No direct port** | N/A | Need alternative approach |
**Migration complexity**: High
**Alternatives**:
1. **hnswlib-node** (^2.0.0): HNSW algorithm implementation
2. **vectra** (^0.4.0): Simple vector database for Node.js
3. Use cloud services: Pinecone, Weaviate, Qdrant
```typescript
// Option 1: hnswlib-node (closest to FAISS)
import { HierarchicalNSW } from 'hnswlib-node';
const index = new HierarchicalNSW('cosine', 1536);
index.initIndex(10000); // Max elements
vectors.forEach((vec, i) => index.addPoint(vec, labels[i])); // hnswlib-node adds points one at a time
const results = index.searchKnn(queryVector, 10); // { neighbors, distances }
// Option 2: vectra (simpler, file-based)
import { LocalIndex } from 'vectra';
const index = new LocalIndex('./vectors');
await index.createIndex();
await index.insertItem({
id: 'item1',
vector: [0.1, 0.2, ...],
metadata: { text: 'content' }
});
const results = await index.queryItems(queryVector, 10);
```
### Milvus
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `pymilvus` | `@zilliz/milvus2-sdk-node` | ^2.3.0 | Official client |
**Migration complexity**: Low
**Recommendation**: Use official SDK.
```typescript
import { MilvusClient, DataType } from '@zilliz/milvus2-sdk-node';
const client = new MilvusClient({
address: process.env.MILVUS_ADDRESS!,
username: process.env.MILVUS_USER,
password: process.env.MILVUS_PASSWORD,
});
// Create collection
await client.createCollection({
collection_name: 'entities',
fields: [
    { name: 'id', data_type: DataType.VarChar, is_primary_key: true, max_length: 64 },
{ name: 'vector', data_type: DataType.FloatVector, dim: 1536 },
],
});
// Insert vectors
await client.insert({
collection_name: 'entities',
data: [{ id: '1', vector: [...] }],
});
// Search
const results = await client.search({
collection_name: 'entities',
vector: queryVector,
limit: 10,
});
```
### Qdrant
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `qdrant-client` | `@qdrant/js-client-rest` | ^1.8.0 | Official REST client |
**Migration complexity**: Low
**Recommendation**: Use official client.
```typescript
import { QdrantClient } from '@qdrant/js-client-rest';
const client = new QdrantClient({
url: process.env.QDRANT_URL!,
apiKey: process.env.QDRANT_API_KEY,
});
// Create collection
await client.createCollection('entities', {
vectors: { size: 1536, distance: 'Cosine' },
});
// Upsert vectors
await client.upsert('entities', {
points: [
{
id: 1,
vector: [...],
payload: { text: 'content' },
},
],
});
// Search
const results = await client.search('entities', {
vector: queryVector,
limit: 10,
});
```
## LLM and Embedding Dependencies
### OpenAI
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `openai` | `openai` | ^4.28.0 | Official SDK with TypeScript |
**Migration complexity**: Low
**Recommendation**: Use official SDK - excellent TypeScript support.
```typescript
import OpenAI from 'openai';
const openai = new OpenAI({
apiKey: process.env.OPENAI_API_KEY,
maxRetries: 3,
timeout: 60000,
});
// Chat completion
const completion = await openai.chat.completions.create({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Hello!' }],
temperature: 0.7,
});
// Streaming
const stream = await openai.chat.completions.create({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Hello!' }],
stream: true,
});
for await (const chunk of stream) {
process.stdout.write(chunk.choices[0]?.delta?.content || '');
}
// Embeddings
const embedding = await openai.embeddings.create({
model: 'text-embedding-3-small',
input: 'Text to embed',
});
```
### Anthropic
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `anthropic` | `@anthropic-ai/sdk` | ^0.17.0 | Official SDK |
**Migration complexity**: Low
```typescript
import Anthropic from '@anthropic-ai/sdk';
const client = new Anthropic({
apiKey: process.env.ANTHROPIC_API_KEY,
});
const message = await client.messages.create({
model: 'claude-3-opus-20240229',
max_tokens: 1024,
messages: [{ role: 'user', content: 'Hello!' }],
});
// Streaming
const stream = await client.messages.stream({
model: 'claude-3-opus-20240229',
max_tokens: 1024,
messages: [{ role: 'user', content: 'Hello!' }],
});
for await (const event of stream) {
  if (event.type === 'content_block_delta' && event.delta.type === 'text_delta') {
    process.stdout.write(event.delta.text);
  }
}
```
### Ollama
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `ollama` | `ollama` | ^0.5.0 | Official Node.js library |
**Migration complexity**: Low
```typescript
import ollama from 'ollama';
const response = await ollama.chat({
model: 'llama2',
messages: [{ role: 'user', content: 'Hello!' }],
});
// Streaming
const stream = await ollama.chat({
model: 'llama2',
messages: [{ role: 'user', content: 'Hello!' }],
stream: true,
});
for await (const chunk of stream) {
process.stdout.write(chunk.message.content);
}
// Embeddings
const embedding = await ollama.embeddings({
model: 'llama2',
prompt: 'Text to embed',
});
```
### Hugging Face
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `transformers` | `@xenova/transformers` | ^2.12.0 | Transformers.js (ONNX-based) |
| `sentence-transformers` | **Use Inference API** | N/A | No direct equivalent |
**Migration complexity**: Medium to High
**Recommendation**: Use Hugging Face Inference API for server-side, or Transformers.js for client-side.
```typescript
// Option 1: Hugging Face Inference API (recommended for server)
import { HfInference } from '@huggingface/inference';
const hf = new HfInference(process.env.HF_TOKEN);
const result = await hf.textGeneration({
model: 'meta-llama/Llama-2-7b-chat-hf',
inputs: 'The answer to the universe is',
});
// Option 2: Transformers.js (ONNX models, client-side or server)
import { pipeline } from '@xenova/transformers';
const embedder = await pipeline('feature-extraction',
'Xenova/all-MiniLM-L6-v2');
const embeddings = await embedder('Text to embed', {
pooling: 'mean',
normalize: true,
});
```
### Tokenization (tiktoken)
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `tiktoken` | `@dqbd/tiktoken` | ^1.0.7 | WASM-based port |
| | `js-tiktoken` | ^1.0.10 | Alternative pure JS |
**Migration complexity**: Medium
**Recommendation**: Use `@dqbd/tiktoken` for best compatibility.
```typescript
import { encoding_for_model } from '@dqbd/tiktoken';
// Initialize encoder for specific model
const encoder = encoding_for_model('gpt-4o-mini');
// Encode text to tokens
const tokens = encoder.encode('Hello, world!');
console.log(`Token count: ${tokens.length}`);
// Decode tokens back to text
const text = encoder.decode(tokens);
// Don't forget to free resources
encoder.free();
// Alternative: js-tiktoken (pure JS, no WASM)
import { encodingForModel } from 'js-tiktoken';
const enc = encodingForModel('gpt-4o-mini');
const tokenCount = enc.encode('Hello, world!').length;
```
## API and Web Framework Dependencies
### FastAPI → Node.js Framework
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `fastapi` | `fastify` | ^4.25.0 | Fast, low overhead, TypeScript-friendly |
| | `@fastify/swagger` | ^8.13.0 | OpenAPI documentation |
| | `@fastify/swagger-ui` | ^2.1.0 | Swagger UI |
| | `@fastify/cors` | ^8.5.0 | CORS support |
| | `@fastify/jwt` | ^7.2.0 | JWT authentication |
**Alternative**: Express.js (^4.18.0) - More familiar but slower
**Migration complexity**: Low
**Recommendation**: Use Fastify for similar performance to FastAPI.
```typescript
import Fastify from 'fastify';
import fastifySwagger from '@fastify/swagger';
import fastifySwaggerUi from '@fastify/swagger-ui';
import fastifyJwt from '@fastify/jwt';
const app = Fastify({ logger: true });
// Swagger/OpenAPI
await app.register(fastifySwagger, {
openapi: {
info: { title: 'LightRAG API', version: '1.0.0' },
},
});
await app.register(fastifySwaggerUi, {
routePrefix: '/docs',
});
// JWT
await app.register(fastifyJwt, {
secret: process.env.JWT_SECRET!,
});
// Route with schema
app.post('/query', {
schema: {
body: {
type: 'object',
required: ['query'],
properties: {
query: { type: 'string' },
        mode: { type: 'string', enum: ['local', 'global', 'hybrid', 'mix', 'naive', 'bypass'] },
},
},
},
}, async (request, reply) => {
return { response: 'Answer' };
});
await app.listen({ port: 9621, host: '0.0.0.0' });
```
### Pydantic → TypeScript Validation
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `pydantic` | `zod` | ^3.22.0 | Runtime validation, best TS integration |
| | `class-validator` | ^0.14.0 | Decorator-based validation |
| | `joi` | ^17.12.0 | Traditional validation library |
**Migration complexity**: Low
**Recommendation**: Use `zod` for runtime validation with excellent TypeScript inference.
```typescript
import { z } from 'zod';
// Define schema
const QuerySchema = z.object({
query: z.string().min(3),
mode: z.enum(['local', 'global', 'hybrid', 'mix', 'naive', 'bypass']).default('mix'),
top_k: z.number().positive().default(40),
stream: z.boolean().default(false),
});
// Infer TypeScript type from schema
type QueryInput = z.infer<typeof QuerySchema>;
// Validate at runtime
function handleQuery(data: unknown) {
const validated = QuerySchema.parse(data); // Throws on invalid
// validated is now typed as QueryInput
return validated;
}
// Safe parse (returns result object)
const result = QuerySchema.safeParse(data);
if (result.success) {
console.log(result.data);
} else {
console.error(result.error);
}
```
### Uvicorn → Node.js Server
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `uvicorn` | Native Node.js | N/A | Node.js has built-in HTTP server |
**Migration complexity**: None
**Note**: Fastify/Express handle the server internally. For clustering:
```typescript
import cluster from 'cluster';
import os from 'os';
if (cluster.isPrimary) {
const numCPUs = os.cpus().length;
for (let i = 0; i < numCPUs; i++) {
cluster.fork();
}
} else {
// Start Fastify server
await app.listen({ port: 9621 });
}
```
## Utility and Helper Dependencies
### Text Processing and Utilities
| Python Package | npm Package | Version | Migration Complexity |
|----------------|-------------|---------|---------------------|
| `pypinyin` | `pinyin` | ^3.0.0 | Low - direct equivalent |
| `xlsxwriter` | `xlsx` or `exceljs` | ^0.18.0 or ^4.3.0 | Low - good alternatives |
| `pypdf2` | `pdf-parse` | ^1.1.1 | Medium - different API |
| `python-docx` | `docx` | ^8.5.0 | Low - similar API |
| `python-pptx` | `pptxgenjs` | ^3.12.0 | Medium - different focus |
| `openpyxl` | `xlsx` | ^0.18.0 | Low - handles both read/write |
### Logging and Monitoring
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `logging` | `pino` | ^8.17.0 | Fast, structured logging |
| | `winston` | ^3.11.0 | Feature-rich alternative |
| `psutil` | `systeminformation` | ^5.21.0 | System monitoring |
```typescript
import pino from 'pino';
const logger = pino({
level: process.env.LOG_LEVEL || 'info',
transport: {
target: 'pino-pretty',
options: { colorize: true },
},
});
logger.info({ user: 'John' }, 'User logged in');
logger.error({ err: error }, 'Operation failed');
```
### Authentication
| Python Package | npm Package | Version | Notes |
|----------------|-------------|---------|-------|
| `PyJWT` | `jsonwebtoken` | ^9.0.2 | JWT creation and verification |
| `python-jose` | `jose` | ^5.2.0 | More complete JWT/JWE/JWS library |
| `passlib` | `bcrypt` | ^5.1.0 | Password hashing |
```typescript
import jwt from 'jsonwebtoken';
import bcrypt from 'bcrypt';
// JWT
const token = jwt.sign(
{ user_id: 123 },
process.env.JWT_SECRET!,
{ expiresIn: '24h' }
);
const decoded = jwt.verify(token, process.env.JWT_SECRET!);
// Password hashing
const hash = await bcrypt.hash('password', 10);
const isValid = await bcrypt.compare('password', hash);
```
## Migration Complexity Assessment
### Complexity Levels Defined
**Low Complexity** (1-2 days per component):
- Direct npm equivalent exists with similar API
- Well-documented TypeScript support
- Minimal code changes required
- Examples: PostgreSQL, MongoDB, Redis, OpenAI, Anthropic
**Medium Complexity** (3-5 days per component):
- npm equivalent exists but with different API patterns
- Requires adapter layer or wrapper
- Good TypeScript support but needs configuration
- Examples: NetworkX → graphology, tiktoken, Hugging Face
**High Complexity** (1-2 weeks per component):
- No direct npm equivalent
- Requires custom implementation or major architectural changes
- May need cloud service integration
- Examples: FAISS → alternatives, sentence-transformers
### Overall Migration Complexity by Category
| Category | Complexity | Estimated Effort | Risk Level |
|----------|-----------|------------------|-----------|
| Core Storage (PostgreSQL, MongoDB, Redis) | Low | 1 week | Low |
| Graph Storage (Neo4j, NetworkX) | Medium | 1-2 weeks | Medium |
| Vector Storage (FAISS alternatives) | Medium-High | 2 weeks | Medium |
| LLM Integration (OpenAI, Anthropic, Ollama) | Low | 3 days | Low |
| Tokenization (tiktoken) | Medium | 3 days | Medium |
| API Framework (FastAPI → Fastify) | Low | 1 week | Low |
| Validation (Pydantic → Zod) | Low | 3 days | Low |
| Authentication & Security | Low | 3 days | Low |
| File Processing | Medium | 1 week | Medium |
| Utilities & Helpers | Low | 3 days | Low |
### Version Compatibility Matrix
| Dependency | Recommended Version | Minimum Node.js | Notes |
|-----------|-------------------|----------------|-------|
| Node.js | 20 LTS | 18.0.0 | Use LTS for production |
| TypeScript | 5.3.0+ | N/A | For latest type features |
| pg | ^8.11.0 | 14.0.0 | PostgreSQL client |
| mongodb | ^6.3.0 | 16.0.0 | MongoDB driver |
| ioredis | ^5.3.0 | 14.0.0 | Redis client |
| neo4j-driver | ^5.15.0 | 14.0.0 | Neo4j client |
| graphology | ^0.25.0 | 14.0.0 | Graph library |
| openai | ^4.28.0 | 18.0.0 | OpenAI SDK |
| fastify | ^4.25.0 | 18.0.0 | Web framework |
| zod | ^3.22.0 | 16.0.0 | Validation |
| pino | ^8.17.0 | 14.0.0 | Logging |
| @dqbd/tiktoken | ^1.0.7 | 14.0.0 | Tokenization |
### Migration Strategy Recommendations
**Phase 1 - Core Infrastructure** (Weeks 1-2):
- Set up TypeScript project structure
- Migrate storage abstractions and interfaces
- Implement PostgreSQL storage (reference implementation)
- Set up testing framework
**Phase 2 - Storage Implementations** (Weeks 3-4):
- Migrate MongoDB, Redis implementations
- Implement NetworkX → graphology migration
- Set up vector storage alternatives (Qdrant or Milvus)
- Implement document status storage
**Phase 3 - LLM Integration** (Week 5):
- Migrate OpenAI integration
- Add Anthropic and Ollama support
- Implement tiktoken for tokenization
- Add retry and rate limiting logic
**Phase 4 - Core Logic** (Weeks 6-8):
- Migrate chunking logic
- Implement entity extraction
- Implement graph merging
- Set up pipeline processing
**Phase 5 - Query Engine** (Weeks 8-9):
- Implement query modes
- Add context building
- Integrate reranking
- Add streaming support
**Phase 6 - API Layer** (Week 10):
- Build Fastify server
- Implement all endpoints
- Add authentication
- Set up OpenAPI docs
**Phase 7 - Testing & Polish** (Weeks 11-12):
- Comprehensive testing
- Performance optimization
- Documentation
- Deployment setup
## Summary
This dependency migration guide provides:
1. **Complete mapping** of Python packages to Node.js/npm equivalents
2. **Code examples** showing API differences and migration patterns
3. **Complexity assessment** for each dependency category
4. **Version recommendations** with compatibility notes
5. **Migration strategy** with phased approach
6. **Risk assessment** for high-complexity migrations
The migration is very feasible with most dependencies having good or excellent npm equivalents. The main challenges are:
- FAISS vector search (use Qdrant, Milvus, or hnswlib-node)
- NetworkX graph library (use graphology)
- Sentence transformers (use Hugging Face Inference API)
With proper planning and the recommended npm packages, a production-ready TypeScript implementation can be achieved in 12-14 weeks with a small team.

File diff suppressed because it is too large

File diff suppressed because it is too large

View file

@ -0,0 +1,84 @@
# LightRAG Analysis Scratchpad
## Initial Observations (Phase 1 - Repository Structure)
### Core Files Analysis
- **lightrag.py** (141KB): Main class implementation - massive file with ~2200 lines
- **operate.py** (164KB): Core operations for entity extraction, chunking, querying - ~1782 lines
- **utils.py** (106KB): Utility functions - extensive helper library
- **base.py** (30KB): Base classes and interfaces for storage abstractions
- **prompt.py** (28KB): LLM prompt templates
- **utils_graph.py** (43KB): Graph utility functions
### Architecture Components Identified
#### 1. Storage Layer (lightrag/kg/)
Storage implementations (15+ files):
- **JSON-based**: json_kv_impl.py, json_doc_status_impl.py
- **Vector DBs**: nano_vector_db_impl.py, faiss_impl.py, milvus_impl.py, qdrant_impl.py
- **Graph DBs**: networkx_impl.py, neo4j_impl.py (79KB), memgraph_impl.py (49KB)
- **SQL**: postgres_impl.py (200KB - largest storage impl!)
- **NoSQL**: mongo_impl.py (95KB), redis_impl.py (46KB)
- **Shared**: shared_storage.py (48KB) - centralized storage management
Key insight: PostgreSQL implementation is massive - suggests it's a comprehensive reference implementation
#### 2. LLM Integration Layer (lightrag/llm/)
LLM providers (14 files):
- openai.py (24KB - most comprehensive)
- binding_options.py (27KB) - configuration management
- ollama.py, azure_openai.py, anthropic.py, bedrock.py
- hf.py, llama_index_impl.py, lmdeploy.py
- jina.py (embedding), zhipu.py, nvidia_openai.py, siliconcloud.py, lollms.py
#### 3. API Layer (lightrag/api/)
REST API implementation:
- lightrag_server.py - FastAPI server
- routers/ - modular route handlers
- query_routes.py - query endpoints
- document_routes.py - document management
- graph_routes.py - graph visualization
- ollama_api.py - Ollama compatibility layer
#### 4. WebUI (lightrag_webui/)
TypeScript/React frontend - already exists! This provides reference for:
- TypeScript type definitions (lightrag.ts)
- API client patterns
- Data models used by frontend
## Priority Order for Documentation
1. Executive Summary + Architecture Overview
2. Core Data Models (base.py, types.py)
3. Storage Layer Architecture
4. LLM Integration Patterns
5. Query Pipeline
6. Indexing Pipeline
7. Dependency Migration Guide
8. TypeScript Implementation Roadmap
## Documentation Progress - Update
### Completed Documents (4/8):
1. ✅ Executive Summary (16KB) - Complete system overview
2. ✅ Architecture Documentation (33KB) - 6 comprehensive Mermaid diagrams
3. ✅ Data Models and Schemas (27KB) - Complete type system
4. ✅ Dependency Migration Guide (27KB) - Full npm mapping with complexity assessment
### Next Priority Documents:
5. Storage Layer Implementation Guide - Deep dive into each storage backend
6. TypeScript Project Structure and Migration Roadmap
7. LLM Integration Patterns
8. API Reference with TypeScript Types
### Key Insights for Remaining Docs:
- Focus on practical implementation examples
- Include performance considerations
- Document error handling patterns
- Provide testing strategies
- Add deployment configurations
### Total Documentation So Far:
- ~103KB of technical documentation
- 6 Mermaid architecture diagrams
- 50+ code comparison examples
- Complete dependency mapping for 40+ packages