LightRAG/docs/diff_hku/waves/wave_3.csv
2025-12-04 19:13:48 +08:00

3.7 KiB

1commitauth_dateauthorsubjectcategorypriority_idxgit_cherry_pick_cmd
2075399ff2025-11-12Daniel.yMerge pull request #2346 from danielaskdd/optimize-json-sanitizationjson11git cherry-pick 075399ff
323cbb9c92025-11-12yangdxAdd data sanitization to JSON writing to prevent UTF-8 encoding errorsjson11git cherry-pick 23cbb9c9
46de4123f2025-11-12yangdxOptimize JSON string sanitization with precompiled regex and zero-copyjson11git cherry-pick 6de4123f
570cc24192025-11-12yangdxFix empty dict handling after JSON sanitizationjson11git cherry-pick 70cc2419
67f54f4702025-11-12yangdxOptimize JSON string sanitization with precompiled regex and zero-copyjson11git cherry-pick 7f54f470
7a08bc7262025-11-12yangdxFix empty dict handling after JSON sanitizationjson11git cherry-pick a08bc726
8abeaac842025-11-12yangdxImprove JSON data sanitization to handle tuples and dict keysjson11git cherry-pick abeaac84
9cca0800e2025-11-12yangdxFix migration to reload sanitized data and prevent memory corruptionjson11git cherry-pick cca0800e
10d1f4b6e52025-11-12yangdxAdd data sanitization to JSON writing to prevent UTF-8 encoding errorsjson11git cherry-pick d1f4b6e5
11dcf1d2862025-11-12yangdxFix migration to reload sanitized data and prevent memory corruptionjson11git cherry-pick dcf1d286
12f28a0c252025-11-12yangdxImprove JSON data sanitization to handle tuples and dict keysjson11git cherry-pick f28a0c25
13c46c1b262025-10-31yangdxAdd pycryptodome dependency for PDF encryption supportpdf12git cherry-pick c46c1b26
1461b57cbb2025-11-01yangdxAdd PDF decryption support for password-protected filespdf12git cherry-pick 61b57cbb
15ece0398d2025-11-01Daniel.yMerge pull request #2296 from danielaskdd/pdf-decryptionpdf12git cherry-pick ece0398d
165a6bb6582025-11-11Daniel.yMerge pull request #2338 from danielaskdd/migrate-to-pypdfpdf12git cherry-pick 5a6bb658
17c434879c2025-11-11yangdxReplace PyPDF2 with pypdf for PDF processingpdf12git cherry-pick c434879c
18fdcb4d0b2025-11-11yangdxReplace PyPDF2 with pypdf for PDF processingpdf12git cherry-pick fdcb4d0b
19186c8f0e2025-11-19yangdxPreserve blank paragraphs in DOCX extraction to maintain spacingdocx13git cherry-pick 186c8f0e
204438ba412025-11-19yangdxEnhance DOCX extraction to preserve document order with tablesdocx13git cherry-pick 4438ba41
2195cd0ece2025-11-19yangdxFix DOCX table extraction by escaping special characters in cellsdocx13git cherry-pick 95cd0ece
22e7d2803a2025-11-19yangdxRemove text stripping in DOCX extraction to preserve whitespacedocx13git cherry-pick e7d2803a
23fa887d812025-11-19yangdxFix table column structure preservation in DOCX extractiondocx13git cherry-pick fa887d81
243f6423df2025-12-01yangdxFix KaTeX extension loading by moving imports to app startupkatex14git cherry-pick 3f6423df
258f4bfbf12025-12-01yangdxAdd KaTeX copy-tex extension support for formula copyingkatex14git cherry-pick 8f4bfbf1
260244699d2025-11-19yangdxOptimize XLSX extraction by using sheet.max_column instead of two-pass scanxlsx20git cherry-pick 0244699d
272b1601632025-11-19yangdxOptimize XLSX extraction to avoid storing all rows in memoryxlsx20git cherry-pick 2b160163
283efb17162025-11-19yangdxEnhance XLSX extraction with structured tab-delimited format and escapingxlsx20git cherry-pick 3efb1716
2987de2b3e2025-11-19yangdxUpdate XLSX extraction documentation to reflect current implementationxlsx20git cherry-pick 87de2b3e
30af4d2a3d2025-11-19Daniel.yMerge pull request #2386 from danielaskdd/excel-optimizationxlsx20git cherry-pick af4d2a3d
31ef659a1e2025-11-19yangdxPreserve column alignment in XLSX extraction with two-pass processingxlsx20git cherry-pick ef659a1e