yangdx
|
05852e1ab2
|
Add max_token_size parameter to embedding function decorators
- Add max_token_size=8192 to all embed funcs
- Move siliconcloud to deprecated folder
- Import wrap_embedding_func_with_attrs
- Update EmbeddingFunc docstring
- Fix langfuse import type annotation
|
2025-11-14 18:41:43 +08:00 |
|
yangdx
|
3d9de5ed03
|
feat: improve Gemini client error handling and retry logic
• Add google-api-core dependency
• Add specific exception handling
• Create InvalidResponseError class
• Update retry decorators
• Fix empty response handling
|
2025-11-08 22:10:09 +08:00 |
|
yangdx
|
de4ed73652
|
Add Gemini embedding support
- Implement gemini_embed function
- Add gemini to embedding binding choices
- Add L2 normalization for dims < 3072
|
2025-11-08 03:34:30 +08:00 |
|
yangdx
|
fc40a36968
|
Add timeout support to Gemini LLM and improve parameter handling
• Add timeout parameter to Gemini client
• Convert timeout seconds to milliseconds
• Update function signatures consistently
• Add Gemini thinking config example
• Clean up parameter documentation
|
2025-11-07 15:50:14 +08:00 |
|
yangdx
|
3cb4eae492
|
Add Chain of Thought support to Gemini LLM integration
- Extract thoughts from response parts
- Add COT enable/disable parameter
|
2025-11-07 15:22:14 +08:00 |
|
yangdx
|
8c27555358
|
Fix Gemini response parsing to avoid warnings from non-text parts
|
2025-11-07 04:00:37 +08:00 |
|
yangdx
|
6e36ff41e1
|
Fix linting
|
2025-11-06 16:01:24 +08:00 |
|
Humphry
|
0b3d31507e
|
extended to use gemini, sswitched to use gemini-flash-latest
|
2025-10-20 13:17:16 +03:00 |
|