ragflow/rag
paresh2806 ddeac9ab3d
added SVG for Groq model model providers (#1470)
#1432  #1447 
This PR adds support for the GROQ LLM (Large Language Model).

Groq is an AI solutions company delivering ultra-low latency inference
with the first-ever LPU™ Inference Engine. The Groq API enables
developers to integrate state-of-the-art LLMs, such as Llama-2 and
llama3-70b-8192, into low latency applications with the request limits
specified below. Learn more at [groq.com](https://groq.com/).
Supported Models


| ID | Requests per Minute | Requests per Day | Tokens per Minute |

|----------------------|---------------------|------------------|-------------------|
| gemma-7b-it | 30 | 14,400 | 15,000 |
| gemma2-9b-it | 30 | 14,400 | 15,000 |
| llama3-70b-8192 | 30 | 14,400 | 6,000 |
| llama3-8b-8192 | 30 | 14,400 | 30,000 |
| mixtral-8x7b-32768 | 30 | 14,400 | 5,000 |

---------

Co-authored-by: paresh0628 <paresh.tuvoc@gmail.com>
Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>
2024-07-12 09:25:44 +08:00
..
app upgrade laws parser of docx (#1332) 2024-07-01 15:50:24 +08:00
llm added SVG for Groq model model providers (#1470) 2024-07-12 09:25:44 +08:00
nlp examples empty in categorize (#1422) 2024-07-08 17:40:50 +08:00
res build python version rag-flow (#21) 2024-01-15 08:46:22 +08:00
svr Add file rag/svr/discord_svr.py (#1008) 2024-05-31 13:47:15 +08:00
utils feat: Support Password Access for ElasticSearch (#1072) 2024-06-06 13:19:26 +08:00
__init__.py build python version rag-flow (#21) 2024-01-15 08:46:22 +08:00
raptor.py fix raptor bugs (#928) 2024-05-27 11:01:20 +08:00
settings.py optimize srv broker and executor logic (#630) 2024-05-07 11:43:33 +08:00