gpu mode tip partial
parent ebaa1f1a4e, commit 42df43d21a
3 changed files with 17 additions and 9 deletions
docs/docs/_partial-gpu-mode-tip.mdx (normal file, 8 additions)
@@ -0,0 +1,8 @@
+GPU acceleration isn't required for most use cases.
+OpenRAG's CPU-only deployment doesn't prevent you from using GPU acceleration in external services, such as Ollama servers.
+
+GPU acceleration is required only for specific use cases, typically involving customization of the ingestion flows or ingestion logic.
+For example, you might write alternate ingest logic in OpenRAG that uses GPUs directly in the container, or customize the ingestion flows to use Langflow's Docling component with GPU acceleration instead of OpenRAG's `docling serve` service.
+
+To deploy OpenRAG on an Amazon EC2 instance with GPU acceleration enabled, you must edit the `docker-compose.gpu.yml` file before starting the OpenRAG containers.
+For more information, see [Some images failed to pull on Amazon EC2 with GPU support](/support/troubleshoot#some-images-failed-to-pull-on-amazon-ec2-with-gpu-support).
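As context for the partial above, a minimal sketch of how the GPU override is typically layered at startup. This assumes an NVIDIA host with `nvidia-smi` installed and uses the compose file names from these docs; it is not a substitute for the EC2-specific edits described in the troubleshooting link.

```bash
# Sketch only: confirm the host GPU and driver are visible before enabling GPU mode.
# Assumes an NVIDIA GPU with the driver and nvidia-smi installed.
nvidia-smi

# Layer the GPU override on top of the base compose file (Podman shown; Docker is analogous).
podman compose -f docker-compose.yml -f docker-compose.gpu.yml up -d
```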
@@ -12,6 +12,7 @@ import PartialPrereqWindows from '@site/docs/_partial-prereq-windows.mdx';
 import PartialPrereqPython from '@site/docs/_partial-prereq-python.mdx';
 import PartialInstallNextSteps from '@site/docs/_partial-install-next-steps.mdx';
 import PartialOllamaModels from '@site/docs/_partial-ollama-models.mdx';
+import PartialGpuModeTip from '@site/docs/_partial-gpu-mode-tip.mdx';
 
 To manage your own OpenRAG services, deploy OpenRAG with Docker or Podman.
 
@@ -136,15 +137,8 @@ The following variables are required or recommended:
 podman compose -f docker-compose.yml -f docker-compose.gpu.yml up -d
 ```
 
-:::tip
-GPU acceleration isn't required for most use cases.
-OpenRAG's CPU-only deployment doesn't prevent you from using GPU acceleration in external services, such as Ollama servers.
-
-GPU acceleration is required only for specific use cases, typically involving customization of the ingestion flows or ingestion logic.
-For example, writing alternate ingest logic in OpenRAG that uses GPUs directly in the container, or customizing the ingestion flows to use Langflow's Docling component with GPU acceleration instead of the `docling serve` service.
-
-If you are deploying OpenRAG on an Amazon EC2 instance with GPU acceleration enabled, you must edit the `docker-compose.gpu.yml` file before deploying the OpenRAG containers.
-For more information, see [Some images failed to pull on Amazon EC2 with GPU support](/support/troubleshoot#some-images-failed-to-pull-on-amazon-ec2-with-gpu-support).
-:::
+:::tip
+<PartialGpuModeTip />
+:::
 
 4. Wait for the OpenRAG containers to start, and then confirm that all containers are running:
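The last context line of this hunk asks the reader to confirm that all containers are running. A hedged example of one way to do that check, assuming the Podman deployment shown above (Docker Compose accepts the same subcommands):

```bash
# List the compose-managed services and their states
# (service names and count depend on your deployment).
podman compose -f docker-compose.yml -f docker-compose.gpu.yml ps

# Or inspect all running containers directly.
podman ps
```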
@@ -3,6 +3,8 @@ title: Use the TUI
 slug: /tui
 ---
 
+import PartialGpuModeTip from '@site/docs/_partial-gpu-mode-tip.mdx';
+
 The OpenRAG Terminal User Interface (TUI) provides a simplified and guided experience for configuring, managing, and monitoring your OpenRAG deployment directly from the terminal.
 
 
@@ -36,6 +38,10 @@ In the TUI, click **Status**, and then click **Switch to GPU Mode** or **Switch
 
 This change requires restarting all OpenRAG services because each mode has its own `docker-compose` file.
 
+:::tip
+<PartialGpuModeTip />
+:::
+
 ## Exit the OpenRAG TUI
 
 To exit the OpenRAG TUI, press <kbd>q</kbd> on the TUI main page.
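Because switching between GPU and CPU mode restarts all OpenRAG services, here is a hedged sketch of the equivalent manual restart cycle outside the TUI, assuming the compose file names used in the Docker instructions above:

```bash
# Stop the running services. If GPU mode was started with the override file,
# pass the same files to `down` so every service is matched.
podman compose -f docker-compose.yml -f docker-compose.gpu.yml down

# Restart in the other mode, for example CPU-only mode with just the base file:
podman compose -f docker-compose.yml up -d

# Or GPU mode with the override layered on top:
podman compose -f docker-compose.yml -f docker-compose.gpu.yml up -d
```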