feat(ai-proxy): Add provider: nvidia's triton-server (#2843)

2026-06-09 04:37:31 +08:00 · 2025-09-08 13:37:30 +08:00
parent 4a429bf147
commit d053e01540
4 changed files with 345 additions and 0 deletions
--- a/plugins/wasm-go/extensions/ai-proxy/README_EN.md
+++ b/plugins/wasm-go/extensions/ai-proxy/README_EN.md
@@ -1748,6 +1748,53 @@ provider:
 }
 ```

+### Utilizing OpenAI Protocol Proxy for NVIDIA Triton Interference Server Services
+
+**Configuration Information**
+
+```yaml
+providers:
+  - type: triton
+    tritonDomain: <LOCAL_TRITON_DOMAIN>
+    tritonModelVersion: <MODEL_VERSION>
+    apiTokens:
+      - "****"
+    modelMapping:
+      "*": gpt2
+```
+
+**Request Example**
+
+```json
+{
+  "model": "gpt2",
+  "messages": [
+    {
+      "role": "user",
+      "content": "Hi, who are you？"
+    }
+  ],
+  "stream": false
+}
+```
+**Response Example**
+
+```json
+{
+    "choices": [
+        {
+            "index": 0,
+            "message": {
+                "role": "assistant",
+                "content": "I am a lagguage model."
+            },
+            "finish_reason": "stop",
+        }
+    ],
+    "model": "gpt2",
+}
+```
+
 ## Full Configuration Example

 ### Kubernetes Example