Commit Graph

182 Commits

Author SHA1 Message Date
woody
5e4c262814 Feat/vllm provider (#3067) 2025-10-29 14:31:38 +08:00
victorserbu2709
5fc1d6b222 Add ApiNameAnthropicMessages in claude capabilities (#3040) 2025-10-23 09:54:47 +08:00
澄潭
271e6036fa feat(ai-proxy): enable Qwen compatible mode by default and add missing API endpoints (#3032) 2025-10-22 11:17:09 +08:00
rinfx
6b2d06a330 [bugfix] sometimes bedrock EventStream chunk is not complete (#3010) 2025-10-16 09:52:51 +08:00
johnlanni
15db773e24 update wasm-go dependency 2025-10-11 16:41:08 +08:00
johnlanni
fe69084c04 enable ai-proxy&ai-statistics rebuild logic with new key 2025-10-11 16:32:32 +08:00
Kent Dong
b351dc45e3 doc: Update the description of azureServiceUrl in ai-proxy README files (#2965) 2025-10-06 15:09:53 +08:00
johnlanni
45a11734bd remove rebuild logic in ai-proxy&ai-statistics 2025-09-26 16:26:06 +08:00
johnlanni
063bfbfcfe fix(ai-proxy): fix streaming process 2025-09-23 19:44:30 +08:00
澄潭
623c8da8d8 fix(ai-proxy): Fix Azure OpenAI Response API handling and service URL type detection (#2948) 2025-09-23 18:49:55 +08:00
aias00
88a679ee07 feat(ai-proxy): add Fireworks AI support (#2917) 2025-09-21 14:32:04 +08:00
rinfx
ef12f40c0e deduplicate think tag for bedrock and vertex (#2933) 2025-09-18 14:35:36 +08:00
rinfx
caae3ee068 [feature] bedrock provider support multimodal and thinking (#2897) 2025-09-18 14:22:37 +08:00
rinfx
d7bebf79e1 vertex support multi-modal, function call and thinking (#2926) 2025-09-18 14:22:22 +08:00
johnlanni
1cc977c6d4 add RebuildAfterRequests config for some plugins 2025-09-12 17:31:46 +08:00
澄潭
89d414e49a fix(ai-proxy): Avoid overwriting the existing original auth header (#2904) 2025-09-11 16:54:26 +08:00
澄潭
e2011cb805 fix(claude): support array content format in tool_result and remove duplicate structs (#2892) 2025-09-10 14:18:44 +08:00
johnlanni
4edf79a1f6 Revert "remove qwen Anthropic compatiable mode"
This reverts commit f7d80373f9.
2025-09-09 21:35:31 +08:00
johnlanni
f7d80373f9 remove qwen Anthropic compatiable mode 2025-09-09 11:12:49 +08:00
aias00
91a44ea7aa feat(provider): add support for meituan longcat || feat(provider): add support for meituan longcat (#2883) 2025-09-08 13:38:40 +08:00
xingpiaoliang
d053e01540 feat(ai-proxy): Add provider: nvidia's triton-server (#2843) 2025-09-08 13:37:30 +08:00
澄潭
4a429bf147 fix(ai-proxy): resolve Claude streaming response conversion and SSE event chunking issues (#2882) 2025-09-08 09:54:18 +08:00
Jingze
a00b810be5 feat(wasm-go): add wasm go plugin unit test and ci workflow (#2809) 2025-08-28 20:02:03 +08:00
澄潭
44c33617fa feat(ai-proxy): add OpenRouter provider support (#2823) 2025-08-28 19:26:21 +08:00
澄潭
16a18c6609 feat(ai-proxy): add auto protocol compatibility for OpenAI and Claude APIs (#2810) 2025-08-25 14:13:51 +08:00
Xijun Dai
72b98ab6cf feat(ai-proxy): add anthropic/v1/messages and openai/v1/models support for DeepSeek (#2808)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-08-21 17:40:13 +08:00
co63oc
eaea782693 fix RegisteTickFunc (#2787) 2025-08-19 20:53:53 +08:00
rinfx
890a802481 feat(ai-proxy): bedrock support tool use (#2730) 2025-08-19 16:54:50 +08:00
Kent Dong
c9fa8d15db chore: Restructure the path-to-api-name mapping logic in ai-proxy (#2773) 2025-08-18 19:05:23 +08:00
Kent Dong
0f1afcdcca fix(ai-proxy): Do not change the configured components of Azure URL (#2782) 2025-08-18 16:27:25 +08:00
Kent Dong
24dca0455e fix: Fix bugs in the bedrock model name escaping logic (#2663)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-08-15 17:40:13 +08:00
xingpiaoliang
0af00bef6b feat(ai-proxy): gemini model multimodal support (#2698) 2025-08-11 15:54:34 +08:00
aias00
a1bf1ff009 feat(provider): add support for Grok provider in AI proxy (#2713)
Co-authored-by: 韩贤涛 <601803023@qq.com>
2025-08-07 17:22:47 +08:00
NOBODY
5ee878198c feat(ai-proxy): gemini model thinking support (#2712) 2025-08-06 06:50:46 +08:00
澄潭
645646fe22 Fix the issue where AI route fallback does not work when using Bedrock. (#2653) 2025-07-31 20:16:16 +08:00
Kent Dong
7348c265b5 feat: Support model mapping and more URL configuration formats for Azure OpenAI (#2649) 2025-07-25 11:28:02 +08:00
OxalisCu
ea0bf7c1b7 feat(ai-proxy): support first byte timeout for llm streaming requests (#2652)
Signed-off-by: OxalisCu <2127298698@qq.com>
2025-07-23 21:49:42 +08:00
Xijun Dai
ba1bf353b8 feat(ai-proxy): add anthropic/v1/messages support for qwen (#2648)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-07-23 21:06:56 +08:00
澄潭
30d5b4d32e upgrade wasm-go sdk of some wasm plugins (#2615) 2025-07-17 21:03:42 +08:00
xingpiaoliang
ce271849de docs(wasm-go): update README related to wasm-go (#2586)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-07-16 10:27:40 +08:00
Xijun Dai
c3eb8d0447 feat(ai-proxy): add anthropic && gemini apiName (#2551)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-07-15 19:15:07 +08:00
xingpiaoliang
081ab6ee8d Migrate WASM Go Plugins to New SDK and Go 1.24 (#2532) 2025-07-11 10:43:00 +08:00
woody
ff068258a1 [ai-proxy]qwen text-rerank support (#2537) 2025-07-07 20:27:56 +08:00
woody
0996ad21b1 feat(ai-proxy): add basePathHandling option (#2535)
Co-authored-by: Se7en <chengzw258@163.com>
2025-07-03 20:34:14 +08:00
澄潭
44566f5259 feat: ai-proxy support config subPath field (#2533) 2025-07-02 11:35:28 +08:00
Xijun Dai
41a1455874 fix(ai-proxy): restrict the stream_options parameter to be effective only in the openai/v1/chatcompletions (#2524)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-07-01 21:44:34 +08:00
Kent Dong
db7dbb24a2 fix: Fix an incorrect config property name in the README of ai-proxy plugin (#2503) 2025-07-01 16:29:18 +08:00
HaoJie Liu
9a0cf9b762 fix(ai-proxy): add system message handling for Bedrock requests (#2516) 2025-06-30 10:35:14 +08:00
Xijun Dai
bb786c9618 feat(ai-proxy): add responses support for doubao (#2509)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-06-28 10:39:21 +08:00
HaoJie Liu
864bf5af39 fix(ai-proxy): bedrock support additional request fields (#2480) 2025-06-26 11:29:32 +08:00