574 Commits

Author SHA1 Message Date
澄潭
d30f6c6f0a feat(model-router): add auto routing based on user message content (#3403) 2026-01-29 00:08:07 +08:00
zikunchang
f2fcd68ef8 feature: Support getting the API key from the request header when provider.apiTokens is not configured. (#3394)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2026-01-28 14:03:24 +08:00
rinfx
cbcc3ecf43 bugfix for model-mapper & model-router (#3370) 2026-01-28 10:52:45 +08:00
woody
a2eb599eff Implement Vertex Raw mode support in AI Proxy (#3375) 2026-01-21 14:45:06 +08:00
rinfx
3a28a9b6a7 update wasm-go dependency (#3367) 2026-01-20 15:13:59 +08:00
woody
399d2f372e add support for image generation in Vertex AI provider (#3335) 2026-01-19 16:40:29 +08:00
johnlanni
9d8a1c2e95 Fix the issue of backend errors not being propagated in streamable proxy mode 2026-01-15 20:36:49 +08:00
johnlanni
fb71d7b33d fix(mcp): remove accept-encoding header to prevent response compression 2026-01-15 16:43:14 +08:00
woody
f1a5f18c78 feat/ai proxy vertex ai compatible (#3324) 2026-01-14 10:13:00 +08:00
rinfx
5e787b3258 Replace model-router and model-mapper with Go implementation (#3317) 2026-01-13 20:14:29 +08:00
woody
23fbe0e9e9 feat(vertex): Add Express Mode support to the Vertex AI Provider of the ai-proxy plugin (#3301) 2026-01-13 20:00:05 +08:00
qshuai
72c87b3e15 docs: unknown config entry <show_limit_quota_header> in ai-token-ratelimit plugin (#3241) 2026-01-10 11:07:43 +08:00
CZJCC
78d4b33424 feat(ai-proxy): add Bearer Token authentication support for Bedrock p… (#3305) 2026-01-07 19:39:20 +08:00
nixidexiangjiao
b98b51ef06 feat(ai-load-balancer): enhance global least request load balancer (#3255) 2025-12-29 09:28:56 +08:00
Kent Dong
08a7204085 feat: Add traffic-editor plugin (#2825) 2025-12-26 17:29:55 +08:00
Jingze
38d50bbdad feat: Add response-cache plugin (#3061)
Co-authored-by: mirror58229 <674958229@qq.com>
2025-12-26 17:22:03 +08:00
澄潭
85791e4866 fix(mcp-server): fix MCP server version negotiation to comply with spec (#3258) 2025-12-26 17:04:20 +08:00
rinfx
5cc9f65aaa support disable thinking and add reasoning token usage (#3261) 2025-12-26 17:04:07 +08:00
Bingkun Zhao
e7e3ab5ff6 fix: ai-proxy dify provider extract hostname from difyApiUrl (#3257) 2025-12-24 09:58:54 +08:00
rinfx
3191bb1bf5 special handling for cases where extracted content is empty and add unit test (#3251) 2025-12-23 16:55:06 +08:00
rinfx
00d0ad0f5e Cross provider lb bugfix (#3252) 2025-12-23 16:54:15 +08:00
Kent Dong
5327a598ac fix: Switch to the new HasRequestBody logic in ai-proxy (#3211) 2025-12-22 09:41:31 +08:00
rinfx
e1e8e55c83 [feat] ai-security-guard support checking prompt and image in request body (#3206) 2025-12-22 09:36:49 +08:00
rinfx
f4905cbba7 add rebuild logic for ai-cache (#3185) 2025-12-22 09:36:02 +08:00
rinfx
e8bcbde5f4 support vertex's claude (#3236) 2025-12-20 10:33:53 +08:00
woody
6998800c64 fix(ai-proxy): ensure basePathHandling works with original protocol (#3225) 2025-12-16 20:49:21 +08:00
rinfx
7f5b37ae6d vertex support global region (#3213) 2025-12-15 17:19:59 +08:00
Liang Deng
5c17d3faa3 feat(ai-proxy): support handle array content in chatToolMessage2BedrockMessage (#3200)
Signed-off-by: Liang Deng <ytdengliang@gmail.com>
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2025-12-11 14:15:38 +08:00
johnlanni
b6e94b1f60 fix(ai-proxy): only perform protocol conversion for non-original protocols
Change-Id: Ib8ae3ebf6b47284108663c97777032d6282bb53c
2025-12-10 18:50:23 +08:00
johnlanni
8deceb4d2c update go sum 2025-12-09 20:48:14 +08:00
johnlanni
6bf587a4d1 add wrapper.WithRebuildMaxMemBytes(200MB) to ai-statistics&ai-proxy 2025-12-09 20:44:23 +08:00
johnlanni
7bee45b022 update wasm-go dep of mcp-server 2025-12-08 10:20:57 +08:00
rinfx
8a7a375ebd doubao support configuration for domain (#3184) 2025-12-04 21:17:50 +08:00
rinfx
896bcacf4c [feat] ai-security-guard refactor & support checking multimodal input (#3075) 2025-12-04 16:33:59 +08:00
woody
116e7c6904 implement generic provider for vendor-agnostic passthrough (#3175) 2025-12-03 09:52:47 +08:00
woody
ae0bb41885 Fix OpenAI capability rewrite dropping query string (#3168) 2025-11-28 17:44:22 +08:00
rinfx
42334f21df [feat] load balancing across different clusters and endpoints based on metrics (#3063) 2025-11-25 10:32:34 +08:00
rinfx
7a504fd67d remove omitempty for toolcall index (#3148)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-24 16:14:00 +08:00
xujingfeng
c3077d7981 fix(ai-proxy): Adjust log level to reduce redundant warning messages (#3120) 2025-11-13 19:24:14 +08:00
woody
8a3c0bb342 feat(ai-proxy): add video-related API paths and capabilities (#3108) 2025-11-11 19:39:49 +08:00
victorserbu2709
1300e09e28 groq add responses capability (#3029)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-11 19:36:26 +08:00
rinfx
d4e6704f33 [bugfix] add claude usage & bedrock tool_call index (#3095) 2025-11-10 10:03:01 +08:00
澄潭
36df9ba5e8 test(mcp-server): add UT (#3097) 2025-11-06 10:36:00 +08:00
澄潭
826c4e8b4a feat(mcp-server): add server-level default authentication and MCP proxy server support (#3096) 2025-11-05 22:23:41 +08:00
rinfx
1900609fd5 include usage if stream is true (#3084) 2025-11-03 15:36:18 +08:00
johnlanni
f79e3b9556 update wasm-go sdk for wasmplugins which use redis call
Change-Id: Ifc5efb21f4860fc85d096604a53a10e85797d813
2025-11-03 15:05:03 +08:00
Jingze
ccbb542fec fix(log-request-response): enhance response body logging by checking Content-Encoding (#3074) 2025-10-30 10:59:04 +08:00
rinfx
af8748d754 add inject_encoded_data_to_filter_chain_on_header example (#3071) 2025-10-30 10:58:25 +08:00
johnlanni
d386739e48 upgrade wasm-go to 1.0.4 in jsonrpc-converter
Change-Id: I6a5fc136907e9864a6450d53f6ec5b926af8887c
Signed-off-by: johnlanni <zty98751@alibaba-inc.com>
2025-10-29 17:53:50 +08:00
woody
5e4c262814 Feat/vllm provider (#3067) 2025-10-29 14:31:38 +08:00