582 Commits

Author SHA1 Message Date
lvshui
9edb709ca4 fix(ai-statistics): 修复请求模型上下文未设置问题 || fix(ai-statistics): Fix the problem that the request model context is not set (#3380)
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2026-02-04 21:11:55 +08:00
daofeng
07cfdaf88a fix(ai-proxy): 处理 Qwen 响应无选择项的情况 (#3448) 2026-02-03 20:33:57 +08:00
澄潭
38dedae47d feat: support use_default_attributes for ai-statistics plugin (#3427) 2026-02-01 13:47:55 +08:00
澄潭
0c0ec53a50 feat(ai-statistics): support token details and builtin keys for reasoning_tokens/cached_tokens (#3424) 2026-02-01 11:54:52 +08:00
澄潭
1b0ee6e837 feat(ai-statistics): add session ID tracking for multi-turn agent conversations (#3420) 2026-02-01 00:35:50 +08:00
澄潭
93075cbc03 fix(model-router): sync model field in request body for auto routing mode (#3422) 2026-01-31 23:41:17 +08:00
澄潭
6c3fd46c6f feat(ai-proxy): add context cleanup command support (#3409) 2026-01-30 17:56:31 +08:00
rinfx
8eaa385a56 support mcp security guard (#3295) 2026-01-29 19:25:43 +08:00
澄潭
d30f6c6f0a feat(model-router): add auto routing based on user message content (#3403) 2026-01-29 00:08:07 +08:00
zikunchang
f2fcd68ef8 feature: Support getting the API key from the request header when provider.apiTokens is not configured. (#3394)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2026-01-28 14:03:24 +08:00
rinfx
cbcc3ecf43 bugfix for model-mapper & model-router (#3370) 2026-01-28 10:52:45 +08:00
woody
a2eb599eff Implement Vertex Raw mode support in AI Proxy (#3375) 2026-01-21 14:45:06 +08:00
rinfx
3a28a9b6a7 update wasm-go dependency (#3367) 2026-01-20 15:13:59 +08:00
woody
399d2f372e add support for image generation in Vertex AI provider (#3335) 2026-01-19 16:40:29 +08:00
johnlanni
9d8a1c2e95 Fix the issue of backend errors not being propagated in streamable proxy mode 2026-01-15 20:36:49 +08:00
johnlanni
fb71d7b33d fix(mcp): remove accept-encoding header to prevent response compression 2026-01-15 16:43:14 +08:00
woody
f1a5f18c78 feat/ai proxy vertex ai compatible (#3324) 2026-01-14 10:13:00 +08:00
rinfx
5e787b3258 Replace model-router and model-mapper with Go implementation (#3317) 2026-01-13 20:14:29 +08:00
woody
23fbe0e9e9 feat(vertex): 为 ai-proxy 插件的 Vertex AI Provider 添加 Express Mode 支持 || feat(vertex): Add Express Mode support to Vertex AI Provider of ai-proxy plug-in (#3301) 2026-01-13 20:00:05 +08:00
qshuai
72c87b3e15 docs: unknown config entry <show_limit_quota_header> in ai-token-ratelimit plugin (#3241) 2026-01-10 11:07:43 +08:00
CZJCC
78d4b33424 feat(ai-proxy): add Bearer Token authentication support for Bedrock p… (#3305) 2026-01-07 19:39:20 +08:00
nixidexiangjiao
b98b51ef06 feat(ai-load-balancer): enhance global least request load balancer (#3255) 2025-12-29 09:28:56 +08:00
Kent Dong
08a7204085 feat: Add traffic-editor plugin (#2825) 2025-12-26 17:29:55 +08:00
Jingze
38d50bbdad feat: Add response-cache plugin (#3061)
Co-authored-by: mirror58229 <674958229@qq.com>
2025-12-26 17:22:03 +08:00
澄潭
85791e4866 fix(mcp-server): fix MCP server version negotiation to comply with spec (#3258) 2025-12-26 17:04:20 +08:00
rinfx
5cc9f65aaa support disable thinking and add reasoning token usage (#3261) 2025-12-26 17:04:07 +08:00
Bingkun Zhao
e7e3ab5ff6 fix: ai-proxy dify provider extract hostname from difyApiUrl (#3257) 2025-12-24 09:58:54 +08:00
rinfx
3191bb1bf5 special handling for cases where extracted content is empty and add unit test (#3251) 2025-12-23 16:55:06 +08:00
rinfx
00d0ad0f5e Cross provider lb bugfix (#3252) 2025-12-23 16:54:15 +08:00
Kent Dong
5327a598ac fix: Switch to the new HasRequestBody logic in ai-proxy (#3211) 2025-12-22 09:41:31 +08:00
rinfx
e1e8e55c83 [feat] ai-security-guard support checking prompt and image in request body (#3206) 2025-12-22 09:36:49 +08:00
rinfx
f4905cbba7 add rebuild logic for ai-cache (#3185) 2025-12-22 09:36:02 +08:00
rinfx
e8bcbde5f4 support vertex's claude (#3236) 2025-12-20 10:33:53 +08:00
woody
6998800c64 fix(ai-proxy): ensure basePathHandling works with original protocol (#3225) 2025-12-16 20:49:21 +08:00
rinfx
7f5b37ae6d vertex support global region (#3213) 2025-12-15 17:19:59 +08:00
Liang Deng
5c17d3faa3 feat(ai-proxy): support handle array content in chatToolMessage2BedrockMessage (#3200)
Signed-off-by: Liang Deng <ytdengliang@gmail.com>
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2025-12-11 14:15:38 +08:00
johnlanni
b6e94b1f60 fix(ai-proxy): only perform protocol conversion for non-original protocols
Change-Id: Ib8ae3ebf6b47284108663c97777032d6282bb53c
2025-12-10 18:50:23 +08:00
johnlanni
8deceb4d2c update go sum 2025-12-09 20:48:14 +08:00
johnlanni
6bf587a4d1 add wrapper.WithRebuildMaxMemBytes(200MB) to ai-statistics&ai-proxy 2025-12-09 20:44:23 +08:00
johnlanni
7bee45b022 update wasm-go dep of mcp-server 2025-12-08 10:20:57 +08:00
rinfx
8a7a375ebd doubao support configuration for domain (#3184) 2025-12-04 21:17:50 +08:00
rinfx
896bcacf4c [feat] ai-security-guard refactor & support checking multimoadl input (#3075) 2025-12-04 16:33:59 +08:00
woody
116e7c6904 implement generic provider for vendor-agnostic passthrough (#3175) 2025-12-03 09:52:47 +08:00
woody
ae0bb41885 Fix OpenAI capability rewrite dropping query string (#3168) 2025-11-28 17:44:22 +08:00
rinfx
42334f21df [feat] load balancing across different clusters and endpoints based on metrics (#3063) 2025-11-25 10:32:34 +08:00
rinfx
7a504fd67d remove omitempty for toolcall index (#3148)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-24 16:14:00 +08:00
xujingfeng
c3077d7981 fix(ai-proxy): 调整日志级别以减少冗余警告信息 || fix(ai-proxy): Adjust log level to reduce redundant warning messages (#3120) 2025-11-13 19:24:14 +08:00
woody
8a3c0bb342 feat(ai-proxy): add video-related API paths and capabilities (#3108) 2025-11-11 19:39:49 +08:00
victorserbu2709
1300e09e28 groq add responses capability (#3029)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-11 19:36:26 +08:00
rinfx
d4e6704f33 [bugfix] add claude usage & bedrock tool_call index (#3095) 2025-11-10 10:03:01 +08:00