Commit Graph

718 Commits

Author SHA1 Message Date
Kent Dong
22ae1aaf69 fix: Fix the incorrect api-version appending logic in AzureProvider (#3289) 2026-02-11 17:44:29 +08:00
Kent Dong
cd0a6116ce fix: Fix jwt-auth plugin related typos (#3291) 2026-02-11 17:43:49 +08:00
woody
5e2892f18c fix(provider/bedrock.go): 优化工具调用消息处理逻辑 || fix(provider/bedrock.go): Optimization tool calls message processing logic (#3470) 2026-02-11 12:33:12 +08:00
澄潭
cd670e957f refactor(ai-proxy): remove automatic Bash tool injection in Claude Code mode (#3462) 2026-02-07 20:24:43 +08:00
澄潭
083bae0e73 feat(ai-proxy): add Claude Code mode support for Claude provider (#3459) 2026-02-07 15:57:19 +08:00
lvshui
9edb709ca4 fix(ai-statistics): 修复请求模型上下文未设置问题 || fix(ai-statistics): Fix the problem that the request model context is not set (#3380)
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2026-02-04 21:11:55 +08:00
daofeng
07cfdaf88a fix(ai-proxy): 处理 Qwen 响应无选择项的情况 (#3448) 2026-02-03 20:33:57 +08:00
澄潭
38dedae47d feat: support use_default_attributes for ai-statistics plugin (#3427) 2026-02-01 13:47:55 +08:00
澄潭
0c0ec53a50 feat(ai-statistics): support token details and builtin keys for reasoning_tokens/cached_tokens (#3424) 2026-02-01 11:54:52 +08:00
澄潭
1b0ee6e837 feat(ai-statistics): add session ID tracking for multi-turn agent conversations (#3420) 2026-02-01 00:35:50 +08:00
澄潭
93075cbc03 fix(model-router): sync model field in request body for auto routing mode (#3422) 2026-01-31 23:41:17 +08:00
澄潭
6c3fd46c6f feat(ai-proxy): add context cleanup command support (#3409) 2026-01-30 17:56:31 +08:00
rinfx
8eaa385a56 support mcp security guard (#3295) 2026-01-29 19:25:43 +08:00
澄潭
d30f6c6f0a feat(model-router): add auto routing based on user message content (#3403) 2026-01-29 00:08:07 +08:00
zikunchang
f2fcd68ef8 feature: Support getting the API key from the request header when provider.apiTokens is not configured. (#3394)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2026-01-28 14:03:24 +08:00
rinfx
cbcc3ecf43 bugfix for model-mapper & model-router (#3370) 2026-01-28 10:52:45 +08:00
woody
a2eb599eff Implement Vertex Raw mode support in AI Proxy (#3375) 2026-01-21 14:45:06 +08:00
rinfx
3a28a9b6a7 update wasm-go dependency (#3367) 2026-01-20 15:13:59 +08:00
woody
399d2f372e add support for image generation in Vertex AI provider (#3335) 2026-01-19 16:40:29 +08:00
TianHao Zhang
ac69eb5b27 fix concurrent SSE connections returning wrong endpoint (#3341) 2026-01-19 10:22:50 +08:00
johnlanni
9d8a1c2e95 Fix the issue of backend errors not being propagated in streamable proxy mode 2026-01-15 20:36:49 +08:00
johnlanni
fb71d7b33d fix(mcp): remove accept-encoding header to prevent response compression 2026-01-15 16:43:14 +08:00
woody
f1a5f18c78 feat/ai proxy vertex ai compatible (#3324) 2026-01-14 10:13:00 +08:00
rinfx
5e787b3258 Replace model-router and model-mapper with Go implementation (#3317) 2026-01-13 20:14:29 +08:00
woody
23fbe0e9e9 feat(vertex): 为 ai-proxy 插件的 Vertex AI Provider 添加 Express Mode 支持 || feat(vertex): Add Express Mode support to Vertex AI Provider of ai-proxy plug-in (#3301) 2026-01-13 20:00:05 +08:00
qshuai
72c87b3e15 docs: unknown config entry <show_limit_quota_header> in ai-token-ratelimit plugin (#3241) 2026-01-10 11:07:43 +08:00
CZJCC
78d4b33424 feat(ai-proxy): add Bearer Token authentication support for Bedrock p… (#3305) 2026-01-07 19:39:20 +08:00
nixidexiangjiao
b98b51ef06 feat(ai-load-balancer): enhance global least request load balancer (#3255) 2025-12-29 09:28:56 +08:00
Kent Dong
08a7204085 feat: Add traffic-editor plugin (#2825) 2025-12-26 17:29:55 +08:00
Jingze
38d50bbdad feat: Add response-cache plugin (#3061)
Co-authored-by: mirror58229 <674958229@qq.com>
2025-12-26 17:22:03 +08:00
澄潭
85791e4866 fix(mcp-server): fix MCP server version negotiation to comply with spec (#3258) 2025-12-26 17:04:20 +08:00
rinfx
5cc9f65aaa support disable thinking and add reasoning token usage (#3261) 2025-12-26 17:04:07 +08:00
xingpiaoliang
17e80b30fe feat: implement hgctl agent module (#3267) 2025-12-26 13:47:32 +08:00
Bingkun Zhao
e7e3ab5ff6 fix: ai-proxy dify provider extract hostname from difyApiUrl (#3257) 2025-12-24 09:58:54 +08:00
rinfx
3191bb1bf5 special handling for cases where extracted content is empty and add unit test (#3251) 2025-12-23 16:55:06 +08:00
rinfx
00d0ad0f5e Cross provider lb bugfix (#3252) 2025-12-23 16:54:15 +08:00
Wangzy
4cf1e5e6a0 Add tool-search server (#3136)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-12-22 09:46:31 +08:00
Kent Dong
5327a598ac fix: Switch to the new HasRequestBody logic in ai-proxy (#3211) 2025-12-22 09:41:31 +08:00
rinfx
e1e8e55c83 [feat] ai-security-guard support checking prompt and image in request body (#3206) 2025-12-22 09:36:49 +08:00
rinfx
f4905cbba7 add rebuild logic for ai-cache (#3185) 2025-12-22 09:36:02 +08:00
Kent Dong
ebbcb15811 fix: Enlarge the request body buffer size when processing multipart data in model-router (#3237) 2025-12-20 10:35:55 +08:00
rinfx
e8bcbde5f4 support vertex's claude (#3236) 2025-12-20 10:33:53 +08:00
007gzs
4fa7fcba01 Rust Plugin add Rule matcher test (#3230) 2025-12-19 14:40:27 +08:00
woody
6998800c64 fix(ai-proxy): ensure basePathHandling works with original protocol (#3225) 2025-12-16 20:49:21 +08:00
rinfx
7f5b37ae6d vertex support global region (#3213) 2025-12-15 17:19:59 +08:00
澄潭
0ada107ec5 feat: enhance model mapper and router with rebuild triggers and path extensions (#3218) 2025-12-12 18:10:57 +08:00
Liang Deng
5c17d3faa3 feat(ai-proxy): support handle array content in chatToolMessage2BedrockMessage (#3200)
Signed-off-by: Liang Deng <ytdengliang@gmail.com>
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2025-12-11 14:15:38 +08:00
johnlanni
b6e94b1f60 fix(ai-proxy): only perform protocol conversion for non-original protocols
Change-Id: Ib8ae3ebf6b47284108663c97777032d6282bb53c
2025-12-10 18:50:23 +08:00
johnlanni
8deceb4d2c update go sum 2025-12-09 20:48:14 +08:00
johnlanni
6bf587a4d1 add wrapper.WithRebuildMaxMemBytes(200MB) to ai-statistics&ai-proxy 2025-12-09 20:44:23 +08:00