Commit Graph

678 Commits

Author SHA1 Message Date
Kent Dong
5c225de080 fix: Enlarge the request body buffer size when processing multipart data in model-router (#3237) 2026-01-19 11:53:58 +08:00
rinfx
bf9ef5eefd support vertex's claude (#3236) 2026-01-19 11:53:57 +08:00
007gzs
c9253264ef Rust Plugin add Rule matcher test (#3230) 2026-01-19 11:53:43 +08:00
woody
8c80084ada fix(ai-proxy): ensure basePathHandling works with original protocol (#3225) 2026-01-19 11:53:43 +08:00
rinfx
698a395e89 vertex support global region (#3213) 2026-01-19 11:53:43 +08:00
澄潭
2c72767203 feat: enhance model mapper and router with rebuild triggers and path extensions (#3218) 2026-01-19 11:53:43 +08:00
Liang Deng
bb3ac59834 feat(ai-proxy): support handle array content in chatToolMessage2BedrockMessage (#3200)
Signed-off-by: Liang Deng <ytdengliang@gmail.com>
Co-authored-by: rinfx <yucheng.lxr@alibaba-inc.com>
2026-01-19 11:53:42 +08:00
johnlanni
6c1fe57034 fix(ai-proxy): only perform protocol conversion for non-original protocols
Change-Id: Ib8ae3ebf6b47284108663c97777032d6282bb53c
2026-01-19 11:53:25 +08:00
johnlanni
5c5cc6ac90 update go sum 2026-01-19 11:53:25 +08:00
johnlanni
265da8e4d6 add wrapper.WithRebuildMaxMemBytes(200MB) to ai-statistics&ai-proxy 2026-01-19 11:53:25 +08:00
johnlanni
119698eea4 update wasm-go dep of mcp-server 2026-01-19 11:53:25 +08:00
rinfx
18d20ca135 doubao support configuration for domain (#3184) 2026-01-19 11:53:24 +08:00
rinfx
9978db2ac6 [feat] ai-security-guard refactor & support checking multimoadl input (#3075) 2026-01-19 11:53:24 +08:00
Kent Dong
1582fa6ef9 fix: Bypass the response body processing for MCP streamable transport (#3187) 2026-01-19 11:53:24 +08:00
woody
2b49fd5b26 implement generic provider for vendor-agnostic passthrough (#3175) 2026-01-19 11:53:24 +08:00
woody
48433a6549 Fix OpenAI capability rewrite dropping query string (#3168) 2026-01-19 11:52:50 +08:00
rinfx
8ec48b3b85 [feat] load balancing across different clusters and endpoints based on metrics (#3063) 2026-01-19 11:52:43 +08:00
rinfx
32007d2ab8 remove omitempty for toolcall index (#3148)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2026-01-19 11:52:33 +08:00
xujingfeng
c3077d7981 fix(ai-proxy): 调整日志级别以减少冗余警告信息 || fix(ai-proxy): Adjust log level to reduce redundant warning messages (#3120) 2025-11-13 19:24:14 +08:00
澄潭
ca64c9a1c7 Update proxy release binanry and fix golang-filter dependencies (#3123) 2025-11-13 15:20:18 +08:00
woody
8a3c0bb342 feat(ai-proxy): add video-related API paths and capabilities (#3108) 2025-11-11 19:39:49 +08:00
victorserbu2709
1300e09e28 groq add responses capability (#3029)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-11 19:36:26 +08:00
rinfx
d4e6704f33 [bugfix] add claude usage & bedrock tool_call index (#3095) 2025-11-10 10:03:01 +08:00
澄潭
36df9ba5e8 test(mcp-server): add UT (#3097) 2025-11-06 10:36:00 +08:00
澄潭
826c4e8b4a feat(mcp-server): add server-level default authentication and MCP proxy server support (#3096) 2025-11-05 22:23:41 +08:00
rinfx
1900609fd5 include usage if stream is true (#3084) 2025-11-03 15:36:18 +08:00
johnlanni
f79e3b9556 update wasm-go sdk for wasmplugins which use redis call
Change-Id: Ifc5efb21f4860fc85d096604a53a10e85797d813
2025-11-03 15:05:03 +08:00
Tsukilc
1602b6f94a feat: add higress api mcp server (#2923)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
Co-authored-by: Se7en <chengzw258@163.com>
2025-10-31 15:46:14 +08:00
nohup
d745bc0d0b feat: impl nginx migration mcp server (#2916)
Co-authored-by: 韩贤涛 <601803023@qq.com>
2025-10-31 13:59:15 +08:00
Jun
ef6baf29e8 fix: rag add python example code (#3043) 2025-10-30 20:16:11 +08:00
Jingze
ccbb542fec fix(log-request-response): enhance response body logging by checking Content-Encoding (#3074) 2025-10-30 10:59:04 +08:00
rinfx
af8748d754 add inject_encoded_data_to_filter_chain_on_header example (#3071) 2025-10-30 10:58:25 +08:00
johnlanni
d386739e48 upgrade wasm-go to 1.0.4 in jsonrpc-converter
Change-Id: I6a5fc136907e9864a6450d53f6ec5b926af8887c
Signed-off-by: johnlanni <zty98751@alibaba-inc.com>
2025-10-29 17:53:50 +08:00
woody
5e4c262814 Feat/vllm provider (#3067) 2025-10-29 14:31:38 +08:00
rinfx
2a320f87a6 [feature] add checking of maliciousUrl & modelHallucination, and adjust consumer specific configs (#3024) 2025-10-28 14:12:54 +08:00
SaladDay
1bcef0c00c feature: support secret reference for Redis password in MCP Server (#3006)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-10-27 13:33:52 +08:00
Libres-coder
7c4899ad38 feat(mcp): add list-plugin-instances tool for AI Agent (#3038) 2025-10-25 20:39:22 +08:00
victorserbu2709
5fc1d6b222 Add ApiNameAnthropicMessages in claude capabilities (#3040) 2025-10-23 09:54:47 +08:00
澄潭
271e6036fa feat(ai-proxy): enable Qwen compatible mode by default and add missing API endpoints (#3032) 2025-10-22 11:17:09 +08:00
澄潭
67b92b76fe fix(jsonrpc-converter): Use raw JSON instead of incorrect JSON string formatting (#2988) 2025-10-16 09:58:06 +08:00
rinfx
6b2d06a330 [bugfix] sometimes bedrock EventStream chunk is not complete (#3010) 2025-10-16 09:52:51 +08:00
韩贤涛
1f301be851 fix: Optimization of Rate Limiting Logic for Cluster, AI Token and WASM Plugin (#2997) 2025-10-15 17:24:42 +08:00
johnlanni
15db773e24 update wasm-go dependency 2025-10-11 16:41:08 +08:00
johnlanni
fe69084c04 enable ai-proxy&ai-statistics rebuild logic with new key 2025-10-11 16:32:32 +08:00
rinfx
fcc7fc0139 record consumer name even the consumer is not allowed (#2992) 2025-10-10 20:00:05 +08:00
rinfx
ac2f7dedaa [key-auth] record consumer name once the consumer name is determined (#2978) 2025-10-09 11:22:09 +08:00
Kent Dong
b351dc45e3 doc: Update the description of azureServiceUrl in ai-proxy README files (#2965) 2025-10-06 15:09:53 +08:00
Jun
aebe354055 add vectordb mapping (#2968) 2025-10-06 15:08:13 +08:00
johnlanni
45a11734bd remove rebuild logic in ai-proxy&ai-statistics 2025-09-26 16:26:06 +08:00
johnlanni
063bfbfcfe fix(ai-proxy): fix streaming process 2025-09-23 19:44:30 +08:00