Commit Graph

663 Commits

Author SHA1 Message Date
woody
ae0bb41885 Fix OpenAI capability rewrite dropping query string (#3168) 2025-11-28 17:44:22 +08:00
rinfx
42334f21df [feat] load balancing across different clusters and endpoints based on metrics (#3063) 2025-11-25 10:32:34 +08:00
rinfx
7a504fd67d remove omitempty for toolcall index (#3148)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-24 16:14:00 +08:00
xujingfeng
c3077d7981 fix(ai-proxy): 调整日志级别以减少冗余警告信息 || fix(ai-proxy): Adjust log level to reduce redundant warning messages (#3120) 2025-11-13 19:24:14 +08:00
澄潭
ca64c9a1c7 Update proxy release binanry and fix golang-filter dependencies (#3123) 2025-11-13 15:20:18 +08:00
woody
8a3c0bb342 feat(ai-proxy): add video-related API paths and capabilities (#3108) 2025-11-11 19:39:49 +08:00
victorserbu2709
1300e09e28 groq add responses capability (#3029)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-11-11 19:36:26 +08:00
rinfx
d4e6704f33 [bugfix] add claude usage & bedrock tool_call index (#3095) 2025-11-10 10:03:01 +08:00
澄潭
36df9ba5e8 test(mcp-server): add UT (#3097) 2025-11-06 10:36:00 +08:00
澄潭
826c4e8b4a feat(mcp-server): add server-level default authentication and MCP proxy server support (#3096) 2025-11-05 22:23:41 +08:00
rinfx
1900609fd5 include usage if stream is true (#3084) 2025-11-03 15:36:18 +08:00
johnlanni
f79e3b9556 update wasm-go sdk for wasmplugins which use redis call
Change-Id: Ifc5efb21f4860fc85d096604a53a10e85797d813
2025-11-03 15:05:03 +08:00
Tsukilc
1602b6f94a feat: add higress api mcp server (#2923)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
Co-authored-by: Se7en <chengzw258@163.com>
2025-10-31 15:46:14 +08:00
nohup
d745bc0d0b feat: impl nginx migration mcp server (#2916)
Co-authored-by: 韩贤涛 <601803023@qq.com>
2025-10-31 13:59:15 +08:00
Jun
ef6baf29e8 fix: rag add python example code (#3043) 2025-10-30 20:16:11 +08:00
Jingze
ccbb542fec fix(log-request-response): enhance response body logging by checking Content-Encoding (#3074) 2025-10-30 10:59:04 +08:00
rinfx
af8748d754 add inject_encoded_data_to_filter_chain_on_header example (#3071) 2025-10-30 10:58:25 +08:00
johnlanni
d386739e48 upgrade wasm-go to 1.0.4 in jsonrpc-converter
Change-Id: I6a5fc136907e9864a6450d53f6ec5b926af8887c
Signed-off-by: johnlanni <zty98751@alibaba-inc.com>
2025-10-29 17:53:50 +08:00
woody
5e4c262814 Feat/vllm provider (#3067) 2025-10-29 14:31:38 +08:00
rinfx
2a320f87a6 [feature] add checking of maliciousUrl & modelHallucination, and adjust consumer specific configs (#3024) 2025-10-28 14:12:54 +08:00
SaladDay
1bcef0c00c feature: support secret reference for Redis password in MCP Server (#3006)
Co-authored-by: 澄潭 <zty98751@alibaba-inc.com>
2025-10-27 13:33:52 +08:00
Libres-coder
7c4899ad38 feat(mcp): add list-plugin-instances tool for AI Agent (#3038) 2025-10-25 20:39:22 +08:00
victorserbu2709
5fc1d6b222 Add ApiNameAnthropicMessages in claude capabilities (#3040) 2025-10-23 09:54:47 +08:00
澄潭
271e6036fa feat(ai-proxy): enable Qwen compatible mode by default and add missing API endpoints (#3032) 2025-10-22 11:17:09 +08:00
澄潭
67b92b76fe fix(jsonrpc-converter): Use raw JSON instead of incorrect JSON string formatting (#2988) 2025-10-16 09:58:06 +08:00
rinfx
6b2d06a330 [bugfix] sometimes bedrock EventStream chunk is not complete (#3010) 2025-10-16 09:52:51 +08:00
韩贤涛
1f301be851 fix: Optimization of Rate Limiting Logic for Cluster, AI Token and WASM Plugin (#2997) 2025-10-15 17:24:42 +08:00
johnlanni
15db773e24 update wasm-go dependency 2025-10-11 16:41:08 +08:00
johnlanni
fe69084c04 enable ai-proxy&ai-statistics rebuild logic with new key 2025-10-11 16:32:32 +08:00
rinfx
fcc7fc0139 record consumer name even the consumer is not allowed (#2992) 2025-10-10 20:00:05 +08:00
rinfx
ac2f7dedaa [key-auth] record consumer name once the consumer name is determined (#2978) 2025-10-09 11:22:09 +08:00
Kent Dong
b351dc45e3 doc: Update the description of azureServiceUrl in ai-proxy README files (#2965) 2025-10-06 15:09:53 +08:00
Jun
aebe354055 add vectordb mapping (#2968) 2025-10-06 15:08:13 +08:00
johnlanni
45a11734bd remove rebuild logic in ai-proxy&ai-statistics 2025-09-26 16:26:06 +08:00
johnlanni
063bfbfcfe fix(ai-proxy): fix streaming process 2025-09-23 19:44:30 +08:00
rinfx
9a3ccff4c8 opt(ai-load-balancer): update global least request lua script for ai-load-balancer (#2945) 2025-09-23 19:24:33 +08:00
澄潭
623c8da8d8 fix(ai-proxy): Fix Azure OpenAI Response API handling and service URL type detection (#2948) 2025-09-23 18:49:55 +08:00
Jun
e2d00da861 fix: llm can be empty and optimize document and prompt (#2942) 2025-09-23 14:03:00 +08:00
rinfx
732aacdbc5 fix(ai-security-guard): compatible with old configs (#2941) 2025-09-23 10:23:25 +08:00
Jun
8b8c8b242b feat: add rag mcp server (#2930) 2025-09-21 14:48:22 +08:00
Jingze
f1345f9973 fix: optimize host pattern matching and fix SSE newline bug (#2899) 2025-09-21 14:34:51 +08:00
韩贤涛
de8a9c539b doc: optimize the documentation for hmac-auth-apisix (#2912) 2025-09-21 14:34:07 +08:00
aias00
88a679ee07 feat(ai-proxy): add Fireworks AI support (#2917) 2025-09-21 14:32:04 +08:00
Xijun Dai
47827ad271 refactor(v2): upgrade module to github.com/alibaba/higress/v2 (#2922)
Signed-off-by: Xijun Dai <daixijun1990@gmail.com>
2025-09-21 14:29:07 +08:00
woody
cd2082033c fix: add AttackLevel field support for MultiModalGuard prompt attack detection (#2938) 2025-09-21 14:25:36 +08:00
rinfx
ef12f40c0e deduplicate think tag for bedrock and vertex (#2933) 2025-09-18 14:35:36 +08:00
rinfx
caae3ee068 [feature] bedrock provider support multimodal and thinking (#2897) 2025-09-18 14:22:37 +08:00
rinfx
d7bebf79e1 vertex support multi-modal, function call and thinking (#2926) 2025-09-18 14:22:22 +08:00
rinfx
e70b9ec437 update ai-security-guard test (#2928) 2025-09-17 16:13:24 +08:00
rinfx
7e9f98d14b [ai-statistics] update logic of api extraction (#2927) 2025-09-16 19:51:18 +08:00