zhaobingshuang 79801f5c30 fix: deepseek reports an error when using Response Format #11677 (#11678) 8 months ago
__base c5f7d650b5 feat: Allow using file variables directly in the LLM node and support more file types. (#10679) 9 months ago
anthropic 02572e8cca fix: claude can not handle empty string (#11238) 8 months ago
azure_ai_studio 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 9 months ago
azure_openai 6f9ce6a199 fix: fix azure open-4o-08-06 when enable json schema cant process content = "" (#11204) 8 months ago
baichuan b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
bedrock 7b5839335a [ref] use one method to get boto client for aws bedrock (#11506) 8 months ago
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 11 months ago
cohere 5093337de1 FEAT: cohere rerank 3.5 model added (#11289) 8 months ago
deepseek 79801f5c30 fix: deepseek reports an error when using Response Format #11677 (#11678) 8 months ago
fireworks b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
fishaudio 448a19bf54 fix: fish audio wrong validate credentials interface (#11019) 9 months ago
gitee_ai 40fc6f529e fix: gitee ai wrong default model, and better para (#11168) 9 months ago
google 926f604f09 feat: add gemini-2.0-flash-exp (#11570) 8 months ago
gpustack 8aae235a71 fix: int None will cause error for context size (#11055) 9 months ago
groq e7a4cfac4d fix: name of llama-3.3-70b-specdec (#11596) 8 months ago
huggingface_hub b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
huggingface_tei 096c0ad564 feat: Add support for TEI API key authentication (#11006) 9 months ago
hunyuan 92a3898540 fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 9 months ago
jina 8aae235a71 fix: int None will cause error for context size (#11055) 9 months ago
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 11 months ago
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 10 months ago
minimax 32f8439143 fix: add the missing abab6.5t-chat model of Minimax (#11484) 8 months ago
mistralai 42d986b96d [Pixtral] Add new model ; add vision (#11231) 8 months ago
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
moonshot 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 8 months ago
nomic b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 11 months ago
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 11 months ago
oci b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
ollama 7e1184c071 feat: support json_schema for ollama models (#11449) 8 months ago
openai aa135a3780 Add TTS to OpenAI_API_Compatible (#11071) 9 months ago
openai_api_compatible 7e154a467b fix: better error message for stream (#11635) 8 months ago
openllm 0067b16d1e fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 9 months ago
openrouter 4d6b45427c Support streaming output for OpenAI o1-preview and o1-mini (#10890) 9 months ago
perfxcloud 8aae235a71 fix: int None will cause error for context size (#11055) 9 months ago
replicate b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
sagemaker 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 9 months ago
siliconflow ec00b25793 feat: add siliconflow qwq and llama3.3 model (#11492) 8 months ago
spark d0e0111f88 fix:Spark's large language model token calculation error #7911 (#8755) 11 months ago
stepfun 643a90c48d fix: use `removeprefix()` instead of `lstrip()` to remove the `data:` prefix (#11272) 8 months ago
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 11 months ago
togetherai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 11 months ago
tongyi fbc4ca980c fix: Remove duplicate 'response_format' parameter from model YAML files (#11531) 8 months ago
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 10 months ago
upstage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 10 months ago
vertex_ai bb3bc60f83 feat(model): add vertex_ai Gemini 2.0 Flash Exp (#11604) 8 months ago
vessl_ai aa895cfa9b fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 9 months ago
volcengine_maas e79eac688a chore(lint): sort __all__ definitions (#11243) 8 months ago
voyage 8aae235a71 fix: int None will cause error for context size (#11055) 9 months ago
wenxin e39e776d03 fix: better wenxin rerank handler, close #11252 (#11283) 8 months ago
x cf0ff88120 feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 8 months ago
xinference 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 9 months ago
yi e0846792d2 feat: add yi custom llm intergration (#9482) 10 months ago
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 11 months ago
zhipuai 142b4fd699 feat: add zhipu glm_4v_flash (#11440) 8 months ago
__init__.py d069c668f8 Model Runtime (#1858) 1 year ago
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) 11 months ago
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 year ago