Junjie.M 286cdc41ab reasoning model unified think tag is <think></think> (#13392) 8 months ago
..
__base 286cdc41ab reasoning model unified think tag is <think></think> (#13392) 8 months ago
anthropic 2681bafb76 fix: handle document fetching from URL in Anthropic LLM model, solving base64 decoding error (#11858) 10 months ago
azure_ai_studio 413dfd5628 feat: add completion mode and context size options for LLM configuration (#13325) 8 months ago
azure_openai 34b21b3065 feat: Add o3-mini and o3-mini-2025-01-31 model variants (#13129) 8 months ago
baichuan daccb10d8c fix: volcengine_maas and baichuan message error (#11625) 10 months ago
bedrock 1a2523fd15 feat: bedrock_endpoint_url (#12838) 8 months ago
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
cohere b09c39c8dc refactor: avoid to use extra space when finding model by name (#13043) 9 months ago
deepseek da2ee04fce fix: correct linewrap think display in generic openai api (#13260) 8 months ago
fireworks 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
fishaudio 448a19bf54 fix: fish audio wrong validate credentials interface (#11019) 11 months ago
gitee_ai 6df17a334c fix: Update the API call address for the text_embedding model (#12342) 10 months ago
google c8dcde6cd0 fix: Gemini 2.0 Flash 001 model yaml file naming (#13372) 8 months ago
gpustack 2bb521b135 Support TTS and Speech2Text for Model Provider GPUStack (#12381) 9 months ago
groq c6ddf6d6cc feat(model_providers): Add Groq DeepSeek-R1-Distill-Llama-70b (#13229) 8 months ago
huggingface_hub 166221d784 chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702) 9 months ago
huggingface_tei 6a0ff3686c fix: fix typo (#12034) 10 months ago
hunyuan baeddd4d15 feat:Add support for stop parameter in hunyuan model #12313 (#12315) 10 months ago
jina 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
minimax 925d69a2ee feat:Support Minimax-Text-01 (#12763) 9 months ago
mistralai 42d986b96d [Pixtral] Add new model ; add vision (#11231) 10 months ago
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 1 year ago
moonshot 6ea77ab4cd fix: DeepSeek API Error with response format active (text and json_object) (#12747) 9 months ago
nomic 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
novita 560c5de1b7 Fixed Novita AI color and added DeepSeek R1 model (#13074) 9 months ago
nvidia 6d66d6da15 feat(model_providers): Support deepseek-r1 for Nvidia Catalog (#13269) 8 months ago
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
oci 40dd63ecef Upgrade oracle models (#13174) 8 months ago
ollama 286cdc41ab reasoning model unified think tag is <think></think> (#13392) 8 months ago
openai 7203991032 feat: add parameter "reasoning_effort" and Openai o3-mini (#13243) 8 months ago
openai_api_compatible 286cdc41ab reasoning model unified think tag is <think></think> (#13392) 8 months ago
openllm 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
openrouter 6e5c915f96 feat(model): add deepseek-r1 for openrouter (#13312) 8 months ago
perfxcloud d44882c1b5 refactor: reduce duplciate code by inheritance (#13073) 9 months ago
replicate 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
sagemaker 147d578922 [Fix] revert sagemaker llm to support model hub (#12378) 9 months ago
siliconflow 8f9db61688 feat: added new silicon flow models (#13369) 8 months ago
spark 9d86147d20 fix: SparkLite API Auth error (#12781) (#12790) 9 months ago
stepfun 3c2e30f348 fix: #12143 support streaming mode content start with "data:" (#12171) 10 months ago
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 year ago
togetherai 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
tongyi 38c31e64db add enable_search parameter to qwen_max, plus, turbo (#13335) 8 months ago
triton_inference_server 166221d784 chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702) 9 months ago
upstage 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
vertex_ai 2348abe4bf feat: added a couple of models not defined in vertex ai, that were already … (#13296) 8 months ago
vessl_ai 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
volcengine_maas 16865d43a8 feat: add deepseek models for volcengine provider (#13283) 8 months ago
voyage 8aae235a71 fix: int None will cause error for context size (#11055) 11 months ago
wenxin 166221d784 chore(lint): fix quotes for f-string formatting by bumping ruff to 0.9.x (#12702) 9 months ago
x cf0ff88120 feat: add grok-2-1212 and grok-2-vision-1212 (#11672) 10 months ago
xinference 286cdc41ab reasoning model unified think tag is <think></think> (#13392) 8 months ago
yi 56e15d09a9 feat: mypy for all type check (#10921) 10 months ago
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 year ago
zhipuai da67916843 feat: add glm-4-air-0111 (#12997) 9 months ago
__init__.py d069c668f8 Model Runtime (#1858) 1 year ago
_position.yaml 59ca44f493 chore(model_runtime): Move deepseek ahead in the providers list. (#13197) 8 months ago
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 year ago