LastHopeOfGPNU 1a6b961b5f Resolve 8475 support rerank model from infinity (#10939) 10 달 전
..
__base e61752bd3a feat/enhance the multi-modal support (#8818) 11 달 전
anthropic 3087913b74 Fix the situation where output_tokens/input_tokens may be None in response.usage (#10728) 10 달 전
azure_ai_studio 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 10 달 전
azure_openai e03ec0032b fix: Azure OpenAI o1 max_completion_token error (#10593) 10 달 전
baichuan b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
bedrock c3d11c8ff6 fix: aws presign url is not workable remote url (#10884) 10 달 전
chatglm 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 년 전
cohere 0067b16d1e fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 10 달 전
deepseek 153807f243 fix: response_format label (#8326) 1 년 전
fireworks b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
fishaudio 62051d5171 Corrected type annotation to "Any" from "any" all files in "model_providers" folder (#9135) 11 달 전
gitee_ai ef8022f715 Gitee AI Qwen2.5-72B model (#10595) 10 달 전
google bc1013dacf feat: support json schema for gemini models (#10835) 10 달 전
gpustack 76b0328eb1 feat: add gpustack model provider (#10158) 11 달 전
groq b92504bebc Added Llama 3.2 Vision Models Speech2Text Models for Groq (#9479) 11 달 전
huggingface_hub b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
huggingface_tei 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 11 달 전
hunyuan 92a3898540 fix: resolve the incorrect model name of hunyuan-standard-256k (#10052) 11 달 전
jina 0c1307b083 add jina rerank http timout parameter (#10476) 10 달 전
leptonai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 년 전
localai 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 11 달 전
minimax 5b8f03cd9d add abab7-chat-preview model (#10654) 10 달 전
mistralai 5ddb601e43 add MixtralAI Model (#8517) 1 년 전
mixedbread b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
moonshot 1b5adf40da fix: moonshot response_format raise error (#9847) 11 달 전
nomic b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
novita 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 년 전
nvidia b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
nvidia_nim 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 년 전
oci b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
ollama fbfc811a44 feat: support function call for ollama block chat api (#10784) 10 달 전
openai 82575a7aea fix(gpt-4o-audio-preview): Remove the vision feature (#10932) 10 달 전
openai_api_compatible 1a6b961b5f Resolve 8475 support rerank model from infinity (#10939) 10 달 전
openllm 0067b16d1e fix: refactor all 'or []' and 'or {}' logic to make code more clear (#10883) 10 달 전
openrouter 4d6b45427c Support streaming output for OpenAI o1-preview and o1-mini (#10890) 10 달 전
perfxcloud b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
replicate b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
sagemaker 51db59622c chore(lint): cleanup repeated cause exception in logging.exception replaced by helpful message (#10425) 10 달 전
siliconflow e61242a337 feat: add vlm models from siliconflow (#10704) 10 달 전
spark d0e0111f88 fix:Spark's large language model token calculation error #7911 (#8755) 1 년 전
stepfun 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 11 달 전
tencent 40fb4d16ef chore: refurbish Python code by applying refurb linter rules (#8296) 1 년 전
togetherai 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 년 전
tongyi bd05df5cc5 fix tongyi embedding endpoint return None output (#10857) 10 달 전
triton_inference_server 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 11 달 전
upstage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
vertex_ai 05d43a4074 Fix: Correct the max tokens of Claude-3.5-Sonnet-20241022 for Bedrock and VertexAI (#10508) 10 달 전
vessl_ai aa895cfa9b fix: [VESSL-AI] edit some words in vessl_ai.yaml (#10417) 10 달 전
volcengine_maas 80da0c5830 fix: default max_chunks set to 1 as other providers (#10937) 10 달 전
voyage b90ad587c2 refactor: move the embedding to the rag module and abstract the rerank runner for extension (#9423) 11 달 전
wenxin 4d5546953a add llm: ernie-4.0-turbo-128k of wenxin (#10135) 11 달 전
x bf9349c4dc feat: add xAI model provider (#10272) 11 달 전
xinference 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 11 달 전
yi e0846792d2 feat: add yi custom llm intergration (#9482) 11 달 전
zhinao 2cf1187b32 chore(api/core): apply ruff reformatting (#7624) 1 년 전
zhipuai 033ab5490b feat: support LLM understand video (#9828) 10 달 전
__init__.py d069c668f8 Model Runtime (#1858) 1 년 전
_position.yaml fb49413a41 feat: add voyage ai as a new model provider (#8747) 1 년 전
model_provider_factory.py 4e7b6aec3a feat: support pinning, including, and excluding for model providers and tools (#7419) 1 년 전