cyflhn 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 7 months ago
..
_assets d069c668f8 Model Runtime (#1858) 1 year ago
llm 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 7 months ago
rerank 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 8 months ago
speech2text 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 8 months ago
text_embedding 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 8 months ago
tts 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 8 months ago
__init__.py d069c668f8 Model Runtime (#1858) 1 year ago
xinference.py d069c668f8 Model Runtime (#1858) 1 year ago
xinference.yaml 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 7 months ago
xinference_helper.py 77aef9ff1d refactor: optimize the calculation of rerank threshold and the logic for forbidden characters in model_uid (#8879) 9 months ago