cyflhn 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 11 months ago
..
_assets d069c668f8 Model Runtime (#1858) 1 year ago
llm 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 11 months ago
rerank 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
speech2text 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
text_embedding 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
tts 1e829ceaf3 chore: format get_customizable_model_schema return value (#9335) 1 year ago
__init__.py d069c668f8 Model Runtime (#1858) 1 year ago
xinference.py d069c668f8 Model Runtime (#1858) 1 year ago
xinference.yaml 03ba4bc760 fix error with xinference tool calling with qwen2-instruct and add timeout retry setttings for xinference (#11012) 11 months ago
xinference_helper.py 77aef9ff1d refactor: optimize the calculation of rerank threshold and the logic for forbidden characters in model_uid (#8879) 1 year ago