This endpoint is used to retrieve the list of model services currently installed on the device.
Packages in the llm-model-name format are model packages. To list them, run:
apt list | grep llm-model-
Use the apt command to install packages as needed. For example, to install the llm-model-qwen2.5-0.5b-p256-ax630c package:
apt install llm-model-qwen2.5-0.5b-p256-ax630c
Note: models occupy significant storage space, so it is recommended to install only the ones you need.
You can query the list of models currently available on the device from a PC via the OpenAI API. Before running the program, modify the IP part of the base_url
below to the actual IP address of the device.
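from openai import OpenAI

# Point base_url at the device; replace the IP address with your device's actual IP.
client = OpenAI(
    api_key="sk-",
    base_url="http://192.168.20.186:8000/v1"
)

# Request the model list once and print the returned page.
print(client.models.list())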
SyncPage[Model](data=[
Model(id='melotts_zh-cn', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='qwen2.5-0.5B-prefill-20e', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='sherpa-ncnn-streaming-zipformer-20M-2023-02-17', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='sherpa-ncnn-streaming-zipformer-zh-14M-2023-02-23', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='single_speaker_english_fast', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='single_speaker_fast', created=0, object='model', owned_by='user', permission=[], root=''),
Model(id='qwen2.5-0.5B-p256-ax630c', created=0, object='model', owned_by='user', permission=[], root='')
],
object='list')
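The printed page is verbose when only the model names matter. The following is a minimal sketch, not part of the official example, that reduces the response to a plain list of ids and checks whether a given model is present. The helper names model_ids, has_model, and fetch_models are hypothetical and introduced here for illustration. It assumes each entry exposes an id attribute and that the page exposes its entries as data, as in the output shown above.

```python
from typing import Iterable, List


def model_ids(models: Iterable) -> List[str]:
    """Return the id of every model entry, in the order received."""
    return [m.id for m in models]


def has_model(models: Iterable, wanted: str) -> bool:
    """Return True if a model with the given id is in the list."""
    return wanted in model_ids(models)


def fetch_models(base_url: str):
    """Query the device and return the page of installed models.

    The import is local so the helpers above work without the openai package.
    """
    from openai import OpenAI

    client = OpenAI(api_key="sk-", base_url=base_url)
    return client.models.list()


# Example usage against a live device (replace the IP with your device's):
#   page = fetch_models("http://192.168.20.186:8000/v1")
#   print(model_ids(page.data))
#   print(has_model(page.data, "qwen2.5-0.5B-p256-ax630c"))
```

Checking for a model id before sending a request can give a clearer error than a failed call when the corresponding package has not been installed on the device.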