DeepSeek-R1-Distill-Qwen-1.5B is fine-tuned based on open-source models, using samples generated by DeepSeek-R1. with 0.5 billion parameters. Key highlights of this model include:
deepseek-r1-1.5B-ax630c
The Base Model providing a 128 context window and a maximum output of 1,024 tokens.
Support Platforms: LLM630 Compute Kit, Module LLM, and Module LLM Kit
apt install llm-model-deepseek-r1-1.5b-ax630c
deepseek-r1-1.5B-p256-ax630c
The Long-Context Model Compared to the Base Model, it provides extended context capabilities, offering a 256 context window and a maximum of 1,024 output tokens.
Support Platforms: LLM630 Compute Kit, Module LLM, Module LLM Kit
apt install llm-model-deepseek-r1-1.5b-p256-ax630c