pdf-icon

Whisper-base

Introduction

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Trained on 680k hours of labelled data Whisper models demonstrate a strong ability to generalise to many datasets and domains without the need for fine-tuning.

Available NPU Models

whisper-base

Support Platforms: LLM630 Compute Kit, Module LLM, and Module LLM Kit

  • The models are supports multilingual speech recognition and translation.

  • encode 660.31ms

  • avg-decode 51.11ms

Install

apt install llm-model-whisper-base
On This Page