ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
Updated Apr 20, 2026 - Python
AI speech solutions for tasks such as ASR, vocal extraction, accompaniment extraction, audio denoising, and enhancement. Supports models such as paraformer, sensevoice, fireredasr, zipformer, moonshine, wenet, whisper, fsmn-vad, silero-vad, CT Transformer punc, Spleeter, and Uvr5, and applies ONNX models in various scenarios.
React Native TurboModule for Sherpa-ONNX speech processing (STT/TTS/Diarization/VAD) that runs completely offline on the device. Supports Android and iOS.
An upgraded framework for training and validation, compared with icefall, using Lightning.
A template for serving zipformer on Triton Inference Server.
Offline speech-to-text for React Native using sherpa-onnx. Supports Zipformer, Paraformer, NeMo CTC, Whisper, and more.
A lightweight smart meeting assistant that integrates speech recognition and AI summarization to streamline note-taking and automatically generate meeting minutes and action items.
Streaming piano-transcription system
🎤 Enable offline speech recognition in React Native using sherpa-onnx, supporting various model architectures for reliable performance.