TacoSkill LAB
TacoSkill LAB
HomeSkillHubCreatePlaygroundSkillKit
© 2026 TacoSkill LAB
AboutPrivacyTerms
  1. Home
  2. /
  3. SkillHub
  4. /
  5. whisper
Improve

whisper

8.1

by davila7

182Favorites
259Upvotes
0Downvotes

OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M params). Use for speech-to-text, podcast transcription, or multilingual audio processing. Best for robust, multilingual ASR.

speech-to-text

8.1

Rating

0

Installs

AI & LLM

Category

Quick Review

Excellent comprehensive skill for Whisper speech recognition. The description clearly conveys capabilities (multilingual ASR, transcription, translation) that enable a CLI agent to invoke it appropriately. Task knowledge is thorough with complete code examples for basic transcription, model selection, batch processing, CLI usage, and integration patterns. Structure is logical with clear sections, though the SKILL.md is somewhat lengthy (could offload some reference tables to separate files). Novelty is moderate: while Whisper setup and parameter tuning can be complex, basic transcription is relatively straightforward for an LLM agent, so the token savings are meaningful but not exceptional. The skill excels at providing actionable guidance (model recommendations, best practices, performance metrics) and covers edge cases well (GPU acceleration, streaming, integration). Minor improvement: the languages.md reference file appears unused in the main document.

LLM Signals

Description coverage9
Task knowledge9
Structure8
Novelty6

GitHub Signals

18,073
1,635
132
71
Last commit 0 days ago

Publisher

davila7

davila7

Skill Author

Related Skills

rag-architectprompt-engineerfine-tuning-expert

Loading SKILL.md…

Try onlineView on GitHub

Publisher

davila7 avatar
davila7

Skill Author

Related Skills

rag-architect

Jeffallan

7.0

prompt-engineer

Jeffallan

7.0

fine-tuning-expert

Jeffallan

6.4

mcp-developer

Jeffallan

6.4
Try online