TacoSkill LAB
TacoSkill LAB
HomeSkillHubCreatePlaygroundSkillKit
© 2026 TacoSkill LAB
AboutPrivacyTerms
  1. Home
  2. /
  3. SkillHub
  4. /
  5. promptfoo-evaluation
Improve

promptfoo-evaluation

6.8

by daymade

168Favorites
145Upvotes
0Downvotes

Configures and runs LLM evaluation using Promptfoo framework. Use when setting up prompt testing, creating evaluation configs (promptfooconfig.yaml), writing Python custom assertions, implementing llm-rubric for LLM-as-judge, or managing few-shot examples in prompts. Triggers on keywords like "promptfoo", "eval", "LLM evaluation", "prompt testing", or "model comparison".

evaluation

6.8

Rating

0

Installs

AI & LLM

Category

Quick Review

Excellent skill for LLM evaluation with Promptfoo. The description clearly identifies when to use this skill (prompt testing, eval configs, custom assertions, llm-rubric, few-shot examples). SKILL.md is comprehensive and well-structured, covering configuration, prompt formats, test cases, Python assertions, LLM-as-judge, and troubleshooting. Includes practical examples like echo provider for preview mode, few-shot patterns, and long-text handling. The real-world example and reference to an actual project add credibility. Structure is logical with clear sections, though the document is lengthy (could benefit from more content in separate files). Novelty is solid—setting up Promptfoo evaluations with custom Python assertions and complex few-shot patterns would consume many tokens for a CLI agent to figure out independently. Minor deduction on structure for keeping all content in SKILL.md rather than splitting advanced patterns into separate guides, and novelty score reflects that while useful, the core Promptfoo workflow is somewhat straightforward once learned.

LLM Signals

Description coverage9
Task knowledge9
Structure8
Novelty7

GitHub Signals

469
52
5
2
Last commit 0 days ago

Publisher

daymade

daymade

Skill Author

Related Skills

rag-architectprompt-engineerfine-tuning-expert

Loading SKILL.md…

Try onlineView on GitHub

Publisher

daymade avatar
daymade

Skill Author

Related Skills

rag-architect

Jeffallan

7.0

prompt-engineer

Jeffallan

7.0

fine-tuning-expert

Jeffallan

6.4

mcp-developer

Jeffallan

6.4
Try online