TacoSkill LAB

spark-engineer

by Jeffallan

56 Favorites
100 Upvotes
0 Downvotes

Use when building Apache Spark applications, distributed data processing pipelines, or optimizing big data workloads. Invoke for DataFrame API, Spark SQL, RDD operations, performance tuning, streaming analytics.

spark

Rating: 6.4
Installs: 0
Category: Data & Analytics

Quick Review

An excellent Spark skill with comprehensive coverage of distributed data processing. The description clearly conveys when to invoke it (DataFrame API, Spark SQL, performance tuning, streaming). A structured reference system organizes deep technical knowledge across 5 specialized files, and the core workflow provides clear steps from analysis to validation. The constraints section is strong, with actionable MUST/MUST NOT rules (broadcast joins, avoiding collect(), skew handling), and the skill is well targeted at production Spark engineering. The novelty score reflects that a skilled CLI agent could handle basic Spark tasks without it; the skill's value lies in optimization and production-grade patterns that would otherwise require extensive token usage.

LLM Signals

Description coverage: 9
Task knowledge: 9
Structure: 9
Novelty: 7

GitHub Signals

69
8
2
20
Last commit 1 day ago

Publisher

Jeffallan

Skill Author

Related Skills

pandas-pro by Jeffallan (6.4)
xlsx by mrgoonie (7.2)
infographic-syntax-creator by antvis (6.8)
faiss by zechenzhangAGI (7.0)