vaex

8.3

189

350

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that do not fit in memory.

big data

8.3

Rating

Installs

Data & Analytics

Quick Review

Excellent skill for big data processing with Vaex. The description clearly articulates when to use this skill (datasets exceeding RAM, billions of rows), making it easy for a CLI agent to invoke appropriately. The SKILL.md provides comprehensive task knowledge through well-organized patterns, code examples, and references to six detailed capability areas. The structure is exemplary: concise overview with clear indexing to reference files for deep dives, avoiding clutter while maintaining completeness. The skill addresses a genuinely novel use case where a CLI agent would struggle significantly with out-of-memory datasets and require extensive token usage to discover efficient lazy evaluation patterns. Minor score reduction in novelty reflects that while the use case is specialized, the core operations (filtering, aggregations, visualization) are conceptually familiar, just scaled up. Overall, this is a high-quality skill that would meaningfully reduce costs and enable capabilities otherwise impractical for LLM agents.