big-data

vaex

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.

big-datadataframeout-of-core-processinglazy-evaluation

ovachiever

81

Apache Spark, Hadoop, distributed computing, and large-scale data processing for petabyte-scale workloads

apache-sparkhadoopdistributed-computingbig-data

pluginagentmarketplace

11

advanced-analytics

Advanced analytics including machine learning, predictive modeling, and big data techniques

machine-learningpredictive-modelingbig-dataadvanced-analytics

pluginagentmarketplace

1

data-engineering

ETL pipelines, Apache Spark, data warehousing, and big data processing. Use for building data pipelines, processing large datasets, or data infrastructure.

big-datadata-warehouseETLapache-spark

pluginagentmarketplace

21

spark-engineer

Use when building Apache Spark applications, distributed data processing pipelines, or optimizing big data workloads. Invoke for DataFrame API, Spark SQL, RDD operations, performance tuning, streaming analytics.

apache-sparkbig-datadistributed-computingbatch-processing

Jeffallan

245

vaex

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.

pythondata-analysisbig-datascalable-algorithms

K-Dense-AI

3,233360

Agent Skills with tag: big-data

vaex

big-data

advanced-analytics

data-engineering

spark-engineer

vaex