Back to tags
Tag

Agent Skills with tag: big-data

6 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

vaex

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.

big-datadataframeout-of-core-processinglazy-evaluation
ovachiever
ovachiever
81

big-data

Apache Spark, Hadoop, distributed computing, and large-scale data processing for petabyte-scale workloads

apache-sparkhadoopdistributed-computingbig-data
pluginagentmarketplace
pluginagentmarketplace
11

advanced-analytics

Advanced analytics including machine learning, predictive modeling, and big data techniques

machine-learningpredictive-modelingbig-dataadvanced-analytics
pluginagentmarketplace
pluginagentmarketplace
1

data-engineering

ETL pipelines, Apache Spark, data warehousing, and big data processing. Use for building data pipelines, processing large datasets, or data infrastructure.

big-datadata-warehouseETLapache-spark
pluginagentmarketplace
pluginagentmarketplace
21

spark-engineer

Use when building Apache Spark applications, distributed data processing pipelines, or optimizing big data workloads. Invoke for DataFrame API, Spark SQL, RDD operations, performance tuning, streaming analytics.

apache-sparkbig-datadistributed-computingbatch-processing
Jeffallan
Jeffallan
245

vaex

Use this skill for processing and analyzing large tabular datasets (billions of rows) that exceed available RAM. Vaex excels at out-of-core DataFrame operations, lazy evaluation, fast aggregations, efficient visualization of big data, and machine learning on large datasets. Apply when users need to work with large CSV/HDF5/Arrow/Parquet files, perform fast statistics on massive datasets, create visualizations of big data, or build ML pipelines that don't fit in memory.

pythondata-analysisbig-datascalable-algorithms
K-Dense-AI
K-Dense-AI
3,233360