Agent-Skills.md

Agent Skills: Monitoring & Analytics Skill

Monitor Proxmox infrastructure health and performance. Track node statistics, analyze resource utilization, and identify optimization opportunities across your cluster.

UncategorizedID: aiskillstore/marketplace/monitoring-analytics

Author

aiskillstore

https://github.com/aiskillstore View all skills

Repository

aiskillstore/marketplace

aiskillstore

23014

Install this agent skill to your local

pnpm dlx add-skill https://github.com/aiskillstore/marketplace/tree/HEAD/skills/dataknifeai/monitoring-analytics

Skill Files

Browse the full folder contents for monitoring-analytics.

Loading file tree…

skills/dataknifeai/monitoring-analytics/SKILL.md

Skill Metadata

Name: monitoring-analytics
Description: Monitor Proxmox infrastructure health and performance. Track node statistics, analyze resource utilization, and identify optimization opportunities across your cluster.

Monitoring & Analytics Skill

Monitor and analyze your Proxmox infrastructure health and performance.

What this skill does

This skill enables you to:

Get node statistics and performance metrics
Monitor CPU, memory, and disk utilization
Track network performance
Analyze VM/container performance
Monitor resource allocation efficiency
Identify performance bottlenecks
Generate performance reports
Track usage trends over time
Plan capacity based on metrics
Establish baselines and thresholds

When to use this skill

Use this skill when you need to:

Check cluster health and performance
Monitor node resource usage
Analyze VM/container performance
Identify performance bottlenecks
Troubleshoot performance issues
Plan capacity expansion
Generate performance reports
Establish monitoring baselines
Forecast resource needs
Optimize resource allocation

Available Tools

get_node_status - Get node statistics and performance
get_vm_status - Get VM performance metrics
get_container_status - Get container performance metrics
get_cluster_resources - Get overall cluster metrics

Typical Workflows

Infrastructure Health Check

Use get_cluster_resources for overall health
Use get_node_status for each node
Use get_vm_status and get_container_status for workload analysis
Generate comprehensive health report

Performance Analysis

Use get_node_status to analyze node performance
Use get_vm_status to check VM performance
Identify high-utilization resources
Analyze performance trends
Recommend optimizations

Capacity Planning

Use get_cluster_resources for current utilization
Use get_node_status for detailed metrics
Analyze growth trends
Project future capacity needs
Plan scaling or upgrades

Bottleneck Identification

Use get_node_status to find high CPU/memory nodes
Use get_vm_status for resource-hungry VMs
Use get_storage for disk bottlenecks
Analyze performance impact
Recommend solutions

Example Questions

"What's the current cluster health and performance?"
"Which nodes are running at high utilization?"
"Show me the performance metrics for all VMs"
"Are there any performance bottlenecks?"
"Get a complete performance analysis report"
"Which containers are consuming the most resources?"
"What are the resource trends over time?"

Response Format

When using this skill, I provide:

Node statistics with CPU, memory, disk metrics
VM/container performance data
Utilization trends and analysis
Bottleneck identification
Capacity planning recommendations
Optimization suggestions

Best Practices

Monitor metrics continuously
Establish performance baselines
Set appropriate alert thresholds
Track metrics over time for trends
Identify and optimize peak usage periods
Balance load across nodes
Monitor both physical and virtual resources
Analyze before and after optimization
Keep historical data for trend analysis
Use metrics to justify capacity investments
Monitor network performance
Consider both current and future growth