Observability
Edera provides comprehensive observability capabilities to monitor the health and performance of your containerized workloads through Prometheus-compatible metrics.
Metrics Overview
Edera exposes metrics in Prometheus format, providing insights into:
- Zone Performance: CPU and memory usage for individual zones
- Hypervisor Health: Zone-level hypervisor performance metrics
- Host Resources: Overall host (dom0) system performance
- Service Health: Health check status for Edera components
Metrics Configuration
Default Setup
Edera metrics are available at the default endpoint:
http://127.0.0.1:3035/metricsCustom Configuration
To customize the metrics endpoint, use the metrics-http-listen-addr flag on the protect-orchestrator service:
protect-orchestrator --metrics-http-listen-addr=0.0.0.0:3036Key Metrics Categories
Zone Metrics
Monitor individual zone performance and resource usage:
zone_cpu_usage_percent- CPU utilization per zonezone_memory_total_bytes- Total memory allocated to zoneszone_create_timestamp_milliseconds- Zone creation timestamps
Hypervisor Metrics
Track hypervisor-level performance:
hypervisor_cpu_usage_seconds_total- Cumulative CPU usagehypervisor_memory_max_bytes- Maximum memory allocation
Host Metrics
Monitor the underlying host system:
host_cpu_usage_percent- Host CPU utilizationhost_memory_total_bytes- Total host memory
Health Check Metrics
Service health monitoring:
health_check_total- Service health check status
Integration with Monitoring Tools
Prometheus
Configure Prometheus to scrape Edera metrics by adding this job to your prometheus.yml:
scrape_configs:
- job_name: 'edera'
static_configs:
- targets: ['localhost:3035']Grafana
Import Edera metrics into Grafana dashboards to visualize:
- Zone resource utilization trends
- Hypervisor performance over time
- Host system health
- Alert thresholds and notifications
Alerting Best Practices
Set up alerts for critical metrics:
- High CPU Usage: Alert when
zone_cpu_usage_percent> 80% - Memory Pressure: Monitor
zone_memory_total_bytesapproaching limits - Health Check Failures: Alert on
health_check_totalfailures - Host Resource Exhaustion: Watch
host_cpu_usage_percentandhost_memory_total_bytes
Important Notes
- Metrics are currently in Alpha stage and subject to change
- New metrics may be added in future versions
- Refer to version-specific documentation for the latest metric definitions
Additional Resources
For complete metric reference and version-specific details:
- Metrics Overview - General metrics documentation
- v1.3.0 Metrics Reference - Detailed metric definitions for v1.3.0
- Edera Documentation - Full technical documentation
Need help setting up monitoring? Contact support@edera.dev for assistance with your observability configuration.
