Snapshot Failed
Severity:
High
Elasticsearch Version:
8.5.0
Problem
Snapshot creation fails with error indicating repository corruption or access issues
Root Cause
Repository configuration error or disk space exhaustion on snapshot node
How to Detect
Symptoms
- Snapshot failure logs with error codes
- Repository status shows 'corrupted' or 'unavailable'
- Disk usage on snapshot node exceeds threshold
Commands
GET _snapshot/_status
GET _cat/nodes?v
GET _cluster/health
Remediation Steps
- Verify repository configuration and credentials
- Check disk space on snapshot node and free space if needed
- Re-register or recreate the snapshot repository
- Manually delete incomplete snapshots if necessary
- Retry snapshot creation
Prevention
- Monitor disk space regularly on snapshot nodes
- Implement automated alerts for repository health
- Schedule periodic repository integrity checks
- Ensure proper permissions and network access for repository
Production Example
curl -X GET "localhost:9200/_snapshot/_status"