Backup & Restore¶

Comprehensive guide for backing up and restoring Cortex Linux systems.

Backup Strategy¶

graph TB
    A[Backup Strategy] --> B[Full System]
    A --> C[Application Data]
    A --> D[Configuration]

    B --> B1[Disk Image]
    B --> B2[Partition Clone]

    C --> C1[Database]
    C --> C2[User Data]
    C --> C3[Container Volumes]

    D --> D1[/etc/cortex]
    D --> D2[Service Configs]
    D --> D3[Secrets]

What to Back Up¶

Critical Data¶

Data	Location	Frequency
Cortex Config	`/etc/cortex/`	On change
Application Data	`/var/lib/cortex/`	Daily
User Data	`/home/`	Daily
Databases	`/var/lib/postgresql/`	Hourly
Secrets	Varies	On change

System Configuration¶

# Essential system files
/etc/passwd
/etc/shadow
/etc/group
/etc/sudoers.d/
/etc/ssh/
/etc/netplan/
/etc/fstab
/etc/crontab

Backup Methods¶

Using Cortex Snapshots¶

The built-in snapshot system creates recovery points:

# Create snapshot
cortex-ops update rollback --list  # View existing

# Snapshots are created automatically before updates
cortex-ops update apply --packages

Using rsync¶

# Backup to remote server
rsync -avz --delete \
  --exclude='/proc/*' \
  --exclude='/sys/*' \
  --exclude='/dev/*' \
  --exclude='/tmp/*' \
  --exclude='/var/cache/*' \
  / backup@server:/backups/cortex/

# Backup specific directories
rsync -avz /etc/cortex/ backup@server:/backups/cortex/config/
rsync -avz /var/lib/cortex/ backup@server:/backups/cortex/data/

Using tar¶

# Create compressed archive
tar -czvf /backup/cortex-config-$(date +%Y%m%d).tar.gz \
  /etc/cortex/ \
  /var/lib/cortex/

# Exclude unnecessary files
tar -czvf /backup/full-backup.tar.gz \
  --exclude='/proc' \
  --exclude='/sys' \
  --exclude='/dev' \
  --exclude='/tmp' \
  --exclude='/var/cache' \
  /

Using dd (Disk Image)¶

# Full disk image
sudo dd if=/dev/sda of=/backup/disk.img bs=4M status=progress

# Compressed image
sudo dd if=/dev/sda bs=4M status=progress | gzip > /backup/disk.img.gz

# Image specific partition
sudo dd if=/dev/sda1 of=/backup/root.img bs=4M status=progress

Using Restic¶

# Install restic
sudo apt install restic

# Initialize repository
restic init --repo /backup/restic

# Create backup
restic -r /backup/restic backup /etc/cortex /var/lib/cortex

# Create backup to S3
export AWS_ACCESS_KEY_ID=xxx
export AWS_SECRET_ACCESS_KEY=xxx
restic -r s3:s3.amazonaws.com/bucket/cortex backup /etc/cortex

# List snapshots
restic -r /backup/restic snapshots

# Restore
restic -r /backup/restic restore latest --target /restore

Automated Backups¶

Cron Job¶

# /etc/cron.d/cortex-backup
# Daily backup at 2 AM
0 2 * * * root /usr/local/bin/cortex-backup.sh

# Hourly database backup
0 * * * * postgres pg_dump cortex_db > /backup/db/cortex-$(date +\%H).sql

Backup Script¶

#!/bin/bash
# /usr/local/bin/cortex-backup.sh

set -e

BACKUP_DIR="/backup/cortex"
DATE=$(date +%Y%m%d-%H%M%S)
RETENTION_DAYS=30

echo "Starting backup: $DATE"

# Create backup directory
mkdir -p "$BACKUP_DIR/$DATE"

# Backup configuration
tar -czf "$BACKUP_DIR/$DATE/config.tar.gz" /etc/cortex/

# Backup data
tar -czf "$BACKUP_DIR/$DATE/data.tar.gz" /var/lib/cortex/

# Backup databases
pg_dumpall > "$BACKUP_DIR/$DATE/databases.sql"

# Create checksum
cd "$BACKUP_DIR/$DATE"
sha256sum * > checksums.sha256

# Upload to remote (optional)
rsync -avz "$BACKUP_DIR/$DATE" backup@remote:/backups/cortex/

# Clean old backups
find "$BACKUP_DIR" -maxdepth 1 -type d -mtime +$RETENTION_DAYS -exec rm -rf {} \;

echo "Backup completed: $DATE"

Systemd Timer¶

# /etc/systemd/system/cortex-backup.service
[Unit]
Description=Cortex Backup Service

[Service]
Type=oneshot
ExecStart=/usr/local/bin/cortex-backup.sh

# /etc/systemd/system/cortex-backup.timer
[Unit]
Description=Daily Cortex Backup

[Timer]
OnCalendar=*-*-* 02:00:00
Persistent=true

[Install]
WantedBy=timers.target

sudo systemctl enable cortex-backup.timer
sudo systemctl start cortex-backup.timer

Database Backups¶

PostgreSQL¶

# Dump single database
pg_dump -U postgres cortex_db > cortex_db.sql

# Dump all databases
pg_dumpall -U postgres > all_databases.sql

# Compressed dump
pg_dump -U postgres cortex_db | gzip > cortex_db.sql.gz

# Custom format (faster restore)
pg_dump -U postgres -Fc cortex_db > cortex_db.dump

# Restore
psql -U postgres cortex_db < cortex_db.sql
pg_restore -U postgres -d cortex_db cortex_db.dump

SQLite¶

# Backup
sqlite3 /var/lib/cortex/data.db ".backup '/backup/data.db'"

# Or simply copy (when not in use)
cp /var/lib/cortex/data.db /backup/

# Restore
cp /backup/data.db /var/lib/cortex/data.db

Redis¶

# Trigger RDB snapshot
redis-cli BGSAVE

# Copy RDB file
cp /var/lib/redis/dump.rdb /backup/redis-$(date +%Y%m%d).rdb

# Restore
sudo systemctl stop redis
cp /backup/redis-YYYYMMDD.rdb /var/lib/redis/dump.rdb
sudo chown redis:redis /var/lib/redis/dump.rdb
sudo systemctl start redis

Restore Procedures¶

Restore Configuration¶

# From tar archive
tar -xzf config.tar.gz -C /

# From rsync backup
rsync -avz backup@server:/backups/cortex/config/ /etc/cortex/

# Verify and restart services
cortex config validate
sudo systemctl restart cortex

Restore Full System¶

# Boot from live USB

# Mount target disk
mount /dev/sda1 /mnt

# Restore from archive
tar -xzf full-backup.tar.gz -C /mnt

# Restore bootloader
mount --bind /dev /mnt/dev
mount --bind /proc /mnt/proc
mount --bind /sys /mnt/sys
chroot /mnt
grub-install /dev/sda
update-grub
exit

# Unmount and reboot
umount -R /mnt
reboot

Restore from Disk Image¶

# Restore full disk
sudo dd if=/backup/disk.img of=/dev/sda bs=4M status=progress

# Restore compressed image
gunzip -c /backup/disk.img.gz | sudo dd of=/dev/sda bs=4M status=progress

# Restore partition
sudo dd if=/backup/root.img of=/dev/sda1 bs=4M status=progress

Restore from Restic¶

# List snapshots
restic -r /backup/restic snapshots

# Restore specific snapshot
restic -r /backup/restic restore abc123 --target /

# Restore specific files
restic -r /backup/restic restore latest --target /restore \
  --include "/etc/cortex/*"

Disaster Recovery¶

Recovery Checklist¶

Assess the situation
What failed? Hardware, software, data?
What data is affected?
What's the recovery time objective (RTO)?
Prepare recovery environment
Boot from live USB or recovery media
Ensure backup media is accessible
Verify backup integrity
Restore system
Start with base system restore
Apply configuration
Restore application data
Restore databases
Verify restoration
Run health checks: cortex-ops doctor
Verify critical services
Test application functionality
Verify data integrity

Bare Metal Recovery¶

# 1. Boot from Cortex Linux USB

# 2. Partition disk
fdisk /dev/sda
# Create partitions as needed

# 3. Format partitions
mkfs.ext4 /dev/sda1
mkswap /dev/sda2

# 4. Mount target
mount /dev/sda1 /mnt

# 5. Restore from backup
rsync -avz backup@server:/backups/cortex/latest/ /mnt/

# 6. Install bootloader
for i in dev proc sys; do mount --bind /$i /mnt/$i; done
chroot /mnt
grub-install /dev/sda
update-grub
exit

# 7. Configure fstab
nano /mnt/etc/fstab

# 8. Reboot
umount -R /mnt
reboot

Verification¶

Verify Backup Integrity¶

# Check archive integrity
tar -tzf backup.tar.gz > /dev/null && echo "Archive OK"

# Verify checksums
sha256sum -c checksums.sha256

# Test restore to temporary location
mkdir /tmp/restore-test
tar -xzf backup.tar.gz -C /tmp/restore-test
ls /tmp/restore-test/etc/cortex/

Backup Monitoring¶

#!/bin/bash
# /usr/local/bin/check-backup.sh

BACKUP_DIR="/backup/cortex"
MAX_AGE_HOURS=24

# Find latest backup
LATEST=$(ls -t "$BACKUP_DIR" | head -1)
LATEST_PATH="$BACKUP_DIR/$LATEST"

# Check age
AGE_HOURS=$(( ($(date +%s) - $(stat -c %Y "$LATEST_PATH")) / 3600 ))

if [ $AGE_HOURS -gt $MAX_AGE_HOURS ]; then
    echo "CRITICAL: Latest backup is $AGE_HOURS hours old"
    exit 2
fi

# Check size
SIZE=$(du -s "$LATEST_PATH" | cut -f1)
if [ $SIZE -lt 1000 ]; then
    echo "WARNING: Backup size suspiciously small: $SIZE KB"
    exit 1
fi

echo "OK: Latest backup from $LATEST ($AGE_HOURS hours ago)"
exit 0

Best Practices¶

3-2-1 Rule: Keep 3 copies on 2 different media types with 1 offsite
Test restores regularly: Schedule quarterly restore tests
Encrypt sensitive backups: Use GPG or built-in encryption
Monitor backup jobs: Alert on failures
Document procedures: Keep runbooks updated
Automate everything: Human error is the biggest risk