Skip to content

Latest commit

 

History

History
357 lines (263 loc) · 16.4 KB

File metadata and controls

357 lines (263 loc) · 16.4 KB

Documentation Index

Repository: EDB_Testing - AAP with EnterpriseDB PostgreSQL Multi-Datacenter Last Updated: 2026-03-31 Documentation Version: 1.0


Quick Start

New to this repository? Start here:

  1. Quick Start GuideSTART HERE - OpenShift/RHEL deployment (15-30 min)
  2. AAP Containerized Quick StartNEW - Multi-DC DR deployment (30-60 min planning)
  3. Main README - Architecture overview and table of contents
  4. Deployment Guides - Detailed deployment methods
  5. DR Testing Guide - Complete testing framework

Quick deployment paths:

Need to perform a DR drill?


Documentation by Topic

🚀 Deployment Guides

Choose your deployment method:

Platform Guide Description
RHEL / Bare Metal TPA DeploymentRECOMMENDED Automated deployment with Trusted Postgres Architect
RHEL Manual RHEL Manual Install Traditional VM-based installation
OpenShift OpenShift Manual Install Operator-based deployment on OpenShift
OpenShift (Kustomize) Database Deployment GitOps-friendly Kustomize manifests
AAP on OpenShift AAP Deployment AAP operator with external PostgreSQL

Specialized Deployment Topics:


🏗️ Architecture

Understanding the system:

Document Description Read Time
Architecture OverviewCOMPREHENSIVE Complete architecture documentation 45 min
Main README Architecture High-level overview with diagram 5 min
AAP Containerized Growth DRNEW 3-node multi-DC deployment (cost-optimized) 25 min
AAP Containerized Enterprise DRNEW 8-node multi-DC deployment (production-grade) 30 min
Architecture Validation Report Validation vs Red Hat AAP 2.6 tested models 15 min
RHEL AAP Architecture AAP on RHEL with systemd services 10 min
OpenShift AAP Architecture AAP on OpenShift with operator 10 min

Architecture Overview covers:

  • Component details (GLB, AAP, PostgreSQL clusters)
  • Network connectivity and data flow (writes, reads, backups)
  • Replication topology (streaming + WAL archiving)
  • Datacenter configurations (DC1 active, DC2 passive)
  • Scaling strategies (horizontal, vertical, geographic)
  • Backup and restore architecture

AAP Containerized Deployment Models:

Choose based on your requirements:

Topology VMs Best For RTO Cost
Growth 16 total (3 AAP/DC) Small-medium, budget-conscious < 5 min Lower
Enterprise 26 total (8 AAP/DC) Production-critical, high-scale < 5 min Higher

Architecture Decisions:

  • Active-Passive topology (DC1 primary, DC2 standby)
  • Physical streaming replication + WAL archiving to S3
  • CloudNativePG operator (OpenShift) or EDB Postgres Advanced (RHEL)
  • EDB Failover Manager (EFM) for automated database failover
  • Global Load Balancer for traffic management and health-based routing

🔄 Disaster Recovery

DR Planning and Testing:

Document Purpose Read Time
DR Scenarios 6 documented failure scenarios 15 min
DR Testing Guide Complete testing framework (10,000+ words) 45 min
DR Testing Implementation Summary Implementation details and metrics 10 min
Split-Brain Prevention Database role validation and fencing 15 min
EDB Failover Manager EFM integration and configuration 20 min

DR Validation Reports:

DR Scripts:


⚙️ Operations

Day-to-day operations:

Key Operational Tasks:


📜 Operational Scripts

All scripts located in /scripts/:

Script Purpose Usage
scale-aap-up.sh Scale AAP to operational state ./scale-aap-up.sh <dc1|dc2>
scale-aap-down.sh Scale AAP to zero (DR prep) ./scale-aap-down.sh <dc1|dc2>
efm-orchestrated-failover.sh Full DR failover orchestration Called by EFM (post-promotion)
efm-aap-failover-wrapper.sh EFM integration hook Called by EFM with failover context
monitor-efm-scripts.sh Monitor EFM failover events ./monitor-efm-scripts.sh (CronJob)
dr-failover-test.sh Automated DR testing framework See DR Testing Guide
validate-aap-data.sh AAP data integrity validation ./validate-aap-data.sh <dc1|dc2>
measure-rto-rpo.sh RTO/RPO measurement with milestones ./measure-rto-rpo.sh start <test-id>
generate-dr-report.sh DR test report generation ./generate-dr-report.sh <test-id>

Script Documentation:


🔒 Development & CI/CD

Contributing and automation:

GitHub Actions Workflows:

  • .github/workflows/yaml-validation.yml - Kubernetes manifest validation
  • .github/workflows/shell-script-testing.yml - Bash script testing
  • .github/workflows/pr-validation.yml - PR validation and security scanning

Testing:


📊 Monitoring & Observability

Visibility and alerting:

Planned Documentation:

  • Monitoring and Alerting Guide (see Documentation Audit)
  • Grafana Dashboard Setup
  • PagerDuty Integration

🔐 Security

Security considerations:

Planned Documentation:

  • Security Hardening Guide (see Documentation Audit)
  • TLS/SSL Configuration
  • Secrets Management (Vault, Sealed Secrets)

📦 Reference Materials

Additional resources:

External Links:


Documentation by Deployment Type

RHEL / Bare Metal Deployment

Recommended Path:

  1. TPA Deployment Guide - Automated deployment ⭐
  2. RHEL AAP Architecture - Reference architecture
  3. EDB Failover Manager - EFM setup
  4. Operations Runbook - Day-to-day operations

Alternative:

OpenShift Deployment

Recommended Path:

  1. Database Deployment (Kustomize) - Deploy PostgreSQL ⭐
  2. AAP Deployment - Deploy AAP with external database
  3. OpenShift AAP Architecture - Reference architecture
  4. Cross-Cluster Replication - Setup DC1 → DC2 replication
  5. DR Testing Guide - Test failover procedures

Alternative:

Validation:


Documentation by Audience

🎯 SRE / Operations Team

Essential Reading:

  1. DR Scenarios - Understand failure modes
  2. Operations Runbook - Day-to-day procedures
  3. DR Testing Guide - Quarterly drill procedures
  4. Troubleshooting Guide - Issue resolution
  5. Scripts README - Automation tools

🎯 Database Administrators

Essential Reading:

  1. Install TPA - Automated PostgreSQL deployment
  2. EDB Operator - CloudNativePG operator
  3. Cross-Cluster Replication - Replication setup
  4. EDB Failover Manager - EFM integration
  5. Split-Brain Prevention - Database safety

🎯 Platform Engineers

Essential Reading:

  1. Main README - Architecture overview
  2. OpenShift AAP Architecture - Platform design
  3. CI/CD Pipeline - Automation workflows
  4. Database Deployment - Kustomize manifests
  5. DR Testing Guide - Testing framework

🎯 Application Developers

Essential Reading:

  1. AAP Deployment Reference - Deployment verification and troubleshooting
  2. AAP Deployment - AAP usage and integration
  3. Red Hat AAP Documentation - Component capabilities and features
  4. Troubleshooting Guide - Common issues
  5. Main README - System architecture

Documentation Status

Status Count Description
✅ Complete 23 Comprehensive, tested, up-to-date
⚠️ Partial 4 Exists but needs expansion (security, monitoring)
❌ Planned 3 Identified in audit, not yet created (GLOSSARY, FAQ, Migration Guide)

Recent Additions (2026-03-31 to 2026-04-03):

  • ✅ DR Testing Guide (10,000+ words)
  • ✅ DR Testing Implementation Summary
  • ✅ Component Testing Results
  • ✅ Split-Brain Prevention Documentation
  • ✅ CI/CD Pipeline Documentation
  • ✅ Documentation Audit Report
  • ✅ Documentation Index (this file)
  • ✅ Contributing Guide (CONTRIBUTING.md)
  • ✅ Scripts Library Reference (2026-04-03)
  • ✅ Scripts Hooks and CI/CD Guide (2026-04-03)
  • ✅ Scripts README reorganization (2026-04-03)
  • ✅ AAP Deployment Reference (2026-04-03) - Deployment-specific configuration and troubleshooting

Next Documentation Priorities:

  1. Security Hardening Guide (Week 2)
  2. Monitoring and Alerting Guide (Week 3)
  3. Backup and Restore Guide (Week 4)
  4. GLOSSARY.md (Month 2)

See Documentation Audit Report for complete roadmap.


Getting Help

For questions or issues:

For contributions:


Feedback

Documentation feedback welcome! Please:

  • Open an issue for corrections or suggestions
  • Submit a PR for improvements
  • Contact SRE team for urgent documentation needs

Last Documentation Review: 2026-03-31 Next Review: 2026-06-30 (quarterly)


This index is maintained by the SRE team. Auto-generated documentation should reference this index for consistency.