
Course Outline

1. Introduction to Distributed PostgreSQL

  • Scaling challenges associated with single-node PostgreSQL
  • Overview of the Citus extension: purpose, architecture, and components
  • Key concepts: coordinator node, worker nodes, metadata, and distribution keys

2. Cluster Architecture and Setup

  • Node types: coordinator versus workers
  • Table types: distributed, reference (replicated), and local tables
  • Installing and configuring Citus within existing PostgreSQL environments
  • Cluster discovery and node management

3. Data Distribution and Sharding Strategies

  • Sharding methods: hash versus append
  • Selecting a distribution column to achieve optimal performance
  • Managing distributed and replicated tables
  • Rebalancing shards and scaling out
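
As a rough illustration of the hash-sharding idea listed above, the following Python sketch splits a 32-bit hash space into equal ranges and maps a distribution-key value to the shard that owns its hash. It is a conceptual model only: the CRC32 hash, the shard count, the round-robin placement, and the names `shard_ranges`, `shard_for`, and `placement` are assumptions made for this example, not Citus internals.

```python
import zlib

HASH_SPACE = 2 ** 32   # size of the assumed 32-bit hash space
SHARD_COUNT = 32       # assumed shard count, chosen only for illustration


def shard_ranges(shard_count=SHARD_COUNT):
    """Split the hash space into contiguous (low, high) ranges, one per shard."""
    width = HASH_SPACE // shard_count
    ranges = []
    for i in range(shard_count):
        low = i * width
        # The last shard absorbs any remainder so the whole space is covered.
        high = HASH_SPACE - 1 if i == shard_count - 1 else low + width - 1
        ranges.append((low, high))
    return ranges


def shard_for(key, shard_count=SHARD_COUNT):
    """Return the index of the shard whose hash range contains the key's hash."""
    # CRC32 is a stand-in hash for this sketch, not the function Citus uses.
    h = zlib.crc32(str(key).encode("utf-8"))
    width = HASH_SPACE // shard_count
    return min(h // width, shard_count - 1)


def placement(shard_index, workers):
    """Assign a shard to a worker round-robin (a deliberate simplification)."""
    return workers[shard_index % len(workers)]


if __name__ == "__main__":
    workers = ["worker-1", "worker-2", "worker-3"]  # hypothetical node names
    for tenant in ("tenant-1", "tenant-2", "tenant-3"):
        idx = shard_for(tenant)
        print(tenant, "-> shard", idx, "on", placement(idx, workers))
```

Because the mapping depends only on the key's hash, rows sharing a distribution-key value always land in the same shard, which is why the choice of distribution column matters so much for co-location and query routing.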

4. Distributed Query Execution and Optimization

  • How Citus routes and parallelizes queries
  • Understanding distributed query plans
  • Query pushdown and execution optimization

5. Consistency, Transactions, and Fault Tolerance

  • Two-Phase Commit (2PC) and atomic operations
  • Handling failures in distributed transactions
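
The two-phase commit flow named above can be sketched with a toy, in-memory model. This is not how Citus implements it; the `Participant` class and `two_phase_commit` function are hypothetical names used only to show the protocol's shape: every participant must vote yes in the prepare phase before any participant commits, and a single no vote rolls back everything already prepared. In PostgreSQL the corresponding statements are `PREPARE TRANSACTION`, `COMMIT PREPARED`, and `ROLLBACK PREPARED`.

```python
class Participant:
    """Toy stand-in for a worker node taking part in a distributed transaction."""

    def __init__(self, name, can_commit=True):
        self.name = name
        self.can_commit = can_commit  # simulates whether prepare succeeds
        self.state = "idle"

    def prepare(self):
        # Phase 1: vote yes (and enter the prepared state) or vote no.
        self.state = "prepared" if self.can_commit else "aborted"
        return self.can_commit

    def commit(self):
        # Phase 2, success path: make the prepared work durable.
        self.state = "committed"

    def rollback(self):
        # Phase 2, failure path: discard the prepared work.
        self.state = "aborted"


def two_phase_commit(participants):
    """Commit on all participants, or on none. Returns True on commit."""
    prepared = []
    for p in participants:
        if p.prepare():
            prepared.append(p)
        else:
            # One "no" vote aborts the transaction everywhere.
            for q in prepared:
                q.rollback()
            return False
    for p in participants:
        p.commit()
    return True


if __name__ == "__main__":
    nodes = [Participant("worker-1"), Participant("worker-2", can_commit=False)]
    ok = two_phase_commit(nodes)
    print(ok, [(n.name, n.state) for n in nodes])
```

The sketch leaves out the hard part covered under failure handling: a coordinator or worker that crashes between the two phases leaves prepared transactions behind, and these must be resolved during recovery.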

6. Operational Management and Use Cases

  • Monitoring tools and views for Citus
  • Maintenance and upgrades in distributed environments

Requirements

  • Completion of Advanced Administration (High Availability & Replication) or equivalent experience
  • Strong understanding of PostgreSQL configuration and performance tuning
  • Familiarity with Linux and fundamental network concepts

Audience

This course is designed for experienced Database Administrators, DevOps Engineers, and System Architects who currently manage production PostgreSQL environments and require horizontal scaling capabilities.

Duration: 7 hours
