17_04: Karpenter Spot Interruption Handling
Step-01: Introduction
In the previous demo (17_03), we learned how to provision Spot instances with Karpenter and achieve 70% cost savings. But there's one critical question we didn't answer:
"What happens when AWS reclaims your Spot instance?"
In this demo, we'll see Karpenter's interruption handling in action - the mechanism that makes Spot instances production-ready by gracefully handling interruptions with zero downtime.
What You'll Learn
- How Karpenter detects Spot interruptions via SQS queue
- Graceful pod eviction and rescheduling in real-time
- How PodDisruptionBudgets maintain availability during interruptions
- Simulating interruptions for testing
- Zero-downtime migration strategies
Stages 1-6: Karpenter Spot Instances - Interruption Handling
Understanding Spot Interruptions
The 2-Minute Warning:
T = 0s: AWS decides to reclaim the instance
T = 0s: Interruption warning sent (EventBridge → SQS)
T = 120s: Instance terminates (no exceptions!)
Without Karpenter:
- ❌ Pods get hard-killed after 2 minutes
- ❌ Service disruption
- ❌ Failed requests
With Karpenter:
- ✅ Graceful pod termination
- ✅ Automatic rescheduling to healthy nodes
- ✅ Zero downtime
Step-02: How Karpenter Handles Interruptions
The Flow:
1. AWS sends interruption warning → EventBridge → SQS Queue
2. Karpenter polls SQS (every 10 seconds), detects message
3. Karpenter cordons node (stops new pod scheduling)
4. Karpenter provisions replacement node (proactive!)
5. Karpenter drains node (respects PodDisruptionBudgets)
6. Kubernetes reschedules pods to new node
7. Old node terminates after pods are safe
Key Point: Karpenter starts provisioning the new node BEFORE draining the old one - this is why there's zero downtime!
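The message Karpenter pulls off the queue is the EventBridge Spot warning event (the same shape we hand-craft in Step-07). A minimal local sketch of extracting the two fields the handler acts on, using a hypothetical instance ID:

```shell
# Sample EventBridge Spot interruption event (instance ID is a made-up example)
EVENT='{"detail-type":"EC2 Spot Instance Interruption Warning","detail":{"instance-id":"i-0abc123def456","instance-action":"terminate"}}'

# Pull out the fields Karpenter keys on (sed sketch; jq works just as well)
ACTION=$(printf '%s' "$EVENT" | sed -n 's/.*"instance-action":"\([^"]*\)".*/\1/p')
INSTANCE=$(printf '%s' "$EVENT" | sed -n 's/.*"instance-id":"\([^"]*\)".*/\1/p')
echo "instance=$INSTANCE action=$ACTION"
```

Once the action is `terminate`, Karpenter maps the instance ID back to a NodeClaim and kicks off the cordon-and-drain flow above.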
Step-03: Prerequisites
Ensure you have the following from the previous demos:
✅ Karpenter installed with interruption queue (17_01)
✅ Spot NodePool deployed (17_01)
✅ SQS queue connected to EventBridge (17_01)
Quick verification:
# 1. Check Karpenter is running
kubectl get pods -n kube-system -l app.kubernetes.io/name=karpenter
# 2. Verify interruption queue configured
helm get values karpenter -n kube-system | grep interruptionQueue
# Expected: interruptionQueue: retail-dev-eksdemo1
# 3. Check SQS queue exists
aws sqs list-queues | grep -i retail-dev-eksdemo1
Step-04: Review Test Application
We'll deploy a simple app with 5 replicas and a PodDisruptionBudget to ensure availability during interruptions.
Key configuration in Spot_Interruption_Handling.yaml:
apiVersion: apps/v1
kind: Deployment
metadata:
  name: spot-test-app
  namespace: default
spec:
  replicas: 5
  selector:
    matchLabels:
      app: spot-test
  template:
    metadata:
      labels:
        app: spot-test
    spec:
      nodeSelector:
        karpenter.sh/capacity-type: spot   # ← Force Spot nodes
      terminationGracePeriodSeconds: 30    # ← Allow graceful shutdown
      containers:
        - name: nginx
          image: nginx:alpine
          ports:
            - containerPort: 80
          resources:
            requests:
              cpu: 100m
              memory: 128Mi
            limits:
              cpu: 200m
              memory: 256Mi
---
# PodDisruptionBudget - THE KEY TO ZERO DOWNTIME
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: spot-test-app-pdb
  namespace: default
spec:
  minAvailable: 3   # ← Keep at least 3 pods running during disruptions
  selector:
    matchLabels:
      app: spot-test
Why PodDisruptionBudget (PDB) is critical:
- Without PDB: all 5 pods could be evicted immediately → service down!
- With PDB (minAvailable: 3): at most 2 pods evicted at a time → service stays up!
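The arithmetic behind that guarantee, sketched with this demo's numbers:

```shell
# With 5 replicas and minAvailable: 3, the eviction API permits at most
# replicas - minAvailable voluntary disruptions at any moment.
REPLICAS=5
MIN_AVAILABLE=3
MAX_EVICTABLE=$((REPLICAS - MIN_AVAILABLE))
echo "Pods evictable at once: $MAX_EVICTABLE"   # Pods evictable at once: 2
```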
Step-05: Deploy Test Application
# Deploy the test app
cd 17_04_Karpenter_Spot_Interruption_Handling
kubectl apply -f kube-manifests-Spot-Interruption-Handling/Spot_Interruption_Handling.yaml
# Output
deployment.apps/spot-test-app created
poddisruptionbudget.policy/spot-test-app-pdb created
Verify pods are running:
# Watch pods get scheduled (takes ~1-2 minutes)
kubectl get pods -l app=spot-test -o wide
# Expected: All 5 pods Running on a Spot node
# NAME READY STATUS NODE
# spot-test-app-xxx 1/1 Running ip-10-0-11-246.ec2.internal
# spot-test-app-yyy 1/1 Running ip-10-0-11-246.ec2.internal
# ... (3 more)
Checkpoint: ✅ 5 pods running on Spot node
Step-06: Prepare Monitoring (Open 4 Terminals)
Before triggering the interruption, open 4 terminal windows to watch the magic:
Terminal 1: Karpenter Logs (Filtered)
kubectl logs -n kube-system -l app.kubernetes.io/name=karpenter -f | \
grep -E "interrupt|cordon|drain"
Terminal 2: Node Status
kubectl get nodes -l karpenter.sh/capacity-type=spot -w
Terminal 3: Pod Status
kubectl get pods -l app=spot-test -o wide -w
Terminal 4: NodeClaims
kubectl get nodeclaims -w
Step-07: Simulate Spot Interruption
Open a 5th terminal and send the interruption message:
# Get the Spot instance ID
SPOT_INSTANCE_ID=$(kubectl get nodes -l karpenter.sh/capacity-type=spot -o json | \
jq -r '.items[0].spec.providerID' | cut -d'/' -f5)
echo "Target Instance: $SPOT_INSTANCE_ID"
# Get SQS queue URL
CLUSTER_NAME="retail-dev-eksdemo1"
QUEUE_URL=$(aws sqs get-queue-url --queue-name $CLUSTER_NAME --query QueueUrl --output text)
# Send interruption message
aws sqs send-message \
  --queue-url "$QUEUE_URL" \
  --message-body "{
    \"version\": \"0\",
    \"id\": \"test-interrupt-$(date +%s)\",
    \"detail-type\": \"EC2 Spot Instance Interruption Warning\",
    \"source\": \"aws.ec2\",
    \"account\": \"123456789012\",
    \"time\": \"$(date -u +%Y-%m-%dT%H:%M:%SZ)\",
    \"region\": \"us-east-1\",
    \"resources\": [
      \"arn:aws:ec2:us-east-1:123456789012:instance/$SPOT_INSTANCE_ID\"
    ],
    \"detail\": {
      \"instance-id\": \"$SPOT_INSTANCE_ID\",
      \"instance-action\": \"terminate\"
    }
  }"
echo "✅ Interruption message sent!"
echo "🔍 Watch your 4 monitoring terminals..."
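To sanity-check the providerID parsing used above without touching a cluster (the instance ID below is a hypothetical example value):

```shell
# A Karpenter-provisioned node's spec.providerID has the form
# aws:///<availability-zone>/<instance-id>, so field 5 of a '/'-split is the ID.
PROVIDER_ID="aws:///us-east-1a/i-0123456789abcdef0"   # hypothetical value
INSTANCE_ID=$(printf '%s' "$PROVIDER_ID" | cut -d'/' -f5)
echo "$INSTANCE_ID"   # i-0123456789abcdef0
```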
Step-08: Watch the Magic Happen ✨
Now watch your 4 terminals! Here's what you'll see:
Terminal 1: Karpenter Logs
{"level":"INFO","message":"initiating delete from interruption message",
"queue":"retail-dev-eksdemo1","messageKind":"spot_interrupted",
"NodeClaim":{"name":"spot-nodepool-xjqfh"},"action":"CordonAndDrain",
"Node":{"name":"ip-10-0-11-246.ec2.internal"}}
Key events:
- ✅ Message detected within 10-20 seconds
- ✅ Node cordoned (no new pods)
- ✅ Drain initiated
Terminal 2: Node Status
# T+0s: Original node running
ip-10-0-11-246.ec2.internal Ready 34m
# T+40s: New replacement node appears!
ip-10-0-11-246.ec2.internal Ready 41m
ip-10-0-12-253.ec2.internal NotReady 0s ← NEW NODE!
# T+60s: New node ready
ip-10-0-12-253.ec2.internal Ready 19s
# T+2m30s: Old node draining
ip-10-0-11-246.ec2.internal NotReady 43m
# T+3m: Old node deleted
ip-10-0-12-253.ec2.internal Ready 4m44s ← Only new node remains
Key observation: ✅ New node ready BEFORE old node fully drained = zero downtime!
Terminal 3: Pod Status
# T+0s: All pods on old node
spot-test-app-xxx 1/1 Running ip-10-0-11-246.ec2.internal
spot-test-app-yyy 1/1 Running ip-10-0-11-246.ec2.internal
... (5 total)
# T+40s: First 2 pods evicted (PDB allows max 2 at a time)
spot-test-app-xxx 0/1 Completed ip-10-0-11-246.ec2.internal
spot-test-app-yyy 1/1 Terminating ip-10-0-11-246.ec2.internal
spot-test-app-aaa 0/1 Pending <none> ← Replacement pods!
spot-test-app-bbb 0/1 Pending <none>
# T+60s: New pods scheduled to new node
spot-test-app-aaa 1/1 Running ip-10-0-12-253.ec2.internal
spot-test-app-bbb 1/1 Running ip-10-0-12-253.ec2.internal
# T+2m: Remaining 3 pods evicted and replaced
... (similar pattern)
# T+3m: All 5 pods running on new node ✅
spot-test-app-aaa 1/1 Running ip-10-0-12-253.ec2.internal
spot-test-app-bbb 1/1 Running ip-10-0-12-253.ec2.internal
... (5 total on NEW node)
Key observation: ✅ Always 3+ pods running (thanks to PDB) = zero downtime!
Terminal 4: NodeClaims
# Old NodeClaim
spot-nodepool-xjqfh t2.small spot ip-10-0-11-246.ec2.internal True
# New NodeClaim appears
spot-nodepool-s68tk t2.small spot Unknown
spot-nodepool-s68tk t2.small spot ip-10-0-12-253.ec2.internal True
# Old NodeClaim deleted
spot-nodepool-s68tk t2.small spot ip-10-0-12-253.ec2.internal True ← Only new
Step-09: Verify Success
After ~2-3 minutes, verify everything worked:
# Check all pods running
kubectl get pods -l app=spot-test -o wide
# Expected output:
# NAME READY STATUS RESTARTS NODE
# spot-test-app-xxx 1/1 Running 0 ip-10-0-12-253.ec2.internal
# ... (5 pods total, all Running, RESTARTS=0)
Success indicators:
- ✅ All 5 pods Running
- ✅ RESTARTS: 0 (clean migration, no crashes)
- ✅ All on the new node (different IP from the original)
- ✅ Old node deleted
# Verify only new Spot node exists
kubectl get nodes -l karpenter.sh/capacity-type=spot
# Expected: Only 1 node (the new one)
# NAME STATUS AGE
# ip-10-0-12-253.ec2.internal Ready 4m44s
Timeline summary:
- ⚡ Detection: 10-20 seconds
- ⚡ New node provisioned: 30-40 seconds
- ⚡ Full migration: ~2-3 minutes
- ⚡ Downtime: ZERO (PDB kept 3 pods running throughout)
Step-10: Why This Worked - The Secret Sauce
The PodDisruptionBudget (PDB) - The Hero
apiVersion: policy/v1
kind: PodDisruptionBudget
spec:
  minAvailable: 3   # ← This is what prevented downtime!
What PDB does:
Without PDB:
T+20s: Karpenter drains node
→ All 5 pods evicted immediately
→ 0/5 pods running ← SERVICE DOWN! ❌
T+60s: New node ready, pods rescheduled
→ 5/5 pods running ← 40 seconds of downtime!
With PDB (minAvailable: 3):
T+20s: Karpenter drains node
→ PDB blocks: "You can only evict 2 pods, must keep 3 running!"
→ 2 pods evicted, 3 stay running ← SERVICE UP! ✅
T+40s: New node ready
→ 2 replacement pods start
→ Now 5/5 pods running (3 old + 2 new)
T+60s: PDB allows evicting remaining 3 pods (replacements ready)
→ All 5 pods now on new node ← ZERO downtime! ✅
The formula: max pods evicted at once = replicas - minAvailable = 5 - 3 = 2.
Other Key Components
1. terminationGracePeriodSeconds: 30
- Gives pods 30 seconds to shut down gracefully
- Nginx handles this automatically (it stops accepting new connections and completes in-flight requests)
- Must be < 120s (the Spot interruption window)
2. Diverse instance types
- Karpenter can pick from multiple instance families
- If t3 Spot is unavailable, it tries t3a, t2, c5a, etc.
- Increases the chance of finding replacement capacity quickly
Step-11: Clean Up
# Delete the test deployment
kubectl delete -f kube-manifests-Spot-Interruption-Handling/Spot_Interruption_Handling.yaml
# Output
deployment.apps "spot-test-app" deleted
poddisruptionbudget.policy "spot-test-app-pdb" deleted
Karpenter will automatically clean up the unused Spot node in ~30-60 seconds.
# Watch automatic cleanup
kubectl get nodes -l karpenter.sh/capacity-type=spot -w
# After ~30s, the node will be deleted (consolidation)
Verify complete cleanup:
kubectl get nodes -l karpenter.sh/capacity-type=spot
# Expected: No resources found
kubectl get nodeclaims
# Expected: No resources found (or only on-demand nodes)
Step-12: Production Best Practices
Now that you've seen it work, here's how to use this in production:
1. Always Use PodDisruptionBudgets
For any production deployment:
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: my-app-pdb
spec:
  minAvailable: 2   # At least 2 pods for HA
  selector:
    matchLabels:
      app: my-app
Or for StatefulSets:
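A sketch using maxUnavailable, which caps disruptions at one pod at a time regardless of replica count (the name and labels below are placeholders):

```yaml
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: my-statefulset-pdb   # placeholder name
spec:
  maxUnavailable: 1   # evict at most 1 pod at a time
  selector:
    matchLabels:
      app: my-statefulset   # placeholder label
```

maxUnavailable tends to fit StatefulSets better than minAvailable because ordered, one-at-a-time disruption matches how StatefulSets roll pods anyway.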
2. Set Appropriate Grace Periods
| Workload Type | Recommended |
|---|---|
| Stateless API | 30s |
| WebSocket server | 60s |
| Batch job | 90s |
Never exceed 90s - you need a buffer before AWS force-terminates at 120s!
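The buffer math, sketched for the 90s worst case from the table:

```shell
# Spot gives 120s from warning to termination; the grace period eats into it,
# and the remainder must cover detection, cordon, and drain setup.
SPOT_WINDOW=120
GRACE_PERIOD=90   # worst case from the table above
BUFFER=$((SPOT_WINDOW - GRACE_PERIOD))
echo "Buffer remaining: ${BUFFER}s"   # Buffer remaining: 30s
```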
3. Handle Termination Signals Gracefully
Your application needs time to shut down gracefully:
- ✅ Stop accepting new connections
- ✅ Complete in-flight requests
- ✅ Close database connections
- ✅ Flush logs/metrics
Most web servers (nginx, Apache) handle this automatically - they respond to SIGTERM by gracefully shutting down.
For custom applications, ensure your code handles termination signals properly.
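A minimal shell sketch of SIGTERM handling (the same idea applies in any language; the worker loop here is simulated, and the signal is sent to the script itself to mimic the kubelet):

```shell
#!/bin/sh
# Graceful-shutdown sketch: set a flag on SIGTERM, finish work, then exit
SHUTDOWN=0
on_term() {
  echo "SIGTERM received: stop accepting work, drain in-flight requests"
  SHUTDOWN=1
}
trap on_term TERM

kill -TERM $$   # simulate the kubelet's termination signal

# A real worker loop would check the flag between units of work
if [ "$SHUTDOWN" -eq 1 ]; then
  echo "cleanup complete, exiting cleanly"
fi
```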
4. Mix Spot and On-Demand for Critical Apps
Best practice for production:
# 60% on Spot (cost savings)
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app-spot
spec:
  replicas: 3
  template:
    spec:
      nodeSelector:
        karpenter.sh/capacity-type: spot
---
# 40% on On-Demand (stability)
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app-ondemand
spec:
  replicas: 2
  template:
    spec:
      nodeSelector:
        karpenter.sh/capacity-type: on-demand
Result:
- 60% cost savings from Spot
- 40% guaranteed capacity from On-Demand
- Even if all Spot nodes are interrupted simultaneously, 2 pods stay up!
5. Use Diverse Instance Types
# In your Spot NodePool
requirements:
  - key: karpenter.k8s.aws/instance-family
    operator: In
    values: ["t3", "t3a", "t2", "c5a", "c6a", "m5"]   # ← Multiple options
Why: If t3 Spot is unavailable, Karpenter tries t3a, then t2, etc. Increases replacement node availability.
Summary
You just proved Spot instances are production-ready with Karpenter!
What you demonstrated:
- ✅ Karpenter detects interruptions in ~10-20 seconds (SQS polling)
- ✅ Proactively provisions replacement nodes
- ✅ Gracefully migrates pods with zero downtime
- ✅ PodDisruptionBudgets maintain service availability
- ✅ Complete recovery in ~2-3 minutes from interruption
The magic formula: Spot pricing + SQS interruption detection + PodDisruptionBudgets + graceful termination = production-ready, low-cost capacity.
Key takeaway for production:
Spot instances with Karpenter are NOT risky - they're a smart, cost-effective choice when you use PodDisruptionBudgets and configure appropriate grace periods for your applications.