A practical guide to GPU overprovisioning strategies, including scheduler-level oversubscription, time slicing, memory controls, MIG, vGPU, queue backfill, and operational guardrails.
CFN Cloud
Cloud-native notes on Kubernetes, platform engineering, and modern infrastructure.
Popular topics
Featured reads
A smaller front shelf of pieces worth starting with.
A practical guide to choosing between serverless GPUs and dedicated GPUs for startups, based on cost structure, delivery speed, performance predictability, operations burden, and team maturity.
A comprehensive deep dive into Linux glibc (ptmalloc2) heap memory allocation and reclamation strategies. Explores Arenas, Chunks, Bins (Fast, Small, Large, Unsorted) data structures, and the principles of classic vulnerabilities such as Use-After-Free.
A practical comparison of five AI agent frameworks - OpenClaw, ZeroClaw, PicoClaw, Nanobot, and IronClaw - covering size, architecture, security tradeoffs, and adoption fit.
An engineering-oriented comparison of KAI-Scheduler’s Reservation Pod approach and HAMi’s hard isolation path, including trade-offs, failure modes (noisy neighbor), and how the two layers can complement each other.
An engineering-oriented guide to hetGPU: how a compiler + runtime stack can make one GPU binary run across NVIDIA/AMD/Intel/Tenstorrent, including SIMT vs MIMD, memory model gaps, and live kernel migration.
Reading tracks
Start with basics, then move into operations, tradeoffs, and troubleshooting.
Kubernetes
A curated reading track for Kubernetes.
GPU
A curated reading track for GPU.
System
A curated reading track for System.
Recent writing
New notes, guides, and long-form pieces from the main archive.
Running Stateful Apps on Kubernetes: Storage, Identity, and Operations
A practical guide to stateful applications on Kubernetes, covering storage choices, stable identities, rollout concerns, and failure handling.
Deploying MySQL on Kubernetes: Stateful Basics, Storage, and Services
Learn the essentials of running MySQL on Kubernetes, including StatefulSets, persistent storage, Services, and operational tradeoffs.
Kubernetes Declarative Configuration Explained: Apply, Drift, and Reconciliation
Understand declarative configuration in Kubernetes, why desired state matters, and how apply, diff, and reconciliation shape safe operations.
Kubernetes Canary Releases Explained: Safer Rollouts and Traffic Control
Learn practical canary release patterns in Kubernetes, how to reduce rollout risk, and which signals to watch before promoting traffic.
Kubernetes Service Explained: ClusterIP, NodePort, LoadBalancer, and DNS
Learn how Kubernetes Services provide stable networking for Pods, how service types differ, and how to troubleshoot selectors, endpoints, and traffic flow.
Kubernetes Namespaces Explained: Isolation, Scope, and Team Boundaries
Learn how Kubernetes Namespaces organize resources, scope policies and quotas, and support safer multi-team or multi-environment clusters.
Kubernetes Pods Explained: Lifecycle, Scheduling, and Debugging Basics
Understand what a Pod really is in Kubernetes, how Pods are scheduled and restarted, and which commands help you debug them.
Kubernetes Deployment vs ReplicaSet: Rolling Updates and Desired State
Learn how Deployments and ReplicaSets work together in Kubernetes, how rolling updates happen, and how to debug rollout and selector problems.
Install Minikube and Start a Local Cluster
Get a working single-node cluster fast with Minikube.
K3s Quick Start: Lightweight Kubernetes Setup and Core Concepts
Learn when to use K3s, how to install it quickly, and how to run core Kubernetes workloads on a lightweight cluster for labs and edge environments.
Kubernetes Introduction: What It Solves and How to Think About It
A practical introduction to Kubernetes: what problems it solves, what it does not solve, and how desired state and controllers work in real clusters.
Kubernetes Architecture Explained: Control Plane, Nodes, and Reconciliation
Understand Kubernetes architecture from the control plane to worker nodes, including the API server, scheduler, controllers, kubelet, and reconciliation loops.
Kubernetes Learning Path: A Practical Quickstart Course Guide
A step-by-step Kubernetes learning path covering core concepts, workloads, networking, storage, and troubleshooting so you can study in the right order.
Kubernetes Basics: Pods, Deployments, Services, and Namespaces Explained
Learn the four Kubernetes basics that matter most in practice: Pods, Deployments, Services, and Namespaces, plus the declarative workflow behind them.