GPU
Posts under GPU.
Featured in this category
How Startups Should Choose: Serverless GPU vs Dedicated GPU
A practical guide to choosing between serverless GPUs and dedicated GPUs for startups, based on cost structure, delivery speed, performance predictability, operations burden, and team maturity.
KAI-Scheduler vs HAMi: Two Ways to Share GPUs in Kubernetes (Soft vs Hard Isolation)
An engineering-oriented comparison of KAI-Scheduler’s Reservation Pod approach and HAMi’s hard isolation path, including trade-offs, failure modes (noisy neighbor), and how the two layers can complement each other.
hetGPU: Chasing Cross-Vendor GPU Binary Compatibility
An engineering-oriented guide to hetGPU: how a compiler + runtime stack can make one GPU binary run across NVIDIA/AMD/Intel/Tenstorrent, including SIMT vs MIMD, memory model gaps, and live kernel migration.
Kubernetes GPU Virtualization Explained Through gpu-manager Startup Flow
A deep dive into Kubernetes GPU virtualization through the gpu-manager startup flow, including device interception, topology awareness, scheduling, and allocation mechanics.
All posts in this category
Browse the full archive in reverse chronological order.
GPU Overprovisioning Solutions: From Oversubscription and Sharing to Isolation
A practical guide to GPU overprovisioning strategies, including scheduler-level oversubscription, time slicing, memory controls, MIG, vGPU, queue backfill, and operational guardrails.
How Startups Should Choose: Serverless GPU vs Dedicated GPU
A practical guide to choosing between serverless GPUs and dedicated GPUs for startups, based on cost structure, delivery speed, performance predictability, operations burden, and team maturity.
KAI-Scheduler vs HAMi: Two Ways to Share GPUs in Kubernetes (Soft vs Hard Isolation)
An engineering-oriented comparison of KAI-Scheduler’s Reservation Pod approach and HAMi’s hard isolation path, including trade-offs, failure modes (noisy neighbor), and how the two layers can complement each other.
hetGPU: Chasing Cross-Vendor GPU Binary Compatibility
An engineering-oriented guide to hetGPU: how a compiler + runtime stack can make one GPU binary run across NVIDIA/AMD/Intel/Tenstorrent, including SIMT vs MIMD, memory model gaps, and live kernel migration.
Kubernetes GPU Virtualization Explained Through gpu-manager Startup Flow
A deep dive into Kubernetes GPU virtualization through the gpu-manager startup flow, including device interception, topology awareness, scheduling, and allocation mechanics.