Home About Careers Blog Contact

Engineering Notes

Deep dives into the systems we build. No fluff, no marketing — just technical writing from the team in the trenches.

Jun 2026 GPU Systems

Why We Ditched Kubernetes for GPU Workloads

K8s was designed for stateless microservices, not 8-GPU nodes with NVLink topologies. Here's what we built instead — and why bare-metal scheduling outperforms container orchestration by 40% on inference throughput.

Read article →
Apr 2026 Distributed Systems

CRDTs for Agent State: Eventual Consistency at Scale

When you have 10,000 AI agents coordinating across regions, strong consistency is a luxury you can't afford. We use operation-based CRDTs for state management — here's the architecture.

Read article →
Mar 2026 Infrastructure

Predictive Thermal Management: ML for Our Own Hardware

We trained a lightweight model to predict thermal throttling 90 seconds before it happens. By preemptively migrating workloads, we eliminated 99.7% of thermal-induced performance drops.

Read article →
Jan 2026 Launch

Hello, World. We're MetalBear.

Why we started MetalBear, what problems we're solving, and our vision for the future of AI infrastructure. The manifesto for building things that never break.

Read article →

Stay in the loop

Engineering updates, no more than once a month. No spam, unsubscribe anytime.