Core Model Overview
A four-layer model (Yin-Yang, Five Elements, Yun, Qi) for understanding AI infrastructure as an evolving organic system
Recent content in Jimmy Song 的博客 on Jimmy Song
A four-layer model (Yin-Yang, Five Elements, Yun, Qi) for understanding AI infrastructure as an evolving organic system
Understanding system tensions: expansion vs. constraint, innovation vs. governance, speed vs. stability in AI infrastructure
Five system roles: data, models, compute, platforms, and hardware—how they interact and balance in AI infrastructure
System evolution stages: exploration, platform, scale, and rebalancing phases in AI infrastructure growth
Effective flow and pressure distribution in systems—data flow, signal propagation, and system health monitoring
Integrating Yin-Yang, Five Elements, Yun, and Qi layers to explain complex AI infrastructure system behavior
Practical principles for applying the Yin-Yang Five Elements Qi model in GPU scheduling, Agent Runtime, and platform governance
Five-dimensional diagnosis framework for AI infrastructure health: element balance, flow smoothness, tension dynamics, stage alignment, and runaway warnings
Core value and applications of the Yin-Yang Five Elements Qi-Yun model for AI infrastructure architects
Core definition, boundaries, and evaluation criteria for AI-native infrastructure, focusing on model behavior, compute scarcity, and uncertainty governance.
Three planes (Intent, Execution, Governance) + closed-loop feedback for AI-native infrastructure architecture alignment.
Discussing Intent vs Consequence, why compute and cost are the first-order constraints of AI-native infrastructure.
Analyzing the closed-loop governance of metrics, budgets, isolation, and sharing in AI-native infrastructure, and explaining how SLO maps to cost and risk.
Redrawing boundaries across platform, infra, ML, and security, and transforming accountability and collaboration in the AI era.
An actionable roadmap for AI-native migration, covering bypass pilot, domain isolation, AI-first refactoring, and anti-patterns, with focus on governance loops and organizational contracts.
Bilingual glossary of core AI-native infrastructure terminology for aligning organizational language.
Ten critical questions for CEO/CTO to evaluate AI-native infrastructure readiness.
HAMi moves from cluster to desktop with Olares.
From Anji bamboo weaving to industrialization
A GPU explainer for Kubernetes veterans new to AI. Maps token, model, training, inference, Transformer, Tensor Core, HBM, and KV cache to concepts you already know.
From GPU utilization to productive GPU-hours.
A practical AI Infra review of Agentic AI reliability, covering a five-dimension framework, fault tolerance, recovery, observability, and hybrid architecture design.
From GPU hardware, Kubernetes scheduling, inference engines to token cost — understanding the 8-layer observability architecture for modern AI infrastructure.
How I built a personal AI infrastructure using ChatGPT, OpenClaw, Obsidian, GitHub, Lark, GLM-5.1, and a Mac mini M4.
The Linux Foundation's Tokenomics Foundation signals a shift: tokens are becoming a core resource in the AI era, much like CPUs in the cloud era.
AI Native Landscape has moved to landscape.jimmysong.io with 600+ curated open-source projects, AI skill search support, and a call for community contributions.
A practitioner's perspective on AI infrastructure trends: evolving bottlenecks, roles of CPU/GPU/scheduling, ecosystem shifts, and compute demand across training, inference, and Agent workloads.
Observations on the evolution of AI infrastructure control planes, focusing on HAMi v2.9, GPU scheduling, and Kubernetes resource models.
At KubeCon EU 2026, I witnessed Kubernetes' anxiety and transformation in the AI era. This article explores the challenges and future opportunities for Kubernetes in the age of AI.
KubeCon Europe 2026 Day One: How Kubernetes is adapting to the AI infrastructure wave and the evolution of the GPU resource layer.
A systematic upgrade to HAMi’s website and docs, improving community visibility, content structure, search, and usability.
On the eve of GTC 2026, rethinking whether AI is becoming the new infrastructure from NVIDIA's AI Five-Layer Cake, the rise of agent runtime, to AI-native infrastructure.
A CTO/VP view on open GPU scheduling: CDI, Kubernetes DRA, virtualization data planes, ecosystem governance, and lock-in risk.
A curated collection of AI learning resources we removed from the AI Resources list: awesome lists, courses, tutorials, and cookbooks. These educational materials deserve their own spotlight.
Before ChatGPT and TensorFlow, there was Hadoop, Kafka, and Kubernetes. This post honors the traditional open source infrastructure that became the foundation of today's AI revolution.
Observations from my first month at Dynamia: From cloud native to AI Native Infra, why this direction is worth investing in, and the key issues and opportunities in compute governance.
Exploring how Spec becomes the governable core asset in Agent-Driven Development (ADD) and the trend toward control-plane engineering systems.
Comparing Miaoyan, Zhipu, and Shandianshuo voice input methods for developers: speed, stability, command capabilities, and cost models.
How technical standards and data sovereignty shape AI open source paths and infrastructure competition in the global AI era.
Joining Dynamia as Open Source Ecosystem VP to drive AI-native infrastructure ecosystem development, transforming compute from hardware consumption to core asset.
A hands-on experience with Verdent's standalone Mac app, exploring how parallel AI agents, isolated workspaces, and task-oriented workflows change real-world development.
A look back at the major changes in 2025: shifting from Cloud Native to AI Native Infrastructure, AI tool ecosystem, and major website improvements.
Manus's acquisition by Meta sparked polarized opinions. This article explores the butterfly effect in AI applications and key lessons for entrepreneurs on growth strategies.
Beijing and Shanghai's open source plans reveal opportunities and challenges for China's AI infrastructure, balancing technology and governance.
In 2025, software engineering shifts from code-centric to runtime and cost governance. AI and Agents move complexity to runtime, compute, and budget layers, reshaping engineering value.
Explores why AI Agents need Kubernetes infrastructure and how Agent orchestration, MCP services, and AI gateways enable production-ready AI architectures.
Comprehensive introduction to the AI Open Source Landscape's positioning, interface, scoring model, and data mechanisms to help developers efficiently discover quality AI projects.
2026 AI's turning point: not models, but infrastructure, agentic runtimes, GPU efficiency, and new organizational forms.
From an engineering and organizer's perspective, real changes at COSCon'25: AI as the default backdrop, discussions returning to engineering issues, and Chinese open source entering a long-term phase.
An analysis of Block's Goose project, why it became one of the first Agentic AI Foundation (AAIF) projects, and what this means for Agentic Runtime and the evolution of AI-Native infrastructure.