Jimmy Song – Jimmy Song 的博客

Recent content in Jimmy Song 的博客 on Jimmy Song

https://jimmysong.io/index.xml (RSS订阅地址)

Core Model Overview

A four-layer model (Yin-Yang, Five Elements, Yun, Qi) for understanding AI infrastructure as an evolving organic system

2026/2/10

The Yin-Yang Layer: Dynamic Balance of System States

Understanding system tensions: expansion vs. constraint, innovation vs. governance, speed vs. stability in AI infrastructure

2026/2/10

Five Elements Layer: Classification and Collaboration of System Roles

Five system roles: data, models, compute, platforms, and hardware—how they interact and balance in AI infrastructure

2026/2/10

The Yun Layer: Stages and Cycles of System Evolution

System evolution stages: exploration, platform, scale, and rebalancing phases in AI infrastructure growth

2026/2/10

Qi Layer: Effective System Flow and Pressure Fields

Effective flow and pressure distribution in systems—data flow, signal propagation, and system health monitoring

2026/2/10

Dynamic Relationship Modeling: Five Elements Flow Under Yin-Yang Balance

Integrating Yin-Yang, Five Elements, Yun, and Qi layers to explain complex AI infrastructure system behavior

2026/2/10

Engineering Practice Guide: Architecture Decisions Guided by Theory

Practical principles for applying the Yin-Yang Five Elements Qi model in GPU scheduling, Agent Runtime, and platform governance

2026/2/10

System Diagnosis Principles: Criteria for Health Status

Five-dimensional diagnosis framework for AI infrastructure health: element balance, flow smoothness, tension dynamics, stage alignment, and runaway warnings

2026/2/10

Conclusion and Outlook

Core value and applications of the Yin-Yang Five Elements Qi-Yun model for AI infrastructure architects

2026/2/10

What Is AI-Native Infrastructure?

Core definition, boundaries, and evaluation criteria for AI-native infrastructure, focusing on model behavior, compute scarcity, and uncertainty governance.

2026/1/18

AI-Native Infrastructure One-Page Reference Architecture: Three Planes + One Loop

Three planes (Intent, Execution, Governance) + closed-loop feedback for AI-native infrastructure architecture alignment.

2026/1/17

Why Start with Compute Governance, Not API Design

Discussing Intent vs Consequence, why compute and cost are the first-order constraints of AI-native infrastructure.

2026/1/18

Operating and Governing AI-Native Infrastructure: Metrics, Budget, Isolation, Sharing, SLO to Cost

Analyzing the closed-loop governance of metrics, budgets, isolation, and sharing in AI-native infrastructure, and explaining how SLO maps to cost and risk.

2026/1/18

Organization and Culture: How the Operating Model Changes

Redrawing boundaries across platform, infra, ML, and security, and transforming accountability and collaboration in the AI era.

2026/1/18

Migration Roadmap: From Cloud Native to AI Native

An actionable roadmap for AI-native migration, covering bypass pilot, domain isolation, AI-first refactoring, and anti-patterns, with focus on governance loops and organizational contracts.

2026/1/18

Glossary

Bilingual glossary of core AI-native infrastructure terminology for aligning organizational language.

2026/1/18

Executive Checklist (10 Questions)

Ten critical questions for CEO/CTO to evaluate AI-native infrastructure readiness.

2026/1/18

Olares and HAMi: A New Inflection Point for Desktop AI Workstations

HAMi moves from cluster to desktop with Olares.

2026/6/24

Every Nation Begins with Textiles

From Anji bamboo weaving to industrialization

2026/6/20

Why GPUs Became the Foundation of AI: A GPU Primer for K8s Veterans

A GPU explainer for Kubernetes veterans new to AI. Maps token, model, training, inference, Transformer, Tensor Core, HBM, and KV cache to concepts you already know.

2026/6/17

GPU Utilization Is Breaking: AI Infrastructure Needs a New Definition of Efficiency

From GPU utilization to productive GPU-hours.

2026/6/17

When an Agent Becomes a Distributed State Machine: Agentic AI Infrastructure Reliability

A practical AI Infra review of Agentic AI reliability, covering a five-dimension framework, fault tolerance, recovery, observability, and hybrid architecture design.

2026/6/16

From GPU to Token: The 8-Layer Observability Stack for AI Infrastructure

From GPU hardware, Kubernetes scheduling, inference engines to token cost — understanding the 8-layer observability architecture for modern AI infrastructure.

2026/6/9

My Personal AI Stack: Building a Continuously Running Personal AI Infrastructure for ~$100/Month

How I built a personal AI infrastructure using ChatGPT, OpenClaw, Obsidian, GitHub, Lark, GLM-5.1, and a Mac mini M4.

2026/6/7

Token Is More Than a Billing Unit, It's Becoming the Resource Unit of the AI Era

The Linux Foundation's Tokenomics Foundation signals a shift: tokens are becoming a core resource in the AI era, much like CPUs in the cloud era.

2026/6/4

AI Native Landscape Launches as a Standalone Site

AI Native Landscape has moved to landscape.jimmysong.io with 600+ curated open-source projects, AI skill search support, and a call for community contributions.

2026/6/4

AI Infra Industry Trends: From Compute Bottlenecks to Ecosystem Evolution

A practitioner's perspective on AI infrastructure trends: evolving bottlenecks, roles of CPU/GPU/scheduling, ecosystem shifts, and compute demand across training, inference, and Agent workloads.

2026/5/31

Kubernetes as the GPU Control Plane: HAMi v2.9 and Next-Gen AI Infra

Observations on the evolution of AI infrastructure control planes, focusing on HAMi v2.9, GPU scheduling, and Kubernetes resource models.

2026/5/14

Kubernetes's Anxiety and Rebirth in the AI Wave

At KubeCon EU 2026, I witnessed Kubernetes' anxiety and transformation in the AI era. This article explores the challenges and future opportunities for Kubernetes in the age of AI.

2026/4/3

Day One in Amsterdam: Kubernetes Is Rethinking AI

KubeCon Europe 2026 Day One: How Kubernetes is adapting to the AI infrastructure wave and the evolution of the GPU resource layer.

2026/3/22

HAMi Website Refactor: Why HAMi Docs and Website Underwent a Complete Redesign

A systematic upgrade to HAMi’s website and docs, improving community visibility, content structure, search, and usability.

2026/3/17

GTC 2026 Eve: AI is Becoming the New Infrastructure

On the eve of GTC 2026, rethinking whether AI is becoming the new infrastructure from NVIDIA's AI Five-Layer Cake, the rise of agent runtime, to AI-native infrastructure.

2026/3/15

When GPUs Move Toward Open Scheduling: Structural Shifts in AI Native Infrastructure

A CTO/VP view on open GPU scheduling: CDI, Kubernetes DRA, virtualization data planes, ecosystem governance, and lock-in risk.

2026/2/13

AI Learning Resources: 44 Curated Collections from Our Cleanup

A curated collection of AI learning resources we removed from the AI Resources list: awesome lists, courses, tutorials, and cookbooks. These educational materials deserve their own spotlight.

2026/2/8

Standing on Giants' Shoulders: The Traditional Infrastructure Powering Modern AI

Before ChatGPT and TensorFlow, there was Hadoop, Kafka, and Kubernetes. This post honors the traditional open source infrastructure that became the foundation of today's AI revolution.

2026/2/8

My First Month at Dynamia: Why AI Native Infra Is Worth the Investment

Observations from my first month at Dynamia: From cloud native to AI Native Infra, why this direction is worth investing in, and the key issues and opportunities in compute governance.

2026/2/6

The True Inflection Point of ADD: When Spec Becomes the Core Asset of AI-Era Software

Exploring how Spec becomes the governable core asset in Agent-Driven Development (ADD) and the trend toward control-plane engineering systems.

2026/1/20

AI Voice Dictation Input Methods Are Becoming the New Shortcut Key for the Programming Era

Comparing Miaoyan, Zhipu, and Shandianshuo voice input methods for developers: speed, stability, command capabilities, and cost models.

2026/1/18

From Spatial Data to AI Open Source: Technical Standards, Data Sovereignty, and the Global Divide

How technical standards and data sovereignty shape AI open source paths and infrastructure competition in the global AI era.

2026/1/11

Joining Dynamia: Embarking on a New Journey in AI Native Infrastructure

Joining Dynamia as Open Source Ecosystem VP to drive AI-native infrastructure ecosystem development, transforming compute from hardware consumption to core asset.

2026/1/7

Running Parallel AI Agents on My Mac: Hands-On with Verdent's Standalone App

A hands-on experience with Verdent's standalone Mac app, exploring how parallel AI agents, isolated workspaces, and task-oriented workflows change real-world development.

2026/1/4

2025 Annual Review: The Transformation Journey from Cloud Native to AI Native

A look back at the major changes in 2025: shifting from Cloud Native to AI Native Infrastructure, AI tool ecosystem, and major website improvements.

2025/12/31

The Butterfly Effect After Manus Was Acquired by Meta

Manus's acquisition by Meta sparked polarized opinions. This article explores the butterfly effect in AI applications and key lessons for entrepreneurs on growth strategies.

2025/12/30

AI Infra Open Source in China: Analysis of Beijing and Shanghai's Plans

Beijing and Shanghai's open source plans reveal opportunities and challenges for China's AI infrastructure, balancing technology and governance.

2025/12/25

From 2025 Onwards, Software Engineering Shifts from Code-Centric to Runtime and Cost-Centric

In 2025, software engineering shifts from code-centric to runtime and cost governance. AI and Agents move complexity to runtime, compute, and budget layers, reshaping engineering value.

2025/12/24

From Cloud Native to AI Native: Why Kubernetes Is the Foundation for Next-Gen AI Agents

Explores why AI Agents need Kubernetes infrastructure and how Agent orchestration, MCP services, and AI gateways enable production-ready AI architectures.

2025/12/24

AI Open Source Landscape: A One-Stop Guide to AI Project Navigation and Scoring System

Comprehensive introduction to the AI Open Source Landscape's positioning, interface, scoring model, and data mechanisms to help developers efficiently discover quality AI projects.

2025/12/23

AI 2026: Infrastructure, Agents, and the Next Cloud-Native Shift

2026 AI's turning point: not models, but infrastructure, agentic runtimes, GPU efficiency, and new organizational forms.

2025/12/19

What I Saw at COSCon'25: The Real State of Open Source in China

From an engineering and organizer's perspective, real changes at COSCon'25: AI as the default backdrop, discussions returning to engineering issues, and Chinese open source entering a long-term phase.

2025/12/18

Decoding Goose: Why It Joined AAIF and What This Means for Agentic Runtime

An analysis of Block's Goose project, why it became one of the first Agentic AI Foundation (AAIF) projects, and what this means for Agentic Runtime and the evolution of AI-Native infrastructure.

2025/12/12