Перейти до вмісту

AI Infrastructure

Цей контент ще не доступний вашою мовою.

AI/ML Engineering Track | Phase 6

Best for: learners moving from model/application work into inference, cost, and infrastructure decisions.

This phase is intentionally split between:

  • learner-scale local infrastructure
  • production-oriented AI infra concepts

That means not every module should be read in pure numeric order if your immediate goal is local-first work.

#Module
1.1Cloud AI Services
1.2AIOps
1.3High-Performance LLM Inference: vLLM and sglang
1.4Local Inference Stack for Learners
1.5Home AI Operations and Cost Model
1.6GPU Memory Hierarchy and Bandwidth Math for LLM Inference
1.7Production-Tier LLM Inference Engines: Decision Framework
1.8Benchmarking LLM Inference: TTFT, TPOT, and Workload-Aware Load Shaping

This section is not only about datacenter-scale AI. It also teaches when a learner should stay simple, local, and private instead of prematurely copying large-scale serving architecture.