Jinn's Hub

Posts tagged with "MLSys"

    From Python to Silicon — A Compiler & Arch Primer for the Working ML Engineer
    You can write production ML systems for years without knowing what IR, MLIR, LLVM, ISA, or FFI actually mean. This is the patch — a bilingual primer for the undergrad-CS-but-skipped-compilers crowd, with a full HTML deep dive carrying six hand-drawn SVG plates.
    Attention Mechanisms — Full, Sparse, Linear, NSA & GLA
    Breaking down Full, Sparse, and Linear Attention, all the way to DeepSeek NSA and Gated Linear Attention
    TritonForge: Server-based Multi-turn RL for Triton Kernel Generation
    End-to-end server-based RL training and evaluation system for Triton kernel generation across NVIDIA and AMD, built on slime + Megatron
    Transformer Deep Dive (Math + Code)
    Deconstructing the Transformer's self-attention, LayerNorm, and MLP from the perspectives of math, code, and architecture
© 2026 • Jinn's Hub 🔬