🧠 ML Concept Visualizer

Interactive visualizations for machine learning, deep learning, and systems optimization concepts

🚀

AI Accelerators & CPU Architectures

Deep dive into hardware design for machine learning

CISC vs RISC vs VLIW Comparing CPU instruction architectures and parallelism philosophies
TPU v1 Architecture Interactive block diagram of the Google Tensor Processing Unit
TPU v7 Deep Dive Interactive architecture overview of a speculative future TPU generation
Systolic Array Core Visualize how data flows through a rhythmic matrix multiply unit
Array vs. Vector Processors Space-time mapping of data parallelism across different hardware architectures
Vector ILP & Pipeline Throughput Visualizing accumulation of vector chunks and instruction-level parallelism
GPU Warp Basics (SIMT) How SPMD code maps to a warp of hardware threads sharing a Program Counter
Branch Divergence Profiler Simulator to measure throughput and utilization losses in divergent kernels
TPU Programming Model Visualization of the CISC-like instruction set used to program AI accelerators
Vector Unit Architecture On-chip vector processing for activation and pooling operations
PCIe Interface Details How accelerators communicate with the host CPU at high bandwidth

📊

Classical ML & Classification

Traditional machine learning algorithms and metrics

🧩

CNN Architectures

Convolutional neural networks and deep learning

📚

ML Fundamentals

Core concepts and building blocks

⚡

GEMM Optimization

General matrix multiply optimization techniques

🔥

Kernel Optimization

Data reuse and computational efficiency

💾

Cache & Memory Optimization

Memory hierarchy and cache performance

Cache Hierarchy Concepts Understanding the memory hierarchy

🔄

Dataflow Patterns

Hardware execution patterns for neural networks

📈

Performance Modeling

Analyzing and predicting system performance

Roofline Model Visualize performance bounds and bottlenecks