Vector Instruction Execution

Executing vadd vc, va, vb

Sequential Pipelining

Axes: Time Flow ($\downarrow$), Space (Parallel Lanes)

Pipeline Status

Throughput: 1 result/cycle
Current cycle: 0

Operand Streams

Vector A ($va$)
Vector B ($vb$)

Pipelined Execution

Input pairs $(A, B)$ enter from the top, one element from each operand stream. The functional unit is divided into pipeline stages, so a new pair can enter every cycle while earlier pairs advance through the later stages.
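The behavior described above can be sketched as a small cycle-by-cycle simulation. This is only an illustrative model, not the actual hardware: the function name `pipelined_vadd` and the pipeline depth `stages=4` are assumptions, since the source does not state how many stages the functional unit has.

```python
def pipelined_vadd(va, vb, stages=4):
    """Simulate `vadd vc, va, vb` on a single pipelined adder.

    `stages` is a hypothetical pipeline depth (not given in the source).
    Returns (vc, cycles), where `cycles` is the total number of cycles used.
    """
    if len(va) != len(vb):
        raise ValueError("va and vb must have the same length")
    if stages < 1:
        raise ValueError("stages must be at least 1")

    n = len(va)
    pipe = [None] * stages  # pipe[0] is the first stage, pipe[-1] the last
    vc = []
    cycles = 0
    issued = 0

    while len(vc) < n:
        # Issue one new operand pair per cycle until both streams are drained.
        entering = None
        if issued < n:
            entering = (va[issued], vb[issued])
            issued += 1

        # Every pair in flight advances one stage; the new pair enters stage 0.
        pipe = [entering] + pipe[:-1]

        # A pair sitting in the last stage produces its result this cycle.
        if pipe[-1] is not None:
            a, b = pipe[-1]
            vc.append(a + b)

        cycles += 1

    return vc, cycles


if __name__ == "__main__":
    va = [1, 2, 3, 4, 5, 6, 7, 8]
    vb = [10, 20, 30, 40, 50, 60, 70, 80]
    vc, cycles = pipelined_vadd(va, vb, stages=4)
    print(vc, cycles)
```

In this model the first result appears only after the pipeline has filled; from then on exactly one result completes per cycle, which matches the stated throughput.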

Throughput Math

$$T = 1 \text{ Result/Cycle}$$
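The figure of one result per cycle is the steady-state rate and ignores the time needed to fill the pipeline. As a hedged sketch, assume a functional unit with $d$ stages (a depth the source does not specify) processing a vector of $n$ elements. The first result appears after $d$ cycles and each of the remaining $n - 1$ results follows one cycle later, so the total cycle count $C(n)$ and the effective throughput $T(n)$ are:

$$C(n) = d + (n - 1), \qquad T(n) = \frac{n}{d + n - 1} \xrightarrow{\; n \to \infty \;} 1 \text{ Result/Cycle}$$

For example, with the assumed values $d = 4$ and $n = 64$, the instruction takes $67$ cycles and $T(64) = 64/67 \approx 0.955$ results per cycle, already close to the steady-state limit.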