Overlapping Data Transfer and Kernel Execution
Elapsed Time
0.00s
Efficiency
0%
$$ T \approx t_E + \frac{t_T}{nStreams} $$
$$ T \approx t_T + \frac{t_E}{nStreams} $$