How to Optimize GPU Performance - Guidelines and Ideas
Jan 15, 2020, 343 words

Don’t continue to optimize a part once it accounts for only a small fraction of overall execution time.
\[S_{\text{overall}} = \frac{1}{(1-p) + \frac{p}{s}}\]
where $S_{\text{overall}}$ is the theoretical overall speedup of the whole task, $p$ is the proportion of the whole task taken up by the parallel part, and $s$ is the speedup of that parallel portion [1].
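To make the formula concrete (it is commonly known as Amdahl's law), here is a minimal sketch that evaluates it for a few values. The function name `overall_speedup` and the sample values of $p$ and $s$ are assumptions chosen for illustration, not measurements from a real workload.

```python
def overall_speedup(p: float, s: float) -> float:
    """Theoretical overall speedup when a fraction p of the task is sped up by a factor s."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be in [0, 1]")
    if s <= 0.0:
        raise ValueError("s must be positive")
    return 1.0 / ((1.0 - p) + p / s)


if __name__ == "__main__":
    # Hypothetical fractions of total run time spent in the part being optimized.
    for p in (0.95, 0.50, 0.05):
        modest = overall_speedup(p, 10.0)
        extreme = overall_speedup(p, 1000.0)
        ceiling = 1.0 / (1.0 - p)  # limit of the formula as s grows without bound
        print(f"p={p:.2f}: s=10 -> {modest:.2f}x, s=1000 -> {extreme:.2f}x, ceiling {ceiling:.2f}x")
```

For $p = 0.05$, the overall speedup stays at roughly 1.05x whether that part is made 10 or 1000 times faster, because the ceiling is $1/(1-p)$. At that point, further work on that part is unlikely to pay off.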