Tuning Automatically-Parallelized Code
Opportunities for manual improvement are due to
limitations in compiler techniques. E.g.,
- array reductions are parallelized by only a few compilers
insufficient information available to the compiler. E.g.,
- loop iteration range may depend on input data (unknown at compile time)
- variables are defined in other subroutines (invisible to the compiler without interprocedural analysis)
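
As an illustration of the array-reduction case, here is a minimal sketch of how such a loop might be parallelized by hand. It assumes OpenMP as the parallel programming model, which these notes do not name; the function histogram, the constant NBINS, and the array key are hypothetical names chosen for the example. Each thread accumulates into a private copy of the array, and the copies are merged at the end.

```c
#include <string.h>

#define NBINS 8   /* hypothetical number of bins, chosen for illustration */

/* Array reduction hist[key[i]] += 1, parallelized by hand.
 * Assumes 0 <= key[i] < NBINS for every i. */
void histogram(const int *key, long n, long hist[NBINS])
{
    memset(hist, 0, NBINS * sizeof hist[0]);

    #pragma omp parallel
    {
        long local[NBINS] = {0};    /* private partial result per thread */

        /* Iterations are divided among threads; each thread updates
         * only its own copy, so no two threads write the same element. */
        #pragma omp for nowait
        for (long i = 0; i < n; i++)
            local[key[i]]++;

        /* Merge the partial results, one thread at a time. */
        #pragma omp critical
        for (int b = 0; b < NBINS; b++)
            hist[b] += local[b];
    }
}
```

Without OpenMP enabled, the pragmas are ignored and the function runs serially with the same result. The private-copy approach trades extra memory (one array per thread) for the absence of synchronization inside the main loop.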