Thread-Level Parallelism with OpenMP
./lab-11.pdf
look at:
void v_add_optimized_chunks(double* x, double* y, double* z)
make test-v-add
look at:
double dotp_manual_optimized(double* x, double* y)
double dotp_reduction_optimized(double* x, double* y)
make test-dot-p