1.
Automatic intra-register vectorization for the Intel[R] architecture
by Bik, Aart J.C
International journal of parallel programming, 2002-04-01, Vol.30 (2), p.65-98

2.
Mapping conjugate gradient algorithms for neutron diffusion applications onto SIMD, MIMD, and mixed-mode machines
by SO, J. J. E
International journal of parallel programming, 1998, Vol.26 (2), p.183-207
