The first goal from the LAPACK venture was to help make the widely applied EISPACK and LINPACK libraries run successfully on shared-memory vector and parallel processors. On these machines, LINPACK and EISPACK are inefficient mainly because their memory access styles disregard the multi-layered memory hierarchies of the devices, thus paying excessi