University of California, San Diego
Computer Science
We present a Cholesky factorization for multicore systems with GPU accelerators. The challenges in developing scalable high performance algorithms for these emerging systems stem from their heterogeneity, massive parallelism, and the huge... more
- by Rajib Nath
Recent activities of major chip manufacturers, such as Intel, AMD, IBM and NVIDIA, make it more evident than ever that future designs of microprocessors and large HPC systems will be hybrid/heterogeneous in nature, relying on the... more
- by Rajib Nath
Implementations of the Basic Linear Algebra Subprograms (BLAS) interface are a major building block of dense linear algebra (DLA) libraries, and therefore have to be highly optimized. We present some techniques and implementations that... more
- by Rajib Nath
The goal of the Matrix Algebra on GPU and Multicore Architectures (MAGMA) project is to create a new generation of linear algebra libraries that achieve the fastest possible time to an accurate solution on hybrid/heterogeneous... more
- by Rajib Nath
Abstract. We present a Cholesky factorization for multicore with GPU accelerators. The challenges in developing scalable high performance algorithms for these emerging systems stem from their heterogeneity, massive parallelism, and the... more
- by Rajib Nath
We present a Hessenberg reduction (HR) algorithm for hybrid systems of homogeneous multicore with GPU accelerators that can exceed 25× the performance of the corresponding LAPACK algorithm running on current homogeneous multicores. This... more
- by Rajib Nath
Abstract Dense linear algebra (DLA) software is among the most important in high performance computing. It is also important for its wide usage in other application domains like machine learning, gaming, speech processing, image... more
- by Rajib Nath
Abstract We present an improved matrix-matrix multiplication routine (General Matrix Multiply [GEMM]) in the MAGMA BLAS library that targets the NVIDIA Fermi graphics processing units (GPUs) using Compute Unified Data Architecture... more
- by Rajib Nath
Abstract In this work we propose a joint energy, thermal and cooling management technique (JETC) that significantly reduces per server cooling and memory energy costs. Our analysis shows that decoupling the optimization of cooling energy... more
- by Rajib Nath
CdS thin films of varying thicknesses were deposited on cleaned glass substrates at room temperature by the thermal evaporation technique in a vacuum of about 2 × 10⁻⁵ torr. UV-VIS spectra of the films were studied using the optical... more
The bioelectret state has been proposed to be a universal property of enzymes and to play an important role for the catalytic action of enzymes. In the present investigation the bioelectret state in an enzyme amylase (EC 3.2.2.1) has been... more
We report the synthesis of ZnO quantum dots embedded in a polyvinylpyrrolidone (PVP) matrix and their functioning as an acetone sensor. The specimen is prepared via a quenching technique where bulk ZnO powder is calcined at very high temperature of... more
The rising number of cores in manycore architectures, along with technology scaling, results in high power densities and thermal issues on the die. To explore innovative thermal management techniques in such processors, we need an... more
The tile QR factorization provides an efficient and scalable way for factoring a dense matrix in parallel on multicore processors. This article presents a way of efficiently implementing the algorithm on a system with a powerful GPU and... more