


default search action
Parallel Computing, Volume 35
Volume 35, Number 1, January 2009
- Xiandong Meng, Vipin Chaudhary

:
Boosting data throughput for sequence database similarity searches on FPGAs using an adaptive buffering scheme. 1-11 - Ricardo C. Corrêa

, Valmir Carneiro Barbosa
:
Partially ordered distributed computations on asynchronous point-to-point networks. 12-28 - Lih-Yuan Deng, Huajiang Li, Jyh-Jen Horng Shiau:

Scalable parallel multiple recursive generators of large order. 29-37 - Alfredo Buttari

, Julien Langou
, Jakub Kurzak, Jack J. Dongarra:
A class of parallel tiled linear algebra algorithms for multicore architectures. 38-53
Volume 35, Number 2, February 2009
- Fabrício Alves Barbosa da Silva

, Hermes Senger
:
Improving scalability of Bag-of-Tasks applications running on master-slave platforms. 57-71 - Yuh-Rau Wang:

A novel O(1) time algorithm for 3D block-based medial axis transform by peeling corner shells. 72-82 - Anne Benoit

, Mourad Hakem, Yves Robert
:
Contention awareness and fault-tolerant scheduling for precedence constrained tasks in heterogeneous systems. 83-108 - Lars K. S. Daldorff, Bengt Eliasson

:
Parallelization of a Vlasov-Maxwell solver in four-dimensional phase space. 109-115
Volume 35, Number 3, March 2009
- Rupak Biswas, Leonid Oliker, Jeffrey S. Vetter:

Revolutionary technologies for acceleration of emerging petascale applications. 117-118 - David A. Bader

, Virat Agarwal, Seunghwa Kang:
Computing discrete transforms on the Cell Broadband Engine. 119-137 - Jakub Kurzak, Wesley Alvaro, Jack J. Dongarra:

Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor. 138-150 - Jeremy S. Meredith, Gonzalo Alvarez, Thomas A. Maier

, Thomas C. Schulthess, Jeffrey S. Vetter:
Accuracy and performance of graphics processors: A Quantum Monte Carlo application case study. 151-163 - David J. Hardy, John E. Stone

, Klaus Schulten:
Multilevel summation of electrostatic potentials using graphics processing units. 164-177 - Samuel Williams

, Leonid Oliker, Richard W. Vuduc
, John Shalf
, Katherine A. Yelick
, James Demmel:
Optimization of sparse matrix-vector multiplication on emerging multicore platforms. 178-194
Volume 35, Number 4, April 2009
- Suresh Behara

, Sanjay Mittal:
Parallel finite element computation of incompressible flows. 195-212 - Arquimedes Canedo, Ben A. Abderazek

, Masahiro Sowa:
Efficient compilation for queue size constrained queue processors. 213-225 - Tien-Yien Li, Chih-Hsiung Tsai:

HOM4PS-2.0para: Parallelization of HOM4PS-2.0 for solving polynomial systems. 226-238 - Sid Ahmed Ali Touati, Zsolt Mathe:

Periodic register saturation in innermost loops. 239-254
Volume 35, Number 5, May 2009
- Won Woo Ro, Jean-Luc Gaudiot:

A complexity-effective microprocessor design with decoupled dispatch queues and prefetching. 255-268 - Yaohang Li, Michael Mascagni, Andrey Gorin:

A decentralized parallel implementation for parallel tempering algorithm. 269-283 - Leopold Grinberg, Dmitry Pekurovsky

, Spencer J. Sherwin
, George E. Karniadakis:
Parallel performance of the coarse space linear vertex solver and low energy basis preconditioner for spectral/hp elements. 284-304 - Antonio Robles-Gómez

, Aurelio Bermúdez
, Rafael Casado
, Åshild Grønstad Solheim:
A dynamic distributed mechanism for reconfiguring high-performance networks. 305-312
Volume 35, Number 6, June 2009
- Ching-Wen Chen, Chuan-Chi Weng, Chang-Jung Ku:

An overlapping and pipelining data transmission MAC protocol with multiple channels in ad hoc networks. 313-330 - Taro Konda, Yoshimasa Nakamura:

A new algorithm for singular value decomposition and its parallelization. 331-344 - Gerold Jäger, Clemens Wagner:

Efficient parallelizations of Hermite and Smith normal form algorithms. 345-357 - Julian Borrill, Leonid Oliker, John Shalf

, Hongzhang Shan, Andrew Uselton:
HPC global file system performance analysis using a scientific-application derived benchmark. 358-373
Volume 35, Number 7, July 2009
- Markus Geimer

, Felix Wolf, Brian J. N. Wylie, Bernd Mohr
:
A scalable tool architecture for diagnosing wait states in massively parallel applications. 375-388 - Jay Smith, Vladimir Shestak, Howard Jay Siegel, Suzy Price, Larry Teklits, Prasanna Sugavanam:

Robust resource allocation in a cluster based imaging system. 389-400 - Yang Wang, Ming Zhu, Hua Li:

A distributed Key Message algorithm to optimize the communication in clusters. 401-415 - Hatem Ltaief

, Marc Garbey:
A parallel Aitken-additive Schwarz waveform relaxation suitable for the grid. 416-428
Volume 35, Numbers 8-9, August - September 2009
- Cole Trapnell, Michael C. Schatz

:
Optimizing data intensive GPGPU computations for DNA sequence alignment. 429-440 - Tz-Liang Kueng, Cheng-Kuan Lin, Tyne Liang, Jimmy J. M. Tan, Lih-Hsing Hsu:

Embedding paths of variable lengths into hypercubes with conditional link-faults. 441-454 - Arturo González-Escribano

, Arjan J. C. van Gemund, Valentín Cardeñoso-Payo
:
Performance implications of synchronization structure in parallel programming. 455-474 - Ananta Tiwari, Vahid Tabatabaee, Jeffrey K. Hollingsworth:

Tuning parallel applications in parallel. 475-492
Volume 35, Numbers 10-11, October - November 2009
- Diane Lingrand, Tristan Glatard

, Johan Montagnat:
Modeling the latency on production grids with respect to the execution context. 493-511 - Anshu Dubey

, Katie Antypas, Murali K. Ganapathy, Lynn B. Reid, Katherine Riley, Daniel J. Sheeler, Andrew R. Siegel, Klaus Weide
:
Extensible component-based architecture for FLASH, a massively parallel, multiphysics simulation code. 512-522 - Ismael Marín Carrión

, Enrique Arias-Antúnez, M. M. Artigao Castillo, Julio José Águila Guerrero, Juan José Miralles Canals:
Thread-based implementations of the false nearest neighbors method. 523-534 - Hamid Mahini, Hamid Sarbazi-Azad:

Resource placement in three-dimensional tori. 535-543 - Henning Meyerhenke

, Burkhard Monien, Stefan Schamberger:
Graph partitioning and disturbed diffusion. 544-569
Volume 35, Number 12, December 2009
- Franck Cappello, Thomas Hérault

, Jack J. Dongarra:
Foreword. 571
- Bin Jia:

Process cooperation in multiple message broadcast. 572-580 - Peter Sanders, Jochen Speck, Jesper Larsson Träff:

Two-tree algorithms for full bandwidth broadcast, reduction and scan. 581-594 - Daniel Becker, Rolf Rabenseifner, Felix Wolf, John C. Linford:

Scalable timestamp synchronization for event traces of message-passing applications. 595-607 - Rajeev Thakur

, William Gropp
:
Test suite for evaluating performance of multithreaded MPI communication. 608-617

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














