


IPDPS 2015: Hyderabad, India - Workshops
- 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, IPDPS 2015, Hyderabad, India, May 25-29, 2015. IEEE Computer Society 2015, ISBN 978-1-4673-7684-6

Workshop 1: HCW - Heterogeneity in Computing Workshop
- Shoukat Ali, Denis Trystram:

HCW Introduction. 1-2 - Behrooz A. Shirazi:

Message from the HCW Steering Committee Chair. 3 - Denis Trystram:

Message from the HCW Program Committee Chair. 4 - Andrew S. Grimshaw:

HCW 2014 Keynote Talk. 5
Session 1: Scheduling and Load Balancing
- Nathanael Cheriere, Erik Saule:

Considerations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases. 6-16 - Tarun Beri, Sorav Bansal, Subodh Kumar:

ProSteal: A Proactive Work Stealer for Bulk Synchronous Tasks Distributed on a Cluster of Heterogeneous Machines with Multiple Accelerators. 17-26 - Safia Kedad-Sidhoum, Florence Monna, Denis Trystram:

Scheduling Tasks with Precedence Constraints on Hybrid Multi-core Machines. 27-33
Session 2: Applications
- Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, Julien Herrmann, Suraj Kumar, Loris Marchal, Samuel Thibault:
Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms. 34-45 - Md. Tarikul Islam, Hien Nguyen, Jaspal Subhlok, Edgar Gabriel:

Efficient Message Logging to Support Process Replicas in a Volunteer Computing Environment. 46-56 - Subhash Saini, Haoqiang Jin, Dennis C. Jespersen, Samson Cheung, M. Jahed Djomehri, Johnny Chang, Robert Hood:

Early Multi-node Performance Evaluation of a Knights Corner (KNC) Based NASA Supercomputer. 57-67
Workshop 2: RAW - Reconfigurable Architectures Workshop
- Jürgen Becker, Ken Eguro, Diana Göhringer, Wayne Luk, Marco D. Santambrogio, Ramachandran Vaidyanathan, Steven J. E. Wilton:
RAW Introduction and Committees. 68-69 - Viktor K. Prasanna:

RAW 2015 Keynote. 70
Session 1 - Runtime and Tools for Partially Reconfigurable FPGA-Based Systems
- Tian Xia, Jean-Christophe Prévotet, Fabienne Nouvel:
Mini-NOVA: A Lightweight ARM-based Virtualization Microkernel Supporting Dynamic Partial Reconfiguration. 71-80 - Berend H. J. Dekens, Marco Jan Gerrit Bekooij, Gerard J. M. Smit:

Real-Time Multiprocessor Architecture for Sharing Stream Processing Accelerators. 81-89 - Aurelio Morales-Villanueva, Ann Gordon-Ross:
Partial Region and Bitstream Cost Models for Hardware Multitasking on Partially Reconfigurable FPGAs. 90-96 - Marco Rabozzi, Riccardo Cattaneo, Tobias Becker, Wayne Luk, Marco D. Santambrogio:
Relocation-Aware Floorplanning for Partially-Reconfigurable FPGA-Based Systems. 97-104
Session 2 - Applications and Special Purpose Architectures with Reconfigurable Hardware
- Da Tong, Shijie Zhou, Viktor K. Prasanna:

High-Throughput Online Hash Table on FPGA. 105-112 - Nachiket Kapre, Han Jianglei, Andrew Bean, Pradeep Moorthy, Siddhartha:

GraphMMU: Memory Management Unit for Sparse Graph Accelerators. 113-120 - Omer Arap, Martin Swany, Geoffrey Brown, Bryce Himebaugh:
Adaptive Recursive Doubling Algorithm for Collective Communication. 121-128 - Shijie Zhou, Charalampos Chelmis, Viktor K. Prasanna:
Accelerating Large-Scale Single-Source Shortest Path on FPGA. 129-136
Session 3 - New Architectures and Performance Evaluation for Reconfigurable Computing
- Nicklas Bo Jensen, Pascal Schleuniger, Andreas Erik Hindborg, Maxwell Walter, Sven Karlsson:
Experiences with Compiler Support for Processors with Exposed Pipelines. 137-143 - Arash Ashrafi, Ramachandran Vaidyanathan:

An Architecture for Configuring an Efficient Scan Path for a Subset of Elements. 144-153 - Shreyas G. Singapura, Anand V. Panangadan, Viktor K. Prasanna:

Performance Modeling of Matrix Multiplication on 3D Memory Integrated FPGA. 154-162 - Lim Hui Hui, Nachiket Kapre:

Enhancing Speedups for FPGA Accelerated SPICE through Frequency Scaling and Precision Reduction. 163-169
Short Papers
- Rohit Kumar, Ann Gordon-Ross:

An Automated High-Level Design Framework for Partially Reconfigurable FPGAs. 170-175 - Marc-André Daigneault, Jean-Pierre David:

Intermediate-Level Synthesis of a Gauss-Jordan Elimination Linear Solver. 176-181 - Riccardo Cattaneo, Mahdi Badie Moradmand, Donatella Sciuto, Marco D. Santambrogio:
K-Ways Partitioning of Polyhedral Process Networks: A Multi-level Approach. 182-189 - Christian Herglotz, Jürgen Seiler, André Kaup, Arne Hendricks, Marc Reichenbach, Dietmar Fey:
Estimation of Non-functional Properties for Embedded Hardware with Application to Image Processing. 190-195 - Kartik V. Hegde, Vadiraj Kulkarni, R. Harshavardhan, David S. Sumam:

Adaptive Reconfigurable Architecture for Image Denoising. 196-201
Workshop 3: HIPS-Workshop on High-Level Parallel Programming Models and Supportive Environments and LSPP-Workshop on Large-Scale Parallel Processing
- Sriram Krishnamoorthy, Tobias Hilbrich, Darren J. Kerbyson, Ramakrishnan Rajamony, Charles C. Weems:

HIPS-LSPP Introduction and Committees. 202-203 - Torsten Hoefler, Laxmikant V. Kalé:

HIPS-LSPP Keynotes. 204
Session I: Performance Analysis and Optimization
- Matthias Weber, Ronald Geisler, Holger Brunst, Wolfgang E. Nagel:

Folding Methods for Event Timelines in Performance Analysis. 205-214 - Tim Cramer, Robert Dietrich, Christian Terboven, Matthias S. Müller, Wolfgang E. Nagel:
Performance Analysis for Target Devices with the OpenMP Tools Interface. 215-224 - Jian Lin, Khaled Hamidouche, Xiaoyi Lu, Mingzhe Li, Dhabaleswar K. Panda:
High-Performance Coarray Fortran Support with MVAPICH2-X: Initial Experience and Evaluation. 225-234 - Sourav Chakraborty, Hari Subramoni, Jonathan L. Perkins, Ammar Ahmad Awan, Dhabaleswar K. Panda:

On-demand Connection Management for OpenSHMEM and OpenSHMEM+MPI. 235-244
Session II: Parallelization
- Aravind Sukumaran-Rajam, Luis Esteban Campostrini, Juan Manuel Martinez Caamaño, Philippe Clauss:
Speculative Runtime Parallelization of Loop Nests: Towards Greater Scope and Efficiency. 245-254
Session III: Application-Specific Studies
- Daniel G. Chavarría-Miranda, Mahantesh Halappanavar, Sriram Krishnamoorthy, Joseph B. Manzano, Abhinav Vishnu, Adolfy Hoisie:
On the Impact of Execution Models: A Case Study in Computational Chemistry. 255-264 - Nishant Saurabh, Ana Lucia Varbanescu, Gyan Ranjan:
Computing the Pseudo-Inverse of a Graph's Laplacian Using GPUs. 265-274
Workshop 4: NIDISC - Workshop on Nature Inspired Distributed Computing
- Pascal Bouvry, Grégoire Danoy, Franciszek Seredynski, El-Ghazali Talbi, Albert Y. Zomaya:
NIDISC Introduction and Committees. 275
Session 1: Applications of Bio-Inspired Algorithms
- Jakub Gasior, Franciszek Seredynski:
Dynamic Job Scheduling in the Cloud Using Slowdown Optimization and Sandpile Cellular Automata Model. 276-285 - Francois Legillon, Nouredine Melab, Didier Renard, El-Ghazali Talbi:

A Multi-objective Evolutionary Algorithm for Cloud Platform Reconfiguration. 286-291 - Raed Alkharboush, Robson Eduardo De Grande, Azzedine Boukerche:
A Genetic Algorithm Approach for Adjusting Time Series Based Load Prediction. 292-298
Session 2: Parallel, Distributed, and Adaptive Algorithms
- Omar Andrés Carmona Cortes, Mônica Sakuray Pais, Filipo Novo Mór, Andrew Rau-Chaplin, César Augusto Missio Marcon:
Differential Evolution on a GPGPU: The Influence of Parameters on Speedup and the Quality of Solutions. 299-306 - Jakub Muszynski, Sébastien Varrette, Bernabé Dorronsoro Díaz, Pascal Bouvry:
Distributed Cellular Evolutionary Algorithms in a Byzantine Environment. 307-313 - Amir Nakib, Bernard Thibault, Patrick Siarry:
Bayesian Based Metaheuristic for Large Scale Continuous Optimization. 314-322 - Ajay Pratap, Rajiv Misra:
Firefly Inspired Improved Distributed Proximity Algorithm for D2D Communication. 323-328
Workshop 5: HiCOMB - Workshop on High Performance Computational Biology
- Sanguthevar Rajasekaran, Srinivas Aluru, David A. Bader:
HiCOMB Introduction and Committees. 329-330 - Ramesh Hariharan, Ananth Kalyanaraman, Michela Taufer, Trilce Estrada, Pietro Cicotti, Pavan Balaji:
HiCOMB 2015 Keynote and Invited Talks. 331
HiCOMB Session 1
- Tuan Tu Tran, Mathieu Giraud, Jean-Stéphane Varré:
Perfect Hashing Structures for Parallel Similarity Searches. 332-341 - Basavaraj Talawar:
A Crossbar Interconnection Network in DNA. 342-345 - Denis Trystram:

Handling Heterogeneity for Efficient Implementations: A Case Study on Sequence Comparison. 346-349 - G. M. Siddesh, K. G. Srinivasa, Ishank Mishra, Abhinav Anurag, Eklavya Uppal:
Phylogenetic Analysis Using MapReduce Programming Model. 350-356
HiCOMB Session 2
- Wajeeta Lohana, Jawwad A. Shamsi, Tahir Q. Syed, Farrukh Hasan:

Towards Context-Aware DNA Sequence Compression for Efficient Data Exchange. 357-366
HiCOMB Session 3
- Solon P. Pissis, Ahmad Retha:
Generalised Implementation for Fixed-Length Approximate String Matching under Hamming Distance and Applications. 367-374 - Hanyu Jiang, Narayan Ganesan:

Fine-Grained Acceleration of HMMER 3.0 via Architecture-Aware Optimization on Massively Parallel Processors. 375-383
Workshop 6: APDCM - Advances in Parallel and Distributed Computing Models
- Oscar H. Ibarra, Koji Nakano, Akihiro Fujiwara, Susumu Matsumae:
APDCM Introduction and Committees. 384
Session 1: Parallel Algorithms and Applications
- Toru Fujita, Koji Nakano, Yasuaki Ito:
Bulk GCD Computation Using a GPU to Break Weak RSA Keys. 385-394 - Meher Chaitanya, Kishore Kothapalli:

A Simple Parallel Algorithm for Biconnected Components in Sparse Graphs. 395-404 - Marc Aurel Kiefer, Korbinian Molitorisz, Jochen Bieler, Walter F. Tichy:

Parallelizing a Real-Time Audio Application - A Case Study in Multithreaded Software Engineering. 405-414 - Ajay Kattepur, Manoj Nambiar:

Performance Modeling of Multi-tiered Web Applications with Varying Service Demands. 415-424
Session 2: Parallel Computing Systems
- Abhishek Bansal, Sambhav Gupta, Turbo Majumder:

Efficient Estimation of Non-stationary Traffic Parameters on Networks-on-Chip. 425-433 - Daniel Dauwe, Eric Jonardi, Ryan D. Friese, Sudeep Pasricha, Anthony A. Maciejewski, David A. Bader, Howard Jay Siegel:
A Methodology for Co-Location Aware Application Performance Modeling in Multicore Computing. 434-443 - Shounak Chakraborty, Shirshendu Das, Hemangee K. Kapoor:
Performance Constrained Static Energy Reduction Using Way-Sharing Target-Banks. 444-453 - Ke Gao, Dongrui Fan, Jie Wu, Zhiyong Liu:
Decoupling Contention with Victim Row-Buffer on Multicore Memory Systems. 454-463
Session 3: Distributed Algorithms and Computing
- Manmohan Chaubey, Erik Saule:

Replicated Data Placement for Uncertain Scheduling. 464-472 - Guillaume Aupy, Anne Benoit, Henri Casanova, Yves Robert:
Scheduling Computational Workflows on Failure-Prone Platforms. 473-482 - Nicolas Braud-Santoni, Swan Dubois, Mohamed-Hamza Kaaouachi, Franck Petit:
A Generic Framework for Impossibility Results in Time-Varying Graphs. 483-489 - Ajoy Kumar Datta, Anissa Lamani, Lawrence L. Larmore, Franck Petit:
Enabling Ring Exploration with Myopic Oblivious Robots. 490-499
Session 4: Wireless Networks and Distributed Systems
- Jian Tang, Mikel Larrea, Sergio Arévalo, Ernesto Jiménez:
Implementing Uniform Reliable Broadcast in Anonymous Distributed Systems with Fair Lossy Channels. 500-508 - Min Shen, Ajay D. Kshemkalyani, Ta Yuan Hsu:
Causal Consistency for Geo-Replicated Cloud Storage under Partial Replication. 509-518 - Lucas Rodrigues Costa, Lucas Saad N. Nunes, Jacir Luiz Bordim, Koji Nakano:
Asterisk PBX Capacity Evaluation. 519-524 - Marcos Fagundes Caetano, Jacir Luiz Bordim:
A Fair Randomized Contention Resolution Protocol for Wireless Nodes without Collision Detection Capabilities. 525-533
Workshop 7: HPBC - High Performance Big Data and Cloud Computing Workshop and HPDIC - High Performance Data Intensive Computing
- Eric E. Aubanel, Virendrakumar C. Bhavsar, Michael A. Frumkin:

HPBC Introduction and Committees. 534 - Tim Mattson:

HPBC Keynote. 535 - Christophe Cérin, R. K. Shyamasundar, Yuqing Gao, Congfeng Jiang:

HPDIC Introduction and Committees. 536
Session 1: Big Data and Cloud Computing: Storage, Analytics and Data Transfer
- Lars Lundberg, Håkan Grahn, Dragos Ilie, Christian Melander:
Cache Support in a High Performance Fault-Tolerant Distributed Storage System for Cloud and Big Data. 537-546 - Madhushi Niluka Bandara, Rajitha Madhushan Ranasinghe, Rashmi Woranga Mudugamuwa Arachchi, Channa Gayan Somathilaka, Srinath Perera, Daya Chinthana Wimalasuriya:
A Complex Event Processing Toolkit for Detecting Technical Chart Patterns. 547-556 - Eun-Sung Jung, Rajkumar Kettimuthu:
High-Performance Serverless Data Transfer over Wide-Area Networks. 557-564
Session 2: High Performance Data Intensive Computing
- E. Wes Bethel, David Camp, David Donofrio, Mark Howison:

Improving Performance of Structured-Memory, Data-Intensive Applications on Multi-core Platforms via a Space-Filling Curve Memory Layout. 565-574 - Bhavik Shah, Trupti Padiya, Minal Bhise:
Query Execution for RDF Data Using Structure Indexed Vertical Partitioning. 575-584 - Medha Abhijeet Shah, Dinesh B. Kulkarni:

Storm Pub-Sub: High Performance, Scalable Content Based Event Matching System Using Storm. 585-590
Workshop 8: ASHES - Accelerators and Hybrid Exascale Systems
- James Dinan, Wenguang Chen, Xiaosong Ma, Pavan Balaji, Satoshi Matsuoka, Jiayuan Meng, Yunquan Zhang:

AsHES Introduction and Committees. 591-592 - Michela Taufer:
AsHES Keynote. 593
Session 1: Accelerating Analytics
- Sina Meraji, John Keenleyside, Sunil Kamath, Bob Blainey:

Towards a Combined Grouping and Aggregation Algorithm for Fast Query Processing in Columnar Databases with GPUs. 594-603 - Dipanjan Sengupta, Kapil Agarwal, Shuaiwen Leon Song, Karsten Schwan:

GraphReduce: Large-Scale Graph Analytics on Accelerator-Based HPC Systems. 604-609 - Shuai Che, Gregory Rodgers, Bradford M. Beckmann, Steven K. Reinhardt:

Graph Coloring on the GPU and Some Techniques to Improve Load Imbalance. 610-617
Session 2: Algorithm Design for Heterogeneous Systems
- Sushil K. Prasad, Michael McDermott, Xi He, Satish Puri:
GPU-based Parallel R-tree Construction and Querying. 618-627 - Aditya Deshpande, P. J. Narayanan:
Fast Burrows Wheeler Compression Using All-Cores. 628-636 - Kiran Raj Ramamoorthy, Dip Sankar Banerjee, Kannan Srinathan, Kishore Kothapalli:
A Novel Heterogeneous Algorithm for Multiplying Scale-Free Sparse Matrices. 637-646 - Kazuya Matsumoto, Toshihiro Hanawa, Yuetsu Kodama, Hisafumi Fujii, Taisuke Boku:
Implementation of CG Method on GPU Cluster with Proprietary Interconnect TCA for GPU Direct Communication. 647-655
Workshop 9: PLC - Programming Models, Languages, and Compilers for Manycore and Heterogeneous Architectures
- Sunita Chandrasekaran:

PLC Introduction and Committees. 656-657 - Michael Gschwind:

PLC Keynote. 658
Session I: Programming and Compilation Techniques for Heterogeneous and Multicore Systems
- Meghana Gupta, Dibyendu Das, Prakash Raghavendra, Tony Tye, Leonid Lobachev, Amit Agarwal, Ravish Hegde:

Implementing Cross-Device Atomics in Heterogeneous Processors. 659-668 - Rajesh Kumar, Kishore Kothapalli:

A Novel Heterogeneous Framework for Local Dependency Dynamic Programming Problems. 669-678 - Peng Sun, Sunita Chandrasekaran, Barbara M. Chapman:

OpenMP-MCA: Leveraging Multiprocessor Embedded Systems Using Industry Standards. 679-688
Session II: Parallel Programming Experiences and Lessons Learned
- Guido Juckeland, Alexander Grund, Wolfgang E. Nagel:
Performance Portable Applications for Hardware Accelerators: Lessons Learned from SPEC ACCEL. 689-698 - Suttinee Sawadsitang, James Lin, Simon See, François Bodin, Satoshi Matsuoka:

Understanding Performance Portability of OpenACC for Supercomputers. 699-707
Session III: Novel Approaches for Emerging Platforms
- Deepak Majeti, Vivek Sarkar:

Heterogeneous Habanero-C (H2C): A Portable Programming Model for Heterogeneous Processors. 708-717 - Gil Rapaport, Ayal Zaks, Yosi Ben-Asher:

Streamlining Whole Function Vectorization in C Using Higher Order Vector Semantics. 718-727
Workshop 10: EduPar - NSF/TCPP Workshop on Parallel and Distributed Computing Education
- Andrew Lumsdaine, Sushil K. Prasad, Martina Barnas:
EduPar Introduction and Committees. 728-729 - Geoffrey Charles Fox:

EduPar Keynote. 730
Session 1: Methods and Tools
- Jörg Hilpert, Rüdiger Berlich, Peter Lürßen, Almut Zwölfer, Jochen Barwind:

Teaching Simulations and High Performance Computing at Secondary Schools in the German State of Baden-Württemberg. 731-738 - Nasser Giacaman, Simar Kalra, Oliver Sinnen:
The Active classroom: Students and Instructors Parallel Programming in Parallel. 739-745 - Ian Finlayson, Jerome Mueller, Shehan Rajapakse, Daniel Easterling:

Introducing Tetra: An Educational Parallel Programming System. 746-751 - Joel C. Adams:
Patternlets: A Teaching Tool for Introducing Students to Parallel Design Patterns. 752-759
Session 2: Course Design
- Julio Sahuquillo, Salvador Petit, Vicent Selfa, María Engracia Gómez:
A Research-Oriented Course on Advanced Multicore Architecture. 760-765 - Karen L. Karavanic, Daniel Leblanc:

Updating an Introductory Performance Course with PDC Topics. 766-771 - Jawwad A. Shamsi, Nouman M. Durrani, Nadeem Kafi Khan:

Novelties in Teaching High Performance Computing. 772-778
Session 3: Curriculum Integration
- Ali Abu El Humos, Sungbum Hong, Jacqueline M. Jackson, Xuejun Liang, Tzusheng Pei, Bernard Aldrich:

Incorporating PDC Modules Into Computer Science Courses at Jackson State University. 779-781 - Guoming Lu, Jie Xu, Jieyan Liu, Bo Dai, Shenglin Gui, Siyu Zhan:
Integrating Parallel and Distributed Computing Topics into an Undergraduate CS Curriculum at UESTC. 782-787 - Ali Ebnenasir, Jean Mayo:

Fault-Tolerant Parallel and Distributed Computing for Software Engineering Undergraduates. 788-794
Workshop 11: GABB - Graph Algorithms Building Blocks
- Tim Mattson:

GABB Introduction and Committees. 795
GABB Session 1
- Marcin Zalewski, Nicholas Gerard Edmonds, Andrew Lumsdaine:
Declarative Patterns for Imperative Distributed Graph Algorithms. 796-803 - Ariful Azad, Aydin Buluç, John R. Gilbert:
Parallel Triangle Counting and Enumeration Using Matrix Algebra. 804-811
GABB Session 2
- Anil N. Hirani, Kaushik Kalyanaraman, Seth Watts:

Graph Laplacians and Least Squares on Graphs. 812-821 - Vijay Gadepally, Jake Bolewski, Dan Hook, Dylan Hutchison, Benjamin A. Miller, Jeremy Kepner:

Graphulo: Linear Algebra Graph Kernels for NoSQL Databases. 822-830 - Jeremiah Willcock, Andrew Lumsdaine:
A Unifying Programming Model for Parallel Graph Algorithms. 831-840 - Carl Yang, Yangzihao Wang, John D. Owens:

Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU. 841-847
Workshop 12: HPPAC - High-Performance, Power-Aware Computing
- Wu-chun Feng, Barry Rountree:

HPPAC Introduction and Committees. 848
Session 1: Provisioning and Management
- Akhil Langer, Harshit Dokania, Laxmikant V. Kalé, Udatta S. Palekar:

Analyzing Energy-Time Tradeoff in Power Overprovisioned HPC Data Centers. 849-854 - Daniel Balouek-Thomert, Eddy Caron, Laurent Lefèvre:

Energy-Aware Server Provisioning by Introducing Middleware-Level Dynamic Green Scheduling. 855-862 - Yiannis Georgiou, David Glesser, Denis Trystram:

Adaptive Resource and Job Management for Limited Power Consumption. 863-870
Session 2: Measurement, Modeling, and Optimization
- Rubasri Kalidas, Mayank Daga, Konstantinos Krommydas, Wu-chun Feng:

On the Performance, Energy, and Power of Data-Access Methods in Heterogeneous Computing Systems. 871-879 - Vignesh Adhinarayanan, Wu-chun Feng, Jonathan Woodring, David H. Rogers, James P. Ahrens:
On the Greenness of In-Situ and Post-Processing Visualization Pipelines. 880-887 - Nirmal Prajapati, Waruna Ranasinghe, Vamshi Tandrapati, Rumen Andonov, Hristo N. Djidjev, Sanjay V. Rajopadhye:
Energy Modeling and Optimization for Tiled Nested-Loop Codes. 888-895
Session 3: Efficiency
- Daniel Hackenberg, Robert Schöne, Thomas Ilsche, Daniel Molka, Joseph Schuchart, Robin Geyer:
An Energy Efficiency Feature Survey of the Intel Haswell Processor. 896-904 - Rogelio Long, Shirley Moore, Barry Rountree:

Iso-Power-Efficiency: An Approach to Scaling Application Codes with a Power Budget. 905-910 - Sridutt Bhalachandra, Allan Porterfield, Jan F. Prins:

Using Dynamic Duty Cycle Modulation to Improve Energy Efficiency in High Performance Computing. 911-918
Workshop 13: PDSEC-Workshop on Parallel and Distributed Scientific and Engineering Computing
- Peter E. Strazdins, Raphaël Couturier, Keita Teranishi, John O'Donnell, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
PDSEC Introduction and Committees. 919-920 - Naoya Maruyama:

PDSEC Keynote. 921
Session 1: Best Paper
- Jean-Claude Charr, Raphaël Couturier, Ahmed Fanfakh, Arnaud Giersch:
Energy Consumption Reduction with DVFS for Message Passing Iterative Applications on Heterogeneous Architectures. 922-931
Session 2: Performance
- Steven A. Wright, Stephen A. Jarvis:
Quantifying the Effects of Contention on Parallel File Systems. 932-940 - Peter E. Strazdins, Md. Mohsin Ali, Brendan Harding:
Highly Scalable Algorithms for the Sparse Grid Combination Technique. 941-950 - Ananta Tiwari, Martin Schulz, Laura Carrington:
Predicting Optimal Power Allocation for CPU and DRAM Domains. 951-959
Session 3: Linear Algebra
- Takeshi Fukaya, Toshiyuki Imamura:

Performance Evaluation of the Eigen Exa Eigensolver on Oakleaf-FX: Tridiagonalization Versus Pentadiagonalization. 960-969 - Sara S. Hamouda, Josh Milthorpe, Peter E. Strazdins, Vijay A. Saraswat:
A Resilient Framework for Iterative Linear Algebra Applications in X10. 970-979 - Massimiliano Fasi, Yves Robert, Bora Uçar:
Combining Backward and Forward Recovery to Cope with Silent Errors in Iterative Solvers. 980-989 - Raphaël Couturier, Lilia Ziane Khodja, Christophe Guyeux:
TSIRM: A Two-Stage Iteration with Least-Squares Residual Minimization Algorithm to Solve Large Sparse Linear Systems. 990-997
Session 4: GPUs and Manycore
- Jiayuan Meng, Thomas D. Uram, Vitali A. Morozov, Venkatram Vishwanath, Kalyan Kumaran:

Modeling Cooperative Threads to Project GPU Performance for Adaptive Parallelism. 998-1007 - Takuro Udagawa, Masakazu Sekijima:

GPU Accelerated Molecular Dynamics with Method of Heterogeneous Load Balancing. 1008-1013 - Paolo Spallaccini, Farbod Kayhan, Stefano Chinnici, Guido Montorsi:

Parallel Methods for Optimizing High Order Constellations on GPUs. 1014-1023
Workshop 14: DPDNS - Dependable Parallel, Distributed, and Network-Centric Systems
- Dimiter R. Avresky, Erik Maehle, Nectarios Koziris, Anastassios Nanos:
DPDNS Introduction and Committees. 1024
Session 1: Reliability and Threat-Detection
- Sanem Arslan, Haluk Rahmi Topcuoglu, Mahmut Taylan Kandemir, Oguz Tosun:
Performance and Energy Efficient Asymmetrically Reliable Caches for Multicore Architectures. 1025-1032 - Marc Eduard Frîncu:
Distributed Scheduling Algorithm for Highly Available Component Based Applications. 1033-1041 - Paul Wood, Saurabh Bagchi, Alefiya Hussain:

Optimizing Defensive Investments in Energy-Based Cyber-Physical Systems. 1042-1051
Session 2: Fault Tolerance
- Nentawe Gurumdimma, Arshad Jhumka, Maria Liakata, Edward Chuah, James C. Browne:
Towards Detecting Patterns in Failure Logs of Large-Scale Distributed Systems. 1052-1061 - Salem Saker, Adnan Agbaria:

Communication Pattern-Based Distributed Snapshots in Large-Scale Systems. 1062-1071 - Alessandro Pellegrini, Pierangelo di Sanzo, Dimiter R. Avresky:
A Machine Learning-Based Framework for Building Application Failure Prediction Models. 1072-1081
Session 3: Algorithms, Protocols, and Topologies
- Théodore Jean Richard Relaza, Jacques Jorda, Abdelaziz Mzoughi:

Trapezoid Quorum Protocol Dedicated to Erasure Resilient Coding Based Schemes. 1082-1088 - Brendan Benshoof, Andrew Rosen, Anu G. Bourgeois, Robert W. Harrison:
A Distributed Greedy Heuristic for Computing Voronoi Tessellations with Applications Towards Peer-to-Peer Networks. 1089-1096 - Kaliappa Ravindran:

Dependability Modeling and Assessment of Complex Adaptive Networked Systems. 1097-1105
Workshop 15: PCO - Parallel Computing and Optimization
- Didier El Baz, Bora Uçar:
PCO Introduction and Committees. 1106-1107 - Alex Pothen:

PCO Keynote. 1108
Session 1: Optimization Techniques for Parallel or Distributed Architectures
- Christian Toinard, Timothee Ravier, Christophe Cérin, Yanik Ngoko:

The Promethee Method for Cloud Brokering with Trust and Assurance Criteria. 1109-1118 - Maximilian Odendahl, Andres Goens, Rainer Leupers, Gerd Ascheid, Tomas Henriksson:
Buffer Allocation Based On-Chip Memory Optimization for Many-Core Platforms. 1119-1124
Session 2: Combinatorial Scientific Computing and Parallel Optimization Algorithms
- Enver Kayaaslan, Bora Uçar, Cevdet Aykanat:
Semi-two-dimensional Partitioning for Parallel Sparse Matrix-Vector Multiplication. 1125-1134 - Didier El Baz, Moussa Elkihel:
Parallel Asynchronous Modified Newton Methods for Network Flows. 1135-1142 - Prashant Palkar, Ashutosh Mahajan:
A Branch-and-Estimate Heuristic Procedure for Solving Nonconvex Integer Optimization Problems. 1143-1151
Workshop 16: ParLearning - Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics
- Sutanay Choudhury, Arindam Pal, Anand V. Panangadan, Yinglong Xia:

ParLearning Introduction and Committees. 1152-1153 - David A. Bader, Yihua Huang, Ananth Kalyanaraman:
ParLearning Keynotes. 1154-1156 - Tao Luo, Yin Liao, Yurong Chen, Jianguo Li, Victor Lee:
LFRTrainer: Large-Scale Face Recognition Training System. 1157-1165 - Charith Wickramaarachchi, Charalampos Chelmis, Viktor K. Prasanna:
Empowering Fast Incremental Computation over Large Scale Dynamic Graphs. 1166-1171 - M. Sai Rajeswar, Adepu Ravi Sankar, Vineeth N. Balasubramanian, C. D. Sudheer:
Scaling Up the Training of Deep CNNs for Human Action Recognition. 1172-1177 - Yusuke Nishioka, Kenjiro Taura:
Scalable Task-Parallel SGD on Matrix Factorization in Multicore Architectures. 1178-1184 - Ravikant Dindokar, Neel Choudhury, Yogesh L. Simmhan:
Analysis of Subgraph-Centric Distributed Shortest Path Algorithm. 1185-1190 - Bing Lin, Wenzhong Guo, Guolong Chen, Naixue Xiong, Rongrong Li:

Cost-Driven Scheduling for Deadline-Constrained Workflow on Multi-clouds. 1191-1198
Workshop 17: JSSPP - Workshop on Job Scheduling Strategies for Parallel Processing
- Walfredo Cirne, Narayan Desai:

JSSPP Introduction and Committees. 1199
Workshop 18: iWAPT - International Workshop on Automatic Performance Tuning
- Yusaku Yamamoto, Weichung Wang:

iWAPT Introduction and Committees. 1200-1201 - Ponnuswamy Sadayappan, Ray-Bing Chen:

iWAPT Invited Talks. 1202-1203
iWAPT Session 1
- Youcef Barigou, Vishwanath Venkatesan, Edgar Gabriel:

Auto-tuning Non-blocking Collective Communication Operations. 1204-1213 - Tomohiro Suzuki:

Improved Internode Communication for Tile QR Decomposition for Multicore Cluster Systems. 1214-1220
iWAPT Session 2
- Takahiro Katagiri, Satoshi Ohshima, Masaharu Matsumoto:
Directive-Based Auto-Tuning for the Finite Difference Method on the Xeon Phi. 1221-1230 - Thomas L. Falch, Anne C. Elster:
Machine Learning Based Auto-Tuning for Enhanced OpenCL Performance Portability. 1231-1240 - Martin Kong, Louis-Noël Pouchet, Ponnuswamy Sadayappan:
A Roofline-Based Performance Estimator for Distributed Matrix-Multiply on Intel CnC. 1241-1250
iWAPT Session 3
- Shajulin Benedict, R. S. Rejitha, Philipp Gschwandtner, Radu Prodan, Thomas Fahringer:
Energy Prediction of OpenMP Applications Using Random Forest Modeling Approach. 1251-1260 - Sanath Jayasena, Milinda Fernando, Tharindu Rusira, Chalitha Perera, Chamara Philips:
Auto-Tuning the Java Virtual Machine. 1261-1270
Workshop 19: Julia-Invited Workshop: A New Approach to High Performance Technical Computing
- Alan Edelman:

Julia Introduction. 1271















