


default search action
PACT 2012: Minneapolis, MN, USA
- Pen-Chung Yew, Sangyeun Cho, Luiz DeRose, David J. Lilja:

International Conference on Parallel Architectures and Compilation Techniques, PACT '12, Minneapolis, MN, USA - September 19 - 23, 2012. ACM 2012, ISBN 978-1-4503-1182-3
Keynote address
- Peter J. Ungaro:

The changing role of supercomputing. 1-2
Power
- Wim Heirman

, Souradip Sarkar, Trevor E. Carlson
, Ibrahim Hur, Lieven Eeckhout:
Power-aware multi-core simulation for early design stage hardware/software co-optimization. 3-12 - Kai Ma, Xiaorui Wang:

PGCapping: exploiting power gating for power capping and core lifetime balancing in CMPs. 13-22 - Cong Liu, Jian Li, Wei Huang, Juan Rubio, Evan Speight, Xiaozhu Lin:

Power-efficient time-sensitive mapping in heterogeneous systems. 23-32
Runtime
- Sreepathi Pai

, R. Govindarajan, Matthew J. Thazhuthaveetil:
Fast and efficient automatic memory management for GPUs using compiler-assisted runtime coherence scheme. 33-42 - Justin Talbot, Zachary DeVito, Pat Hanrahan:

Riposte: a trace-driven compiler and parallel VM for vector code in R. 43-52 - Scott Schneider, Martin Hirzel, Bugra Gedik, Kun-Lung Wu:

Auto-parallelizing stateful distributed streaming applications. 53-64
NoC
- Akbar Sharifi, Asit K. Mishra, Shekhar Srikantaiah, Mahmut T. Kandemir, Chita R. Das:

PEPON: performance-aware hierarchical power budgeting for NoC based multicores. 65-74 - Ronald G. Dreslinski, Thomas Manville, Korey Sewell, Reetuparna Das

, Nathaniel Ross Pinckney, Sudhir Satpathy, David T. Blaauw, Dennis Sylvester, Trevor N. Mudge:
XPoint cache: scaling existing bus-based coherence protocols for 2D and 3D many-core systems. 75-86 - Lei Wang, Poornachandran Kumar, Ki Hwan Yum, Eun Jung Kim:

APCR: an adaptive physical channel regulator for on-chip interconnects. 87-96
Scheduling and runtime
- Ioana Burcea, Livio Soares, Andreas Moshovos:

Pointy: a hybrid pointer prefetcher for managed runtime systems. 97-106 - Hiroshi Sasaki, Teruo Tanimoto

, Koji Inoue, Hiroshi Nakamura
:
Scalability-based manycore partitioning. 107-116 - Kshitij Sudan, Sadagopan Srinivasan, Rajeev Balasubramonian, Ravi R. Iyer:

Optimizing datacenter power with memory system levers for guaranteed quality-of-service. 117-126
Best papers
- Amy Wang, Matthew Gaudet, Peng Wu, José Nelson Amaral, Martin Ohmacht

, Christopher Barton, Raúl Silvera, Maged M. Michael:
Evaluation of blue Gene/Q hardware support for transactional memories. 127-136 - Víctor Jiménez, Roberto Gioiosa, Francisco J. Cazorla

, Alper Buyuktosunoglu, Pradip Bose, Francis P. O'Connell:
Making data prefetch smarter: adaptive prefetching on POWER7. 137-146 - Ashay Rane, James C. Browne:

Enhancing performance optimization of multicore chips and multichip nodes with data structure metrics. 147-156
Keynote address
- Kathy Yelick:

Compiling to avoid communication. 157-158
Transactional memory
- Justin Emile Gottschlich, Maurice Herlihy, Gilles Pokam, Jeremy G. Siek

:
Visualizing transactional memory. 159-170 - Luke Dalessandro, Michael L. Scott

:
Sandboxing transactional memory. 171-180 - Anurag Negi, Adrià Armejach

, Adrián Cristal
, Osman S. Unsal
, Per Stenström:
Transactional prefetching: narrowing the window of contention in hardware transactional memory. 181-190
Debugging and error detection
- Jingweijia Tan, Xin Fu:

RISE: improving the streaming processors reliability against soft errors in gpgpus. 191-200 - Michelle L. Goodstein, Shimin Chen, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:

Chrysalis analysis: incorporating synchronization arcs in dataflow-analysis-based parallel monitoring. 201-212 - Ignacio Laguna, Dong H. Ahn, Bronis R. de Supinski, Saurabh Bagchi, Todd Gamblin:

Probabilistic diagnosis of performance faults in large-scale parallel applications. 213-222
Keynote address
- William T. C. Kramer:

Top500 versus sustained performance: the top problems with the top500 list - and what to do about them. 223-230
Multicore and mMultithread
- Yong Li, Rami G. Melhem, Alex K. Jones

:
Practically private: enabling high performance CMPs through compiler-assisted data classification. 231-240 - Alberto Ros

, Stefanos Kaxiras:
Complexity-effective multicore coherence. 241-252 - Yongbing Huang, Zehan Cui, Licheng Chen, Wenli Zhang, Yungang Bao, Mingyu Chen:

HaLock: hardware-assisted lock contention detection in multithreaded applications. 253-262
Optimization
- Torsten Hoefler, Timo Schneider:

Runtime detection and optimization of collective communication patterns. 263-272 - Md. Kamruzzaman, Steven Swanson

, Dean M. Tullsen
:
Coalition threading: combining traditional andnon-traditional parallelism to maximize scalability. 273-282 - Yi Yang, Ping Xiang, Mike Mantor, Norm Rubin, Huiyang Zhou:

Shared memory multiplexing: a novel way to improve GPGPU throughput. 283-292
Memory system (I)
- Mainak Chaudhuri, Jayesh Gaur, Nithiyanandan Bashyam, Sreenivas Subramoney

, Joseph Nuzman:
Introducing hierarchy-awareness in replacement and bypass algorithms for last-level caches. 293-304 - Andreas Sandberg

, David Black-Schaffer, Erik Hagersten:
Efficient techniques for predicting cache sharing and throughput. 305-314 - Lingda Li

, Dong Tong, Zichao Xie, Junlin Lu, Xu Cheng:
Optimal bypass monitor for high performance last-level caches. 315-324
Hybrid systems
- Vijay Sathish, Michael J. Schulte, Nam Sung Kim:

Lossless and lossy memory I/O link compression for improving performance of GPGPU workloads. 325-334 - Rafael Ubal, Byunghyun Jang, Perhaad Mistry, Dana Schaa, David R. Kaeli:

Multi2Sim: a simulation framework for CPU-GPU computing. 335-344 - Abdullah Gharaibeh, Lauro Beltrão Costa

, Elizeu Santos-Neto, Matei Ripeanu:
A yoke of oxen and a thousand chickens for heavy lifting graph processing. 345-354
Memory system (II)
- Vivek Seshadri, Onur Mutlu

, Michael A. Kozuch, Todd C. Mowry:
The evicted-address filter: a unified mechanism to address both cache pollution and thrashing. 355-366 - Lei Liu, Zehan Cui, Mingjie Xing, Yungang Bao, Mingyu Chen, Chengyong Wu:

A software memory partition approach for eliminating bank-level interference in multicore systems. 367-376 - Gennady Pekhimenko, Vivek Seshadri, Onur Mutlu

, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
Base-delta-immediate compression: practical data compression for on-chip caches. 377-388
Accelerators
- Anil Krishna, Timothy Heil, Nicholas Lindberg, Farnaz Toussi, Steven VanderWiel:

Hardware acceleration in the IBM PowerEN processor: architecture and performance. 389-400 - Hao Wang

, Vijay Sathish, Ripudaman Singh, Michael J. Schulte, Nam Sung Kim:
Workload and power budget partitioning for single-chip heterogeneous processors. 401-410 - Bharat Sukhwani, Hong Min, Mathew Thoennes, Parijat Dube, Balakrishna Iyer, Bernard Brezzo, Donna Dillenberger, Sameh W. Asaad:

Database analytics acceleration using FPGAs. 411-420
PACT poster papers
- Cheng Li, Mark Browning, Paul V. Gratz

, Samuel Palermo:
LumiNOC: a power-efficient, high-performance, photonic network-on-chip for future parallel architectures. 421-422 - Jong-Hyuk Lee, Ziyi Liu, Xiaonan Tian, Dong Hyuk Woo, Weidong Shi, Dainis Boumber

, Yonghong Yan, Kyeong-An Kwon:
Acceleration of bulk memory operations in a heterogeneous multicore architecture. 423-424 - Nilanjan Goswami, Zhongqi Li, Ajit Verma, Ramkumar Shankar, Tao Li:

Integrating nanophotonics in GPU microarchitecture. 425-426 - John Sartori, Rakesh Kumar:

Branch and data herding: reducing control and memory divergence for error-tolerant GPU applications. 427-428 - Huimin Cui, Qing Yi, Jingling Xue

, Xiaobing Feng:
Layout-oblivious optimization for matrix computations. 429-430 - Pierre Estérie, Mathias Gaunard, Joel Falcou, Jean-Thierry Lapresté, Brigitte Rozoy:

Boost.SIMD: generic programming for portable SIMDization. 431-432 - Zhijia Zhao, Bo Wu, Xipeng Shen:

Speculative parallelization needs rigor: probabilistic analysis for optimal speculation of finite-state machine applications. 433-434 - Vladimir Gajinov, Srdjan Stipic, Osman S. Unsal

, Tim Harris, Eduard Ayguadé
, Adrián Cristal
:
Supporting stateful tasks in a dataflow graph. 435-436 - Alexander Collins, Christian Fensch, Hugh Leather

:
MaSiF: machine learning guided auto-tuning of parallel skeletons. 437-438 - Lihang Zhao, Woojin Choi, Jeffrey T. Draper:

TMNOC: a case of HTM and NoC co-design for increased energy efficiency and concurrency. 439-440 - Nachiappan Chidambaram Nachiappan

, Asit K. Mishra, Mahmut T. Kandemir, Anand Sivasubramaniam, Onur Mutlu
, Chita R. Das:
Application-aware prefetch prioritization in on-chip networks. 441-442 - Jason Zebchuk, Harold W. Cain, Vijayalakshmi Srinivasan, Andreas Moshovos:

ReCaP: a region-based cure for the common cold cache. 443-444 - Syed Zohaib Gilani, Nam Sung Kim, Michael J. Schulte:

Power-efficient computing for compute-intensive GPGPU applications. 445-446 - Wei Ding, Mahmut T. Kandemir, Yuanrui Zhang, Emre Kultursay:

Off-chip access localization for NoC-based multicores. 447-448 - Ping Xiang, Yi Yang, Mike Mantor, Norm Rubin, Huiyang Zhou:

Many-thread aware instruction-level parallelism: architecting shader cores for GPU computing. 449-450 - Joan J. Valls

, Alberto Ros
, Julio Sahuquillo
, María Engracia Gómez
, José Duato
:
PS-Dir: a scalable two-level directory cache. 451-452 - Foivos S. Zakkak, Dimitrios Chasapis

, Polyvios Pratikakis, Angelos Bilas
, Dimitrios S. Nikolopoulos
:
Inference and declaration of independence: impact on deterministic task parallelism. 453-454 - Reetuparna Das

, Rachata Ausavarungnirun, Onur Mutlu
, Akhilesh Kumar, Mani Azimi:
Application-to-core mapping policies to reduce memory interference in multi-core systems. 455-456 - David Eklov, Nikos Nikoleris

, David Black-Schaffer, Erik Hagersten:
Bandwidth bandit: quantitative characterization of memory contention. 457-458 - Rakesh Kumar, Alejandro Martínez, Antonio González

:
Speculative dynamic vectorization for HW/SW co-designed processors. 459-460 - Bin Ren, Gagan Agrawal, James R. Larus, Todd Mytkowicz, Tomi Poutanen, Wolfram Schulte:

Fine-grained parallel traversals of irregular data structures. 461-462 - Aydin Buluç

, Armando Fox, John R. Gilbert, Shoaib Kamil, Adam Lugowski
, Leonid Oliker, Samuel Williams
:
High-performance analysis of filtered semantic graphs. 463-464 - Karthik T. Sundararajan, Timothy M. Jones

, Nigel P. Topham:
Energy-efficient cache partitioning for future CMPs. 465-466 - Christopher W. Fletcher, Rachael Harding, Omer Khan, Srinivas Devadas:

A low-overhead dynamic optimization framework for multicores. 467-468 - Congming Chen, Wei Huo, Xiaobing Feng:

Making it practical and effective: fast and precise may-happen-in-parallel analysis. 469-470
ACM SRC abstracts
- Woojin Choi, Lihang Zhao, Jeff Draper:

Mileage-based contention management in transactional memory. 471-472 - Shuaiwen Song, Kirk W. Cameron

:
System-level power-performance efficiency modeling for emergent GPU architectures. 473-474 - Matthew Gaudet, José Nelson Amaral:

Transactional event profiling in a best-effort hardware transactional memory system. 475-476 - Hari K. Pyla, Srinidhi Varadarajan:

Transparent runtime deadlock elimination. 477-478 - Peng Li, Kevin Gomez, David J. Lilja:

Design of a storage processing unit. 479-480 - Kanakagiri Raghavendra, Tripti S. Warrier

, Madhu Mutyam
:
SkipCache: miss-rate aware cache management. 481-482 - Arnamoy Bhattacharyya:

Using combined profiling to decide when thread level speculation is profitable. 483-484 - Biswabandan Panda, Shankar Balachandran:

Hardware prefetchers for emerging parallel applications. 485-486 - Fábio Coutinho, Luís Alfredo V. de Carvalho:

Strategies based on green policies to the grid resource allocation. 487-488 - Gennady Pekhimenko, Todd C. Mowry, Onur Mutlu

:
Linearly compressed pages: a main memory compression framework with low complexity and low latency. 489-490 - Cong Liu:

Energy-efficient workload mapping in heterogeneous systems with multiple types of resources. 491-492 - Lina Sawalha, Ronald D. Barnes:

Phase-based scheduling and thread migration for heterogeneous multicore processors. 493-494

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














