Proceedings of the 48th International Symposium on Microarchitecture, 2015
We present Swarm, a novel architecture that exploits ordered irregular parallelism, which is abundant but hard to mine with current software and hardware techniques. In this architecture, programs consist of short tasks with programmer-specified timestamps. Swarm executes tasks speculatively and out of order, and efficiently speculates thousands of tasks ahead of the earliest active task to uncover ordered parallelism. Swarm builds on prior TLS and HTM schemes, and contributes several new techniques that allow it to scale to large core counts and speculation windows, including a new execution model, speculation-aware hardware task management, selective aborts, and scalable ordered commits. We evaluate Swarm on graph analytics, simulation, and database benchmarks. At 64 cores, Swarm achieves 51-122× speedups over a single-core system, and outperforms software-only parallel algorithms by 3-18×.
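To make the execution model concrete, the following is a minimal sequential sketch of timestamp-ordered tasks in the spirit of Swarm. The `TaskUnit`, `enqueue` signature, and toy example are illustrative assumptions, not Swarm's actual ISA; the hardware runs such tasks speculatively and out of order rather than strictly in timestamp order as done here.

```cpp
// Software stand-in for Swarm's programming model: short tasks carry
// programmer-assigned timestamps and may enqueue further timestamped tasks;
// this sketch simply executes them in timestamp order.
#include <cstdint>
#include <cstdio>
#include <functional>
#include <queue>
#include <vector>

using Task = std::function<void()>;

struct Entry {
    uint64_t ts;   // program-assigned timestamp
    Task work;
};
struct LaterFirst {  // makes the priority queue a min-heap on timestamp
    bool operator()(const Entry& a, const Entry& b) const { return a.ts > b.ts; }
};

struct TaskUnit {
    std::priority_queue<Entry, std::vector<Entry>, LaterFirst> pq;

    void enqueue(uint64_t ts, Task t) { pq.push({ts, std::move(t)}); }

    void run() {
        while (!pq.empty()) {
            Entry e = pq.top();  // copy out, then pop
            pq.pop();
            e.work();            // a task may enqueue new, later tasks
        }
    }
};

int main() {
    TaskUnit tu;
    // Each task does a little work and schedules a follow-up at a later
    // timestamp, mimicking the short ordered tasks Swarm targets.
    std::function<void(uint64_t)> step = [&](uint64_t ts) {
        std::printf("task at timestamp %llu\n", (unsigned long long)ts);
        if (ts < 5) tu.enqueue(ts + 1, [&, ts] { step(ts + 1); });
    };
    tu.enqueue(0, [&] { step(0); });
    tu.run();
}
```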
IEEE Micro, 2016
2016
Multicore systems must exploit locality to scale, scheduling tasks to minimize data movement. While locality-aware parallelism is well studied in non-speculative systems, it has received little attention in speculative systems (e.g., HTM or TLS), which hinders their scalability. We present spatial hints, a technique that leverages program knowledge to reveal and exploit locality in speculative parallel programs. A hint is an abstract integer, given when a speculative task is created, that denotes the data the task is likely to access. We show it is easy to modify programs to convey locality through hints. We design simple hardware techniques that allow a state-of-the-art, tiled speculative architecture to exploit hints by: (i) running tasks likely to access the same data on the same tile, (ii) serializing tasks likely to conflict, and (iii) balancing tasks across tiles in a locality-aware fashion. We also show that programs can often be restructured to make hints more effective. Together, these techniques make speculative parallelism practical on large-scale systems: at 256 cores, hints achieve near-linear scalability on nine challenging applications, improving performance over hint-oblivious scheduling by 3.3× (gmean) and by up to 16×. Hints also make speculation far more efficient, reducing wasted work by 6.4× and traffic by 3.5× on average.
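A small sketch of the software-visible side of this idea follows: the programmer attaches an abstract integer (e.g., the id of the vertex a task will update) at task creation, and a hint-aware scheduler maps equal hints to the same tile. The hashing scheme and `NUM_TILES` are illustrative assumptions, not the hardware mechanism described in the paper.

```cpp
// Spatial hints in software terms: same hint -> same tile (locality and
// serialization of likely-conflicting tasks); different hints spread across
// tiles. Placement policy and constants here are illustrative only.
#include <cstdint>
#include <functional>

constexpr unsigned NUM_TILES = 16;

// Simple 64-bit mix hash; any hash that spreads hints evenly would do.
inline uint64_t mix(uint64_t h) {
    h ^= h >> 33; h *= 0xff51afd7ed558ccdULL;
    h ^= h >> 33; h *= 0xc4ceb9fe1a85ec53ULL;
    return h ^ (h >> 33);
}

inline unsigned tile_for_hint(uint64_t hint) {
    return static_cast<unsigned>(mix(hint) % NUM_TILES);
}

// What the programmer supplies at task creation; in a real system this maps
// to the task-enqueue operation rather than a plain struct.
struct TaskDesc {
    uint64_t timestamp;
    uint64_t hint;                  // e.g., destination vertex id
    std::function<void()> work;
};

inline unsigned place(const TaskDesc& t) { return tile_for_hint(t.hint); }
```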
2007
Task-selection policies are critical to the performance of any architecture that uses speculation to extract parallel tasks from a sequential thread. This paper demonstrates that the immediate postdominators of conditional branches provide a larger set of parallel tasks than existing task-selection heuristics, which are limited to programming language constructs (such as loops or procedure calls). Our evaluation shows that postdominance-based task selection achieves, on average, more than double the speedup of the best individual heuristic, and 33% more speedup than the best combination of heuristics.
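As a small illustration of the heuristic (not output from the paper's selection algorithm): for a plain if/else, which loop- and call-based heuristics ignore, the immediate postdominator of the branch is the join point, and that is where a postdominance-based selector can cut a speculative task.

```cpp
// The comments mark control-flow roles; the task boundary shown is the
// general idea of postdominance-based selection on a branch.
#include <cstdio>

int classify(int x) {
    int result;
    if (x % 2 == 0) {            // conditional branch B
        result = x / 2;          //   then-block (control-dependent on B)
    } else {
        result = 3 * x + 1;      //   else-block (control-dependent on B)
    }
    // Immediate postdominator of B: every path from B reaches this point.
    // A postdominance-based selector can spawn the continuation from here as
    // a speculative task, overlapping it with the branch region above.
    std::printf("classified %d -> %d\n", x, result);
    return result;
}
```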
SIGPLAN Notices, 2007
Irregular applications, which manipulate large, pointer-based data structures like graphs, are difficult to parallelize manually. Automatic tools and techniques such as restructuring compilers and runtime speculative execution have failed to uncover much parallelism in these applications, in spite of a lot of effort by the research community. These difficulties have even led some researchers to wonder if there is any coarse-grain parallelism worth exploiting in irregular applications.
HotPar'10: Proceedings …, 2010
Multicore processors have forced mainstream programmers to rethink the way they design software. Parallelism will be the avenue for performance gains in these multicore processors but will require new tools and methodologies to gain full acceptance by everyday programmers. As a step towards improved parallelization tools, we propose a parallelization taxonomy that categorizes tools based on which of five fundamental stages of parallelization they assist with. Based on this taxonomy, we find that many popular parallelization tools focus on the final stages, leaving the programmer to perform the initial stages without assistance. In this paper we provide a preliminary description of pyrprof, a tool that helps the programmer locate parallel regions of code and decide which regions to parallelize first. pyrprof performs dynamic critical path analysis and utilizes the structure of programs to highlight exploitable forms of parallelism. A case study based on MPEG encoding is used to demonstrate pyrprof's effectiveness.
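The analysis pyrprof builds on is dynamic critical-path analysis. The sketch below shows the generic computation underneath it, the longest cost-weighted path through a DAG of program regions, and is not pyrprof's profiler machinery; node costs and the example graph are made up for illustration.

```cpp
// Critical path of a weighted region DAG: work off that path is parallelism
// a profiling tool can point the programmer at.
#include <cstdint>
#include <cstdio>
#include <queue>
#include <vector>
#include <algorithm>

struct Dag {
    std::vector<uint64_t> cost;          // cost[v] = work in region v
    std::vector<std::vector<int>> succ;  // succ[v] = regions that depend on v
};

uint64_t critical_path(const Dag& g) {
    const int n = static_cast<int>(g.cost.size());
    std::vector<int> indeg(n, 0);
    for (int v = 0; v < n; ++v)
        for (int s : g.succ[v]) ++indeg[s];

    // finish[v] = earliest completion time of v with unlimited parallelism.
    std::vector<uint64_t> finish(n, 0);
    std::queue<int> ready;
    for (int v = 0; v < n; ++v)
        if (indeg[v] == 0) { finish[v] = g.cost[v]; ready.push(v); }

    uint64_t longest = 0;
    while (!ready.empty()) {
        int v = ready.front(); ready.pop();
        longest = std::max(longest, finish[v]);
        for (int s : g.succ[v]) {
            finish[s] = std::max(finish[s], finish[v] + g.cost[s]);
            if (--indeg[s] == 0) ready.push(s);
        }
    }
    return longest;  // total work / critical path = available parallelism
}

int main() {
    // Diamond: 0 -> {1,2} -> 3. Critical path = 5 + 20 + 5 = 30; total work 40.
    Dag g{{5, 10, 20, 5}, {{1, 2}, {3}, {3}, {}}};
    std::printf("critical path = %llu\n", (unsigned long long)critical_path(g));
}
```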
2021 29th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS), 2021
Enabling efficient fine-grained task parallelism is a significant challenge for hardware platforms with increasingly many cores. Existing techniques do not scale to hundreds of threads due to the high cost of synchronization in concurrent data structures. To overcome these limitations we present XQueue, a novel lock-less concurrent queuing system with relaxed ordering semantics that is geared towards realizing scalability up to hundreds of concurrent threads. We demonstrate the scalability of XQueue using microbenchmarks and show that XQueue can deliver concurrent operations with latencies as low as 110 cycles at scales of up to 192 cores (up to 6900× improvement compared to traditional synchronization mechanisms) across diverse hardware, including x86, ARM, and Power9. The reduced latency allows XQueue to provide orders of magnitude (3300×) better throughput than existing techniques. To evaluate the real-world benefits of XQueue, we integrated XQueue with LLVM OpenMP and evaluated five unmodified benchmarks from the Barcelona OpenMP Task Suite (BOTS) as well as a graph traversal benchmark from the GAP benchmark suite. We compared the XQueue-enabled LLVM OpenMP implementation with the native LLVM and GNU OpenMP versions. Using fine-grained task workloads, XQueue can deliver 4× to 6× speedup compared to native GNU OpenMP and LLVM OpenMP in many cases, with speedups as high as 116× in some cases.
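The sketch below only illustrates the relaxed-ordering idea such designs exploit: splitting one logical queue into many lanes so concurrent threads rarely touch the same lane, at the cost of strict FIFO order across lanes. XQueue achieves this lock-lessly; the per-lane mutexes here are purely to keep the sketch short and obviously correct.

```cpp
// A "relaxed FIFO": round-robin lane selection reduces contention, but items
// from different lanes may be dequeued out of global enqueue order.
#include <array>
#include <atomic>
#include <deque>
#include <mutex>
#include <optional>

template <typename T, size_t LANES = 64>
class RelaxedQueue {
    struct Lane {
        std::mutex m;
        std::deque<T> items;
    };
    std::array<Lane, LANES> lanes_;
    std::atomic<size_t> enq_{0}, deq_{0};  // round-robin lane pickers

public:
    void enqueue(T value) {
        Lane& lane = lanes_[enq_.fetch_add(1, std::memory_order_relaxed) % LANES];
        std::lock_guard<std::mutex> g(lane.m);
        lane.items.push_back(std::move(value));
    }

    // Returns an item from some lane, or nothing if all lanes are empty.
    std::optional<T> dequeue() {
        size_t start = deq_.fetch_add(1, std::memory_order_relaxed);
        for (size_t i = 0; i < LANES; ++i) {
            Lane& lane = lanes_[(start + i) % LANES];
            std::lock_guard<std::mutex> g(lane.m);
            if (!lane.items.empty()) {
                T v = std::move(lane.items.front());
                lane.items.pop_front();
                return v;
            }
        }
        return std::nullopt;
    }
};
```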
2012 SC Companion: High Performance Computing, Networking Storage and Analysis, 2012
Many sparse or irregular scientific computations are memory bound and benefit from locality improving optimizations such as blocking or tiling. These optimizations result in asynchronous parallelism that can be represented by arbitrary task graphs. Unfortunately, most popular parallel programming models with the exception of Threading Building Blocks (TBB) do not directly execute arbitrary task graphs. In this paper, we compare the programming and execution of arbitrary task graphs qualitatively and quantitatively in TBB, the OpenMP doall model, the OpenMP 3.0 task model, and Cilk Plus. We present performance and scalability results for 8 and 40 core shared memory systems on a sparse matrix iterative solver and a molecular dynamics benchmark.
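Since the abstract's point is that TBB can execute arbitrary task graphs directly, a minimal diamond-shaped graph using TBB's flow graph interface is shown below. This is one way TBB expresses such graphs; the paper's solver and molecular dynamics benchmarks may use different node bodies or a different TBB mechanism, and the bodies here are placeholders.

```cpp
// A -> {B, C} -> D expressed directly as a TBB flow graph: B and C may run in
// parallel; D waits for both.
#include <tbb/flow_graph.h>
#include <cstdio>

int main() {
    using namespace tbb::flow;
    graph g;

    continue_node<continue_msg> a(g, [](const continue_msg&) -> continue_msg {
        std::puts("A"); return {}; });
    continue_node<continue_msg> b(g, [](const continue_msg&) -> continue_msg {
        std::puts("B"); return {}; });
    continue_node<continue_msg> c(g, [](const continue_msg&) -> continue_msg {
        std::puts("C"); return {}; });
    continue_node<continue_msg> d(g, [](const continue_msg&) -> continue_msg {
        std::puts("D"); return {}; });

    make_edge(a, b);
    make_edge(a, c);
    make_edge(b, d);
    make_edge(c, d);

    a.try_put(continue_msg());  // trigger the root
    g.wait_for_all();
    return 0;
}
```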
2009
Most client-side applications running on multicore processors are likely to be irregular programs that deal with complex, pointer-based data structures such as large sparse graphs and trees. However, we understand very little about the nature of parallelism in irregular algorithms, let alone how to exploit it effectively on multicore processors.
2010
Making efficient use of modern multi-core and future many-core CPUs is a major challenge. We describe a new compiler-based platform, Prospect, that supports the parallelization of sequential applications. The underlying approach generalizes an existing approach for parallelizing runtime checks. The basic idea is to generate two variants of the application: (1) a fast variant having bare-bone functionality, and (2) a slow variant with extra functionality. The fast variant is executed sequentially.
2000
Traditional parallel compilers do not effectively parallelize irregular applications because they contain little loop-level parallelism. We explore Speculative Task Parallelism (STP), where tasks are full procedures and entire natural loops. Through profiling and compiler analysis, we find tasks that are speculatively memory- and control-independent of their neighboring code. Via speculative futures, these tasks may be executed in parallel with preceding code when there is a high probability of independence.
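The sketch below shows only the execution shape STP targets, using an ordinary C++ future: a procedure from later in program order is launched early so it overlaps with the code that precedes it. STP does this speculatively, with profiling and a recovery path when independence is violated; `std::async` can only express the case where independence is already guaranteed, and the function and data here are made up for illustration.

```cpp
#include <cstdio>
#include <future>
#include <numeric>
#include <vector>

// The "task": a full procedure, as in STP.
static long summarize(const std::vector<int>& data) {
    return std::accumulate(data.begin(), data.end(), 0L);
}

int main() {
    std::vector<int> stats(1'000'000, 1);

    // Launch the procedure as a future *before* the preceding code runs...
    std::future<long> f =
        std::async(std::launch::async, summarize, std::cref(stats));

    // ...so this independent preceding work overlaps with it.
    long other_work = 0;
    for (int i = 0; i < 1'000'000; ++i) other_work += i % 7;

    // Original call site: consume the result here.
    std::printf("sum=%ld other=%ld\n", f.get(), other_work);
}
```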
Computer, 2006
Some emerging technologies try to exploit the parallel capabilities of modern processors.
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), 2020
Proceedings 28th Annual International Symposium on Computer Architecture, 2001
Speculative thread-level parallelization is a promising way to speed up codes that compilers fail to parallelize. While several speculative parallelization schemes have been proposed for different machine sizes and types of codes, the results so far show that it is hard to deliver scalable speedups. Often, the problem is not true dependence violations, but sub-optimal architectural design. Consequently, we attempt to identify and eliminate major architectural bottlenecks that limit the scalability of speculative parallelization. The solutions that we propose are: low-complexity commit in constant time to eliminate the task commit bottleneck, a memory-based overflow area to eliminate stalls due to speculative buffer overflow, and exploiting high-level access patterns to minimize speculation-induced traffic. To show that the resulting system is truly scalable, we perform simulations with up to 128 processors. With our optimizations, the speedups for 128 and 64 processors reach 63 and 48, respectively. The average speedup for 64 processors is 32, nearly four times higher than without our optimizations.
IEEE Transactions on Computers, 2014
Speculative parallelization is a runtime technique that optimistically executes sequential code in parallel, checking that no dependence violations arise. In the case of a dependence violation, all mechanisms proposed so far either switch to sequential execution, or conservatively stop and restart the offending thread and all its successors, potentially discarding work that does not depend on this particular violation. In this work we systematically explore the design space of solutions for this problem, proposing a new mechanism that reduces the number of threads that must be restarted when a data dependence violation is found. Our new solution, called exclusive squashing, keeps track of inter-thread dependencies at runtime, selectively stopping and restarting offending threads, together with all threads that have consumed data from them. We have compared this new approach with existing solutions on a real system, executing different applications whose loops are not analyzable at compile time and exhibit inter-thread dependence violations in as much as 10% of cases at runtime. Our experimental results show a relative performance improvement of up to 14%, together with a reduction of one-third in the number of squashed threads. The speculative parallelization scheme and benchmarks described in this paper are available upon request.
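A short sketch of the bookkeeping behind selective squashing follows: record which threads consumed values produced by which other threads, and when a violation is detected, squash only the offender and the transitive set of its consumers. The data structures and names are illustrative, not the paper's runtime.

```cpp
#include <queue>
#include <set>
#include <unordered_map>
#include <unordered_set>

using ThreadId = int;

struct DependenceLog {
    // consumers[p] = threads that read data speculatively produced by p.
    std::unordered_map<ThreadId, std::unordered_set<ThreadId>> consumers;

    void record_forwarding(ThreadId producer, ThreadId consumer) {
        consumers[producer].insert(consumer);
    }

    // Threads to restart when `offender` violated a dependence: the offender
    // plus everything that (transitively) consumed its data; all other
    // threads keep running.
    std::set<ThreadId> squash_set(ThreadId offender) const {
        std::set<ThreadId> to_squash{offender};
        std::queue<ThreadId> frontier;
        frontier.push(offender);
        while (!frontier.empty()) {
            ThreadId t = frontier.front();
            frontier.pop();
            auto it = consumers.find(t);
            if (it == consumers.end()) continue;
            for (ThreadId c : it->second)
                if (to_squash.insert(c).second) frontier.push(c);
        }
        return to_squash;
    }
};
```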
Lecture Notes in Computer Science, 2015
Scheduling is one of the factors that most directly affect performance in Thread-Level Speculation (TLS). Since loops may present dependences that cannot be predicted before runtime, finding a good chunk size is not a simple task. The most used mechanism, Fixed-Size Chunking (FSC), requires many "dry-runs" to set the optimal chunk size. If the loop does not present dependence violations at runtime, scheduling only needs to deal with load balancing issues. For loops where the general pattern of dependences is known, as is the case with Randomized Incremental Algorithms, specialized mechanisms have been designed to maximize performance. To make TLS available to a wider community, a general scheduling algorithm is needed that requires neither a priori knowledge of the expected dependence pattern nor previous dry-runs to adjust any parameter. In this paper, we present an algorithm that estimates at runtime the best size of the next chunk to be scheduled. The algorithm builds on our previous experience in the design and testing of other scheduling mechanisms, and it has a solid mathematical basis. The result is a method that, using information from the execution of the previous chunks, decides the size of the next chunk to be scheduled. Our experimental results show that the proposed scheduling function matches or even exceeds the performance obtained with FSC, while greatly reducing the need for a costly and careful search for the best fixed chunk size.
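The abstract does not reproduce the paper's estimator, so the sketch below only shows the shape of such a runtime controller: after each chunk commits or is squashed, feed the outcome back into the size of the next chunk. The grow/shrink rule is a deliberately naive placeholder, not the mathematically derived function the paper proposes.

```cpp
#include <algorithm>
#include <cstddef>

struct ChunkSizer {
    std::size_t next = 16;  // initial chunk size (arbitrary starting point)

    // Called after each chunk finishes. `violated` = the chunk was squashed by
    // a dependence violation; `iters_done` = iterations it completed first.
    void feedback(bool violated, std::size_t iters_done) {
        if (violated)
            next = std::max<std::size_t>(1, iters_done / 2);   // back off
        else
            next = std::min<std::size_t>(4096, next * 2);      // grow while conflict-free
    }
};
```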
Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, 2007
Implicit Parallelism with Ordered Transactions (IPOT) is an extension of sequential or explicitly parallel programming models to support speculative parallelization. The key idea is to specify opportunities for parallelization in a sequential program using annotations similar to transactions. Unlike explicit parallelism, IPOT annotations do not require the absence of data dependence, since the parallelization relies on runtime support for speculative execution. IPOT as a parallel programming model is determinate, i.e., program semantics are independent of the thread scheduling. For optimization, non-determinism can be introduced selectively. We describe the programming model of IPOT and an online tool that recommends boundaries of ordered transactions by observing a sequential execution. On three example HPC workloads we demonstrate that our method is effective in identifying opportunities for fine-grain parallelization. Using the automated task recommendation tool, we were able to perform the parallelization of each program within a few hours.
2017 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), 2017
This work studies the interplay between multithreaded cores and speculative parallelism (e.g., transactional memory or thread-level speculation). These techniques are often used together, yet they have been developed independently. This disconnect causes major performance pathologies: increasing the number of threads per core adds conflicts and wasted work, and puts pressure on speculative execution resources. These pathologies often squander the benefits of multithreading. We present speculation-aware multithreading (SAM), a simple policy that addresses these pathologies. By coordinating instruction dispatch and conflict resolution priorities, SAM focuses execution resources on work that is more likely to commit, avoiding aborts and using speculation resources more efficiently. We design SAM variants for in-order and out-of-order cores. SAM is cheap to implement and makes multithreaded cores much more beneficial on speculative parallel programs. We evaluate SAM on systems with up to 64 SMT cores. With SAM, 8-threaded cores outperform single-threaded cores by 2.33× on average, while a speculation-oblivious policy yields a 1.85× speedup. SAM also reduces wasted work by 52%.
International Journal of Parallel Programming, 2015
The Single-Source Shortest Path (SSSP) problem arises in many different fields. In this paper, we present a GPU SSSP algorithm implementation. Our work significantly speeds up the computation of the SSSP, not only with respect to a CPU-based version, but also to other state-of-the-art GPU implementations based on Dijkstra. Both GPU implementations have been evaluated using the latest NVIDIA architectures. The graphs chosen as input sets vary in nature, size, and fan-out degree, in order to evaluate the behavior of the algorithms for different data classes. Additionally, we have enhanced our GPU algorithm implementation using two optimization techniques: a proper choice of thread-block size, and the modification of the GPU L1 cache memory state of NVIDIA devices. These optimizations lead to performance improvements of up to 23% with respect to the non-optimized versions. In addition, we compare several NVIDIA boards to determine which is best suited to each class of graphs, depending on their features. Finally, we compare our results with an optimized sequential implementation of Dijkstra's algorithm included in the reference Boost library, obtaining an improvement ratio of up to 19× for some graph families, using less memory space.
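For reference, a textbook binary-heap Dijkstra in plain C++ is sketched below as a stand-in for the sequential CPU baseline (the paper compares against Boost's optimized implementation, not this code). It only pins down what the GPU kernels compute: single-source distances over a weighted adjacency list.

```cpp
#include <cstdint>
#include <functional>
#include <limits>
#include <queue>
#include <utility>
#include <vector>

using Edge = std::pair<int, uint64_t>;                // (neighbor, weight)
constexpr uint64_t INF = std::numeric_limits<uint64_t>::max();

std::vector<uint64_t> dijkstra(const std::vector<std::vector<Edge>>& adj, int src) {
    std::vector<uint64_t> dist(adj.size(), INF);
    using State = std::pair<uint64_t, int>;           // (distance, vertex)
    std::priority_queue<State, std::vector<State>, std::greater<State>> pq;
    dist[src] = 0;
    pq.push({0, src});
    while (!pq.empty()) {
        auto [d, v] = pq.top();
        pq.pop();
        if (d != dist[v]) continue;                   // stale queue entry
        for (auto [u, w] : adj[v]) {
            if (d + w < dist[u]) {                    // relax edge (v, u)
                dist[u] = d + w;
                pq.push({dist[u], u});
            }
        }
    }
    return dist;
}
```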