0% found this document useful (0 votes)

195 views29 pages

CoSynthesis Algorithms Partitioning

This document discusses co-synthesis algorithms for hardware/software partitioning of embedded systems. It describes two examples of partitioning algorithms: Vulcan and Cosyma. Vulcan takes a primal approach, starting with all functionality in hardware and moving parts to software to reduce cost. Cosyma takes a dual approach, starting with all functionality in software and moving parts to hardware to improve performance. Both algorithms partition the system specification into basic blocks or threads and estimate costs and performance to determine the partitioning.

Uploaded by

honeygupta480

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

195 views29 pages

CoSynthesis Algorithms Partitioning

Uploaded by

honeygupta480

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

Co-Synthesis Algorithms: HW/SW Partitioning

Part of HW/SW Codesign of Embedded Systems Course (CE 40-226)

Winter-Spring 2001 Codesign of Embedded Systems 1

Topics

Introduction Preliminaries Hardware/Software Partitioning Distributed System Co-Synthesis

Winter-Spring 2001

Codesign of Embedded Systems

Topics

Introduction A Classification Examples

Vulcan Cosyma

Winter-Spring 2001

Codesign of Embedded Systems

Introduction to HW/SW Partitioning

The first variety of co-synthesis applications Definition

A HW/SW partitioning algorithm implements a specification on some sort of multiprocessor architecture Multiprocessor architecture = one CPU + some ASICs on CPU bus

Usually

Winter-Spring 2001

Codesign of Embedded Systems

Introduction to HW/SW Partitioning (contd)

A Terminology

Allocation

Synthesis methods which design the multiprocessor topology along with the PEs and SW architecture The process of assigning PE (CPU and/or ASICs) time to processes to get executed

Scheduling

Winter-Spring 2001

Codesign of Embedded Systems

Introduction to HW/SW Partitioning (contd)

In most partitioning algorithms

Type of CPU is fixed and given ASICs must be synthesized

What function to implement on each ASIC? What characteristics should the implementation have? CDFG is the starting model

Are single-rate synthesis problems

Winter-Spring 2001

Codesign of Embedded Systems

HW/SW Partitioning (contd)

Normal use of architectural components

CPU performs less computationally-intensive functions ASICs used to accelerate core functions High-performance applications

Where to use?

No CPU is fast enough for the operations ASIC accelerators allow use of much smaller, cheaper CPU
Codesign of Embedded Systems 7

Low-cost application

Winter-Spring 2001

A Classification

Criterion: Optimization Strategy

Trade-off between Performance and Cost Performance is the primary goal First, all functionality in ASICs. Progressively move more to CPU to reduce cost. Cost is the primary goal First, all functions in the CPU. Move operations to the ASIC to meet the performance goal.

Primal Approach

Dual Approach

Winter-Spring 2001

Codesign of Embedded Systems

A Classification (contd)

Classification due to optimization strategy (contd)

Example co-synthesis systems

Vulcan (Stanford): Primal strategy Cosyma (Braunschweig, Germany): Dual strategy

Winter-Spring 2001

Codesign of Embedded Systems

Co-Synthesis Algorithms: HW/SW Partitioning

HW/SW Partitioning Examples: Vulcan

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan

Gupta, De Micheli, Stanford University Primal approach

1. All-HW initial implementation. 2. Iteratively move functionality to CPU to reduce cost.

System specification language

HardwareC

Is compiled into a flow graph

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

x=a; y=b;
nop

HardwareC

1
x=a

1
y=b

cond

if (c>d) x=e; else y=f;

c>d
x=e

c<=d
y=f

HardwareC
Winter-Spring 2001 Codesign of Embedded Systems 12

Partitioning Examples: Vulcan (contd)

Flow Graph Definition

A variation of a (single-rate) task graph Nodes

Represent operations Typically low-level operations: mult, add Represent data dependencies Each contains a Boolean condition under which the edge is traversed

Edges

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Flow Graph

is executed repeatedly at some rate can have initiation-time constraints for each node

t(vj)+lij t(vj) t(vj)+uij mi Ri Mi

can have rate constraints on each node

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Vulcan Co-synthesis Algorithm

Partitioning quantum is a thread

Algorithm divides the flow graph into threads and allocates them Thread boundary is determined by
1. (always) a non-deterministic delay element, such as wait for an external variable 2. (on choice) other points of flow graph

Target architecture

CPU + Co-processor (multiple ASICs)

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Vulcan Co-synthesis algorithm (contd)

Allocation

Primal approach

Scheduling

is done by a scheduler on the target CPU

is generated as part of synthesis process schedules all threads (both HW and SW threads)

cannot be static, due to some threads non-deterministic initiation-time

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Vulcan Co-synthesis algorithm (contd)

Cost estimation

SW implementation

Code size relatively straight forward Data size Biggest challenge. Vulcan puts some effort to find bounds for each thread

HW implementation ?
Codesign of Embedded Systems 17

Winter-Spring 2001

Partitioning Examples: Vulcan (contd)

Vulcan Co-synthesis algorithm (contd)

Performance estimation

Both SW- and HW-implementation

From flow-graph, and basic execution times for the operators

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Algorithm Details

Partitioning goal

Allocate each thread to one of two partitions

CPU Set: FS Co-processor set: FH

Required execution-rate must be met, and total cost minimized

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Algorithm Details (contd) Algorithm steps

1. Put all threads in FH set 2. Iteratively do
2.1. Move some operations to FS. 2.1.1. Select a group of operations to move to FS. 2.1.2. Check performance feasibility, by computing worst-case delay through flow-graph given the new thread times 2.1.3. Do the move, if feasible 2.2. Incrementally update the new cost-function to reflect the new partition

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Algorithm Details (contd)

Vulcan cost function

f(w) = c1Sh(FH) - c2Ss(FS) + c3B - c4P + c5|m|

c: weight constants S(): Size functions B: Bus utilization (<1) P: Processor utilization (<1) m: total number of variables to be transferred between the CPU and the co-processor

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Vulcan (contd)

Algorithm Details (contd) Complementary notes

A heuristic to minimize communication

Once a thread is moved to FS, its immediate successors

are placed in the list for evaluation in the next iteration.

No back-track

Once a thread is assigned to FS, it remains there

Experimental results

considerably faster implementations than all-SW, but much cheaper than all-HW designs are produced
Codesign of Embedded Systems 22

Winter-Spring 2001

Co-Synthesis Algorithms: HW/SW Partitioning

HW/SW Partitioning Examples: Cosyma

Winter-Spring 2001

Codesign of Embedded Systems

Partitioning Examples: Cosyma

Rolf Ernst, et al: Technical University of Braunschweig, Germany Dual approach

1. All-SW initial implementation. 2. Iteratively move basic blocks to the ASIC accelerator to meet performance objective.

System specification language

Is compiled into an ESG (Extended Syntax Graph) ESG is much like a CDFG
Codesign of Embedded Systems 24

Winter-Spring 2001

Partitioning Examples: Cosyma (contd)

Cosyma Co-synthesis Algorithm

Partitioning quantum is a Basic Block

A Basic Blocks is a branch-free block of program

Target Architecture

CPU + accelerator ASIC(s)

Scheduling Allocation Cost Estimation Performance Estimation Algorithm Details

Codesign of Embedded Systems 25

Winter-Spring 2001

Partitioning Examples: Cosyma (contd)

Cosyma Co-synthesis Algorithm (contd)

Performance Estimation

SW implementation

Done by examining the object code for the basic block generated by a compiler Assumes one operator per clock cycle. Creates a list schedule for the DFG of the basic block. Depth of the list gives the number of clock cycles required. Done by data-flow analysis of the adjacent basic blocks. In Shared-Memory Proportional to number of variables to be accessed
Codesign of Embedded Systems

HW implementation

Communication

Winter-Spring 2001

Partitioning Examples: Cosyma (contd)

Algorithm Steps

Change in execution-time caused by moving basic block b from CPU to ASIC:

Dc(b) = w( tHW(b)-tSW(b) + tcom(Z) - tcom(ZUb)) x It(b)

w: Constant weight t(b): Execution time of basic block b tcom(b): Estimated communication time between CPU and the accelerator ASIC, given a set Z of basic blocks
implemented on the ASIC It(b): Total number of times that b is executed
Codesign of Embedded Systems

Winter-Spring 2001

Partitioning Examples: Cosyma (contd)

Experimental Results

By moving only basic-blocks to HW

Typical speedup of only 2x Reason:

Limited intra-basic-block parallelism Implement several control-flow optimizations to increase parallelism in the basic block, and hence in ASIC Examples: loop pipelining, speculative branch execution with multiple branch prediction, operator pipelining Speedups: 2.7 to 9.7 CPU times: 35 to 304 seconds on a typical workstation
Codesign of Embedded Systems

Cure:

Result:

Winter-Spring 2001

What we learned today

HW/SW Partitioning: One broad category of co-synthesis algorithms Criteria by which a co-synthesis algorithm is categorized

Winter-Spring 2001

Codesign of Embedded Systems

Unit - 5
No ratings yet
Unit - 5
36 pages
Review: Design Objectives: Thresholds
No ratings yet
Review: Design Objectives: Thresholds
19 pages
Hardware/Software Codesign Guide
No ratings yet
Hardware/Software Codesign Guide
42 pages
Hardware-Software Co-Design Techniques
No ratings yet
Hardware-Software Co-Design Techniques
43 pages
Hardware-Software Co-partitioning in Embedded Systems
No ratings yet
Hardware-Software Co-partitioning in Embedded Systems
41 pages
Q
No ratings yet
Q
3 pages
Microsoft PowerPoint - SoC Design Flow Tools Codesign
No ratings yet
Microsoft PowerPoint - SoC Design Flow Tools Codesign
110 pages
COSYN: Hardware-Software Co-Synthesis of Heterogeneous Distributed Embedded Systems
No ratings yet
COSYN: Hardware-Software Co-Synthesis of Heterogeneous Distributed Embedded Systems
13 pages
Hardware - Software Design Issues
No ratings yet
Hardware - Software Design Issues
22 pages
Hardware Software Codesign
No ratings yet
Hardware Software Codesign
22 pages
Embedded Systems Co-Design References
No ratings yet
Embedded Systems Co-Design References
37 pages
HWSW Co-Synthesis Algorithms
No ratings yet
HWSW Co-Synthesis Algorithms
38 pages
Embedded Systems Design Overview
No ratings yet
Embedded Systems Design Overview
20 pages
Design & Co-Design of Embedded Systems
No ratings yet
Design & Co-Design of Embedded Systems
20 pages
Hardware/Software Co-Design: Zebo Peng, Petru Eles
No ratings yet
Hardware/Software Co-Design: Zebo Peng, Petru Eles
52 pages
Es ZG626 Course Handout
No ratings yet
Es ZG626 Course Handout
11 pages
HLS Tips and Tricks
No ratings yet
HLS Tips and Tricks
23 pages
HS Codesign Overview
No ratings yet
HS Codesign Overview
48 pages
180407937
No ratings yet
180407937
11 pages
Daniel D. Gajski, Jianwen Zhu, Rainer Dömer (Auth.), Jørgen Staunstrup, ...
No ratings yet
Daniel D. Gajski, Jianwen Zhu, Rainer Dömer (Auth.), Jørgen Staunstrup, ...
405 pages
Hardware/Software Co-Design: Principles and Practice
100% (1)
Hardware/Software Co-Design: Principles and Practice
16 pages
ECE 587 - Hardware/Software Co-Design Lecture 09 Processor Modeling I
No ratings yet
ECE 587 - Hardware/Software Co-Design Lecture 09 Processor Modeling I
13 pages
M-1 Introduction
No ratings yet
M-1 Introduction
43 pages
Unit1 H - W S - W Codesign
No ratings yet
Unit1 H - W S - W Codesign
34 pages
ECT-401: Embedded Systems Overview
No ratings yet
ECT-401: Embedded Systems Overview
24 pages
Hardware-Software Co-Design Overview
No ratings yet
Hardware-Software Co-Design Overview
29 pages
Essential Issues in Codesign
No ratings yet
Essential Issues in Codesign
13 pages
Computers As Components: Principles of Embedded Computing System Design
No ratings yet
Computers As Components: Principles of Embedded Computing System Design
9 pages
My Lecture6 Partitioning
No ratings yet
My Lecture6 Partitioning
38 pages
Embedded Computing System Design Guide
No ratings yet
Embedded Computing System Design Guide
11 pages
High Level Synthesis With Catapultc: Michal Stala
No ratings yet
High Level Synthesis With Catapultc: Michal Stala
29 pages
CIA 1 HWSWCD QP Set B
No ratings yet
CIA 1 HWSWCD QP Set B
3 pages
Embedded Systems Real Time Operating Systems For Arm Cortex M Microcontrollers 4th Edition Valvano Digital Download
No ratings yet
Embedded Systems Real Time Operating Systems For Arm Cortex M Microcontrollers 4th Edition Valvano Digital Download
150 pages
PPT2
No ratings yet
PPT2
25 pages
HASCD
No ratings yet
HASCD
9 pages
High Level Synthesis II: ECE 3401 Digital Systems Design
No ratings yet
High Level Synthesis II: ECE 3401 Digital Systems Design
35 pages
108863
No ratings yet
108863
29 pages
HW SW Codesign Lecture2
100% (1)
HW SW Codesign Lecture2
39 pages
CIA 1 HWSWCD QP Set A
No ratings yet
CIA 1 HWSWCD QP Set A
3 pages
High-Level Synthesis (HLS) : ECE 3401 Digital Systems Design
No ratings yet
High-Level Synthesis (HLS) : ECE 3401 Digital Systems Design
32 pages
HSCD
No ratings yet
HSCD
277 pages
Chapter 1
No ratings yet
Chapter 1
66 pages
ESD Question Bank 2
No ratings yet
ESD Question Bank 2
20 pages
Real-Time Emulation in Embedded Design
No ratings yet
Real-Time Emulation in Embedded Design
9 pages
Ael Zg626 Ec-3r First Sem 2023-2024
No ratings yet
Ael Zg626 Ec-3r First Sem 2023-2024
5 pages
Chapter 07 - HWSW Co-Design and Prog Modeling
No ratings yet
Chapter 07 - HWSW Co-Design and Prog Modeling
25 pages
VLSI CAD Techniques Explained
No ratings yet
VLSI CAD Techniques Explained
37 pages
Software Hardware Co-Design Defense Embedded Systems
No ratings yet
Software Hardware Co-Design Defense Embedded Systems
39 pages
Chapter 4
No ratings yet
Chapter 4
23 pages
1st Session Unit 3 1
No ratings yet
1st Session Unit 3 1
37 pages
Ael ZG626 Course Handout
No ratings yet
Ael ZG626 Course Handout
4 pages
Hardware/Software Co-Design Course
No ratings yet
Hardware/Software Co-Design Course
4 pages
Course Script HWSW Codesign
No ratings yet
Course Script HWSW Codesign
144 pages
Unit 5-ERTS
No ratings yet
Unit 5-ERTS
48 pages
CS244-Introduction To Embedded Systems and Ubiquitous Computing
No ratings yet
CS244-Introduction To Embedded Systems and Ubiquitous Computing
38 pages
DSP Architecture for Engineers
No ratings yet
DSP Architecture for Engineers
33 pages
A. Assembly SW Routine For Linear Search On 8085
No ratings yet
A. Assembly SW Routine For Linear Search On 8085
9 pages
Hardware/Software Design of Embedded Systems: Sudeep Pasricha
No ratings yet
Hardware/Software Design of Embedded Systems: Sudeep Pasricha
50 pages
9553 Sony HCD-GTR6 GTR7 GTR8 Sistema Audio CD-USB Manual de Servicio
No ratings yet
9553 Sony HCD-GTR6 GTR7 GTR8 Sistema Audio CD-USB Manual de Servicio
92 pages
Mdu13 V3
No ratings yet
Mdu13 V3
12 pages
IT Analyst or Business Analyst or Oracle Developer or PL/SQL Dev
No ratings yet
IT Analyst or Business Analyst or Oracle Developer or PL/SQL Dev
3 pages
School Hall Sound Material Study
No ratings yet
School Hall Sound Material Study
6 pages
CATALOG-Truemax Stationary Concrete Batching Plant CBP120S
No ratings yet
CATALOG-Truemax Stationary Concrete Batching Plant CBP120S
12 pages
Water Pollution From Smelting, Metal Plating & Metal Finishing
0% (1)
Water Pollution From Smelting, Metal Plating & Metal Finishing
37 pages
Online Technical Bulletin: Volvo/Ford/Mazda Tf-80 SC
No ratings yet
Online Technical Bulletin: Volvo/Ford/Mazda Tf-80 SC
2 pages
In Process Inspection Report - Printing
No ratings yet
In Process Inspection Report - Printing
29 pages
Sgen-3000W Water-Cooled Generator Series: For Gas and Steam Power Applications From 540-1,300 Mva
100% (1)
Sgen-3000W Water-Cooled Generator Series: For Gas and Steam Power Applications From 540-1,300 Mva
4 pages
Strategy For Reducing Ticket Backlog
No ratings yet
Strategy For Reducing Ticket Backlog
5 pages
Inspection and Test Plan - Sprinkler System - Rev 01
No ratings yet
Inspection and Test Plan - Sprinkler System - Rev 01
7 pages
Performance Testing and Optimization The Complete Guide Paul Glavich and Chris Farrell
No ratings yet
Performance Testing and Optimization The Complete Guide Paul Glavich and Chris Farrell
397 pages
Advanced Visualization Workspace
No ratings yet
Advanced Visualization Workspace
8 pages
Retaining Walls
No ratings yet
Retaining Walls
13 pages
Methods of Costing
No ratings yet
Methods of Costing
7 pages
Adv Materials Inter - 2020 - Gao - Graphene MoS2 Graphene Vertical Heterostructure Based Broadband Photodetector With High
No ratings yet
Adv Materials Inter - 2020 - Gao - Graphene MoS2 Graphene Vertical Heterostructure Based Broadband Photodetector With High
6 pages
Antenna Installation Guide
No ratings yet
Antenna Installation Guide
47 pages
Residential Building Plan
No ratings yet
Residential Building Plan
1 page
Commercial Sidewall Sprinklers
No ratings yet
Commercial Sidewall Sprinklers
6 pages
Telma OEM Guidelines J1939 J2284: Revised: 22mar12
No ratings yet
Telma OEM Guidelines J1939 J2284: Revised: 22mar12
10 pages
CIS-CATUsersGuide 000 PDF
No ratings yet
CIS-CATUsersGuide 000 PDF
56 pages
XFS Dripline With Copper Shield™ Technology: Tech Spec
No ratings yet
XFS Dripline With Copper Shield™ Technology: Tech Spec
2 pages
KASTO 016 (로크웰 경도 시험기) PDF
No ratings yet
KASTO 016 (로크웰 경도 시험기) PDF
27 pages
DCG40152 T - 3 - Hydrographic Survey Planning
No ratings yet
DCG40152 T - 3 - Hydrographic Survey Planning
52 pages
SCX-6345N Service Manual Parts List
No ratings yet
SCX-6345N Service Manual Parts List
85 pages
Project Engineer Role Overview
100% (1)
Project Engineer Role Overview
2 pages
Auto Cuts
No ratings yet
Auto Cuts
11 pages
Ceiling Heights Exits
No ratings yet
Ceiling Heights Exits
1 page
Cold Mix For Road Construction
No ratings yet
Cold Mix For Road Construction
43 pages
3.CNOPTEC B302-1 Series Biological Catalogue
No ratings yet
3.CNOPTEC B302-1 Series Biological Catalogue
6 pages