Parallel Computational Algorithms
1. Prefix Computation
• Parallel algorithms organize an application’s computational work so that multiple parts of the workload can be performed concurrently.
• This can reduce the time to solution and increase performance.
• A prefix-sum algorithm processes an array to compute, at each index, the cumulative sum of all elements up to that index.
• Mathematically, the prefix sum array P of A is defined as
P[i] = A[0] + A[1] + ⋯ + A[i] for all i ∈ [0, n−1], where n is the size of the array.
Example:
Input: A = [1, 2, 3, 4, 5]
Output: Prefix Sums: P = [1, 3, 6, 10, 15]
Range sum: Sum(L, R) = Prefix(R) − Prefix(L−1)
Sum(2, 4) = 15 − 3 = 12
Algorithm Steps:
• Initialize P[0] = A[0].
• For each subsequent element i in A, set P[i] = P[i−1] + A[i].
• This ensures that P[i] holds the cumulative sum up to index i.
Applications:
1. Cumulative sums in data analysis.
2. Parallel algorithms for GPUs and multi-threaded systems.
3. Range queries in segment trees.
4. Scan operations in functional programming.
5. Image processing (integral image computations).
• Time Complexity: O(n) (linear time), where n is the size of the input array. Note: building P costs O(n); each range-sum query afterwards costs O(1).
• Space Complexity: O(n) for the output array P.
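The recurrence P[i] = P[i−1] + A[i] is serial as written, but prefix sums parallelize well with a blocked two-pass scheme: each thread scans its own block, then block offsets are propagated. A minimal C/OpenMP sketch (the function name parallel_prefix and the thread cap are illustrative, not from the source):

```c
#include <stdio.h>
#include <omp.h>

#define MAX_THREADS 256   /* assumed upper bound on thread count */

/* Blocked two-pass parallel inclusive prefix sum:
 * pass 1: each thread scans its own block;
 * pass 2: each block adds the total of all preceding blocks.
 * Compile with: gcc -fopenmp prefix_sum.c */
void parallel_prefix(const int *a, long long *p, int n) {
    long long block_total[MAX_THREADS + 1] = {0};

    #pragma omp parallel
    {
        int t = omp_get_thread_num();
        int nt = omp_get_num_threads();
        int lo = (int)((long long)n * t / nt);
        int hi = (int)((long long)n * (t + 1) / nt);

        long long sum = 0;                        /* pass 1: local scan */
        for (int i = lo; i < hi; i++) { sum += a[i]; p[i] = sum; }
        block_total[t + 1] = sum;

        #pragma omp barrier
        #pragma omp single
        for (int b = 1; b <= nt; b++)             /* exclusive scan of block totals */
            block_total[b] += block_total[b - 1];
        /* implicit barrier at the end of the single region */

        for (int i = lo; i < hi; i++)             /* pass 2: add block offset */
            p[i] += block_total[t];
    }
}

int main(void) {
    int a[] = {1, 2, 3, 4, 5};
    long long p[5];
    parallel_prefix(a, p, 5);
    for (int i = 0; i < 5; i++) printf("%lld ", p[i]);  /* prints 1 3 6 10 15 */
    printf("\n");
    return 0;
}
```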
Examples:
• A = [2, 4, 5, 6, 10, 20, 30]
• B = [2, 4, 6, 8, 10]
• C = [3, 4, 8, 9, 14, 23, 25]
• D = [3, 1, 4, 1, 5, 9, 2]
• Prefix sum of D = [3, 4, 8, 9, 14, 23, 25] (the same as C above).
• Indexing starts from 0.
• Range sum: Sum(L, R) = Prefix(R) − Prefix(L−1).
2D (Matrix):
A 3×3 matrix in row-major order and its running (flattened) prefix sum:
Location:  0,0  0,1  0,2  1,0  1,1  1,2  2,0  2,1  2,2
Index:       0    1    2    3    4    5    6    7    8
Element:     7    2    1    3    5    2    4    6    8
Sum:         7    9   10   13   18   20   24   30   38
2. Matrix-Vector Multiplication
• Matrix-vector multiplication is a fundamental operation used in linear algebra and machine learning.
• For an m×n matrix A and a vector x of size n, the product y = Ax has entries y[i] = A[i][0]·x[0] + ⋯ + A[i][n−1]·x[n−1]; each entry is an independent dot product, which makes the operation naturally parallel.
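As a sketch of that row-level parallelism, the following C/OpenMP fragment (sizes and values are illustrative, not from the source) assigns rows of A to threads:

```c
#include <stdio.h>
#include <omp.h>

/* Row-parallel matrix-vector product y = A x: every row's dot
 * product is independent, so rows are split across threads.
 * Compile with: gcc -fopenmp matvec.c */
#define M 3
#define N 3

int main(void) {
    double a[M][N] = {{1, 2, 3}, {4, 5, 6}, {7, 8, 9}};
    double x[N] = {1, 0, -1};
    double y[M];

    #pragma omp parallel for
    for (int i = 0; i < M; i++) {
        double s = 0.0;
        for (int j = 0; j < N; j++)   /* dot product of row i with x */
            s += a[i][j] * x[j];
        y[i] = s;
    }

    for (int i = 0; i < M; i++)
        printf("y[%d] = %g\n", i, y[i]);  /* expected: -2, -2, -2 */
    return 0;
}
```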
Matrix-Matrix Multiplication
• Matrix-matrix multiplication involves multiplying two matrices A (of size m×n) and B (of size n×p). The result is a matrix C with dimensions m×p.
• Steps:
1. Take each row of the first matrix A and each column of the second matrix B.
2. Compute the dot product between the row of A and the column of B to get an entry in the resulting matrix C.
3. Repeat this for every row-column pair.
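A minimal C/OpenMP sketch of these steps (sizes and values are illustrative) parallelizes the outer loop over rows of C, since every (i, j) entry is independent:

```c
#include <stdio.h>
#include <omp.h>

/* Parallel matrix-matrix product C = A * B (A is M x N, B is N x P).
 * Compile with: gcc -fopenmp matmul.c */
#define M 2
#define N 3
#define P 2

int main(void) {
    double a[M][N] = {{1, 2, 3}, {4, 5, 6}};
    double b[N][P] = {{7, 8}, {9, 10}, {11, 12}};
    double c[M][P];

    #pragma omp parallel for
    for (int i = 0; i < M; i++)
        for (int j = 0; j < P; j++) {
            double s = 0.0;
            for (int k = 0; k < N; k++)   /* dot product of row i of A and column j of B */
                s += a[i][k] * b[k][j];
            c[i][j] = s;
        }

    for (int i = 0; i < M; i++) {
        for (int j = 0; j < P; j++)
            printf("%6g ", c[i][j]);      /* expected: 58 64 / 139 154 */
        printf("\n");
    }
    return 0;
}
```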
3. Gaussian Elimination
• The Gaussian elimination algorithm is used for solving systems of linear equations.
• The basic concept is to subtract some scalar of an equation from another equation to
eliminate an unknown.
• It transforms a given system into an upper triangular matrix (row echelon form)
through a series of row operations, followed by back substitution to find the
solution.
• Example: x+y+z=2, x+2y+3z =5, 2x+3y+4z = 11
Forward Elimination
• Convert the matrix into an upper triangular form by eliminating elements below the pivot element (diagonal element).
• Perform row operations to make entries below the pivot zero.
Backward Substitution
• Solve for variables starting from the last row, moving upwards.
Parallel Computing in Gaussian Elimination
• The sequential algorithm performs O(n³) operations for an n×n matrix.
• It is computationally expensive for large datasets.
• Parallel computing therefore improves performance by distributing tasks across multiple processors.
Parallelization Strategies:
1. Row-wise Parallelism
• Each row operation is independent and can be computed simultaneously by different processors (a sketch follows this list).
• Suitable for shared-memory systems where threads access the same matrix.
2. Column-wise Parallelism
• Parallelize operations involving columns to perform computations on pivot elements simultaneously.
• Often used in distributed-memory systems.
3. Block-based Decomposition
• Divide the matrix into smaller submatrices (blocks) and operate on them concurrently.
• Suitable for architectures like GPUs, which handle matrix blocks effectively.
4. Pipeline Parallelism
• Break down tasks into stages and assign each stage to a processor.
• Overlaps computation and communication for better utilization.
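As an illustration of the row-wise strategy, here is a minimal C/OpenMP sketch of Gaussian elimination. It omits pivoting for brevity, so it assumes nonzero pivots, and the example system is illustrative rather than the one above:

```c
#include <stdio.h>
#include <omp.h>

/* Row-wise parallel Gaussian elimination: for each pivot column k,
 * the updates of rows k+1..N-1 are independent and shared among
 * threads. Back substitution stays sequential (it is only O(n^2)).
 * Compile with: gcc -fopenmp gauss.c */
#define N 3

int main(void) {
    double a[N][N + 1] = {          /* augmented matrix [A | b] */
        { 2,  1, -1,   8},
        {-3, -1,  2, -11},
        {-2,  1,  2,  -3}
    };
    double x[N];

    /* forward elimination */
    for (int k = 0; k < N; k++) {
        #pragma omp parallel for    /* rows below the pivot in parallel */
        for (int i = k + 1; i < N; i++) {
            double f = a[i][k] / a[k][k];
            for (int j = k; j <= N; j++)
                a[i][j] -= f * a[k][j];
        }
    }

    /* backward substitution, last row upward */
    for (int i = N - 1; i >= 0; i--) {
        x[i] = a[i][N];
        for (int j = i + 1; j < N; j++)
            x[i] -= a[i][j] * x[j];
        x[i] /= a[i][i];
    }

    for (int i = 0; i < N; i++)
        printf("x[%d] = %g\n", i, x[i]);  /* expected: 2, 3, -1 */
    return 0;
}
```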
4. Vector Addition and Dot Product
• Due to the computational intensity of these operations for large datasets, parallel computing
techniques are employed to enhance performance.
Problem Statement
• Given two vectors A and B of size n, compute their sum C, where C[i] = A[i] + B[i] for all i.
Parallel Algorithm
1. Input: Two vectors A and B of size n.
2. Output: Resultant vector C of size n.
3. Parallel Steps:
1. Partition the vectors into p segments, where p is the number of processors available.
2. Assign each processor a subrange of indices and compute the element-wise sums for that subrange.
3. Aggregate the results into the final output vector.
• Discuss the complexity of the algorithm.
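A minimal C/OpenMP sketch of these steps (values illustrative); the work-sharing loop performs the partitioning and assignment of subranges automatically:

```c
#include <stdio.h>
#include <omp.h>

/* Parallel vector addition c = a + b: every element is independent,
 * so the loop is split evenly across threads.
 * Compile with: gcc -fopenmp vecadd.c */
#define N 8

int main(void) {
    double a[N] = {1, 2, 3, 4, 5, 6, 7, 8};
    double b[N] = {8, 7, 6, 5, 4, 3, 2, 1};
    double c[N];

    #pragma omp parallel for
    for (int i = 0; i < N; i++)
        c[i] = a[i] + b[i];

    for (int i = 0; i < N; i++)
        printf("%g ", c[i]);   /* prints 9 for every element */
    printf("\n");
    return 0;
}
```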
Dot Product
Problem Statement
• Given two vectors A and B of size n, compute their dot product s = A[0]·B[0] + A[1]·B[1] + ⋯ + A[n−1]·B[n−1].
Parallel Algorithm
1. Input: Two vectors A and B of size n.
2. Output: Scalar result s.
3. Parallel Steps:
1. Partition the vectors into segments.
2. Each processor computes partial products and sums them locally.
3. Perform a reduction operation to combine the partial sums into the final result.
• Discuss the complexity of the algorithm.
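A minimal C/OpenMP sketch (values illustrative); the reduction(+:s) clause implements both the local partial sums and the final combining step:

```c
#include <stdio.h>
#include <omp.h>

/* Parallel dot product: each thread accumulates a private partial
 * sum; reduction(+:s) combines the partial sums at the end.
 * Compile with: gcc -fopenmp dot.c */
#define N 8

int main(void) {
    double a[N] = {1, 2, 3, 4, 5, 6, 7, 8};
    double b[N] = {1, 1, 1, 1, 1, 1, 1, 1};
    double s = 0.0;

    #pragma omp parallel for reduction(+:s)
    for (int i = 0; i < N; i++)
        s += a[i] * b[i];

    printf("dot = %g\n", s);   /* expected: 36 */
    return 0;
}
```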
5. Parallel Computation of π
• Parallel computation of π (pi) involves distributing the computation across multiple processors or threads to speed up the process.
• Several algorithms can be adapted for parallel computation, including:
• Monte Carlo Method
• Gregory–Leibniz Series
• Bailey–Borwein–Plouffe (BBP) Formula
• Gauss–Legendre Algorithm
• Chudnovsky Algorithm
• Time Complexity: O(N·K) for Monte Carlo estimation with K rounds of N random points each.
• Auxiliary Space: O(1).
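A minimal C/OpenMP sketch of the Monte Carlo method (constants illustrative; it assumes the POSIX rand_r generator so each thread has its own random stream):

```c
#include <stdio.h>
#include <stdlib.h>
#include <omp.h>

/* Monte Carlo estimation of pi: sample random points in the unit
 * square; the fraction landing inside the quarter circle tends to
 * pi/4. Threads count hits independently; reduction(+:inside)
 * combines the counts. Compile with: gcc -fopenmp mc_pi.c */
int main(void) {
    const long long N = 10000000;   /* total number of random points */
    long long inside = 0;

    #pragma omp parallel reduction(+:inside)
    {
        unsigned int seed = 1234u + omp_get_thread_num();  /* per-thread seed */
        #pragma omp for
        for (long long i = 0; i < N; i++) {
            double x = (double)rand_r(&seed) / RAND_MAX;
            double y = (double)rand_r(&seed) / RAND_MAX;
            if (x * x + y * y <= 1.0)
                inside++;
        }
    }
    printf("pi estimate = %f\n", 4.0 * (double)inside / (double)N);
    return 0;
}
```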
Bailey–Borwein–Plouffe (BBP) Formula
• A digit-extraction formula for π:
π = Σ_{k=0..∞} 16^(−k) · (4/(8k+1) − 2/(8k+4) − 1/(8k+5) − 1/(8k+6))
• Each thread computes the sum for a specific range of k, and the results are combined.
• Discuss the complexity of the algorithm.
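A minimal C/OpenMP sketch (constants illustrative); note that it sums the series directly in double precision, rather than performing the true digit extraction, which requires modular arithmetic:

```c
#include <stdio.h>
#include <math.h>
#include <omp.h>

/* Parallel evaluation of the BBP series: terms are independent, so
 * the k range is split across threads and the partial sums are
 * combined by the reduction. Each term adds more than one hex digit
 * of accuracy. Compile with: gcc -fopenmp bbp.c -lm */
int main(void) {
    double pi = 0.0;

    #pragma omp parallel for reduction(+:pi)
    for (int k = 0; k < 20; k++)
        pi += pow(16.0, -k) * (4.0 / (8 * k + 1) - 2.0 / (8 * k + 4)
                             - 1.0 / (8 * k + 5) - 1.0 / (8 * k + 6));

    printf("pi = %.15f\n", pi);
    return 0;
}
```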
Trapezoidal Integration
• Trapezoidal integration is a numerical method used to approximate the definite
integral of a function.
• It divides the integration interval into smaller subintervals and approximates the
area under the curve using trapezoids.
Formula
For a function f(x) defined on the interval [a, b], the Trapezoidal Rule with n subintervals of width h = (b − a)/n is defined as:
∫[a,b] f(x) dx ≈ (h/2) · [f(a) + 2·(f(a+h) + f(a+2h) + ⋯ + f(a+(n−1)h)) + f(b)]
Implementation Steps:
1. Divide the interval [a, b] into n subintervals.
2. Calculate the step size h = (b − a)/n.
3. Evaluate the function at the endpoints and intermediate points.
4. Sum the areas of the trapezoids formed.
Example:
• Compute the integral of f(x) = x² on [0, 1] using 4 subintervals (n = 4). With h = 0.25, the rule gives (0.25/2)·[0 + 2(0.0625 + 0.25 + 0.5625) + 1] = 0.34375, against the exact value 1/3.
• Assignment problem: Use a parallel algorithm to compute the trapezoidal integration of f(x) = x² over the interval [0, 2] with two subintervals.
Task 1:
• Using the trapezoidal rule, calculate the integral of f(x) = x³ over the interval [1, 3] using four subintervals in parallel. Show the computations for each segment.
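A minimal C/OpenMP sketch of the parallel trapezoidal rule, using the f(x) = x² on [0, 1] example above:

```c
#include <stdio.h>
#include <omp.h>

/* Parallel trapezoidal rule: the interior function evaluations are
 * independent, so the loop is shared among threads and combined
 * with a reduction. Compile with: gcc -fopenmp trap.c */
double f(double x) { return x * x; }   /* integrand; swap in any f */

double trapezoid(double a, double b, int n) {
    double h = (b - a) / n;
    double sum = 0.5 * (f(a) + f(b));  /* endpoint terms */

    #pragma omp parallel for reduction(+:sum)
    for (int i = 1; i < n; i++)        /* interior points */
        sum += f(a + i * h);

    return h * sum;
}

int main(void) {
    printf("%f\n", trapezoid(0.0, 1.0, 4));   /* x^2 on [0,1], n=4 -> 0.34375 */
    return 0;
}
```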
Mandelbrot set computation
To compute the Mandelbrot set in parallel:
• Identify which parts of the computation can be calculated independently (each pixel's escape-time iteration depends only on its own coordinates).
• Split the work across multiple cores.
• Use dynamic scheduling instead of static scheduling: pixels near the set need many more iterations than pixels that escape quickly, so static partitions are badly load-imbalanced (see the sketch below).
• For an example implementation of the algorithm, refer to the following link:
https://github.com/philipzhux/parallel-mandelbrot-compute/tree/master/src
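Independently of that repository, a minimal C/OpenMP sketch of the idea (resolution and coordinate-mapping constants are illustrative):

```c
#include <stdio.h>
#include <omp.h>

/* Escape-time Mandelbrot computation: each pixel iterates
 * z = z^2 + c until |z| > 2 or MAX_ITER is reached. Rows are
 * distributed with schedule(dynamic) because their costs are very
 * uneven. Compile with: gcc -fopenmp mandel.c */
#define W 80
#define H 40
#define MAX_ITER 1000

int main(void) {
    static int iters[H][W];

    #pragma omp parallel for schedule(dynamic)
    for (int py = 0; py < H; py++) {
        for (int px = 0; px < W; px++) {
            double cr = -2.0 + 3.0 * px / W;   /* map pixel to [-2,1] x [-1.5,1.5] */
            double ci = -1.5 + 3.0 * py / H;
            double zr = 0.0, zi = 0.0;
            int k = 0;
            while (k < MAX_ITER && zr * zr + zi * zi <= 4.0) {
                double t = zr * zr - zi * zi + cr;
                zi = 2.0 * zr * zi + ci;
                zr = t;
                k++;
            }
            iters[py][px] = k;
        }
    }

    /* crude ASCII rendering of the result */
    for (int py = 0; py < H; py++) {
        for (int px = 0; px < W; px++)
            putchar(iters[py][px] == MAX_ITER ? '#' : '.');
        putchar('\n');
    }
    return 0;
}
```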
Pointer Jumping Techniques:
• Pointer jumping (also known as path doubling or pointer doubling) is a common technique used to solve
problems involving linked structures, such as
• List Ranking
• Linked List Parallel Prefix
• Euler Tour
• It is particularly useful in parallel algorithms and graph theory applications, such as finding ancestors, shortest
paths, or connected components.
Why the Pointer Jumping Technique:
• Instead of traversing elements one step at a time, pointer jumping advances multiple steps in each iteration by updating pointers to point further along the path.
• This exponentially reduces the number of iterations required (to O(log n) rounds), making traversal faster.
Applications:
Finding the Root or Representative in Disjoint Sets (Union-Find Algorithm)
• Used in disjoint-set union-find data structures to maintain and query connected components
efficiently.
• Pointer jumping helps flatten the tree structure during path compression for faster queries.
Lowest Common Ancestor (LCA) Problem
• Pointer jumping is used to precompute ancestors at powers of 2 (binary lifting).
• This allows querying ancestors in logarithmic time.
Successor Queries in Linked Lists
• Speeds up successor finding by skipping intermediate nodes.
Tree Algorithms
• Used for parent pointer doubling in tree traversal or processing.
• Finds ancestors or computes depths quickly.
Parallel Algorithms
Pointer jumping is useful in parallelizing computations, such as list ranking and spanning trees.
1. List Ranking
• Assignment
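As a starting point for the assignment, a minimal C sketch of list ranking by pointer jumping (Wyllie-style path doubling). The parallel rounds are simulated sequentially here; the double buffering stands in for the synchronous per-node updates a real parallel version would perform:

```c
#include <stdio.h>

/* List ranking by pointer jumping: each round, every node's rank
 * accumulates its successor's rank and its pointer jumps two steps
 * ahead, so distances double and O(log n) rounds suffice. */
#define N 8

int main(void) {
    int next[N] = {1, 2, 3, 4, 5, 6, 7, 7};  /* successor; tail points to itself */
    int rank[N];
    for (int i = 0; i < N; i++)
        rank[i] = (next[i] == i) ? 0 : 1;     /* tail has rank 0 */

    int done = 0;
    while (!done) {
        int nn[N], nr[N];
        done = 1;
        for (int i = 0; i < N; i++) {         /* one synchronous round */
            if (next[i] != next[next[i]]) done = 0;
            nr[i] = rank[i] + rank[next[i]];  /* accumulate skipped distance */
            nn[i] = next[next[i]];            /* jump: point two steps ahead */
        }
        for (int i = 0; i < N; i++) { rank[i] = nr[i]; next[i] = nn[i]; }
    }

    for (int i = 0; i < N; i++)
        printf("node %d: rank %d\n", i, rank[i]);  /* distance to the tail */
    return 0;
}
```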
Euler Tour
• Example
• DFS traversal: [1, 2, 4, 4, 2, 5, 5, 2, 1, 3, 6, 6, 3, 7, 7, 3, 1]
• Discuss complexity.
Linked List Parallel Prefix
• Assignment