KWAME NKRUMAH UNIVERSITY OF SCIENCE AND
TECHNOLOGY, KUMASI
Faculty of Physical and Computational Sciences
Department of Mathematics
Parallel Computing Project Report
(MPI)
Date: August 24, 2025
Contents
1 Background and Motivation
  1.1 Introduction
  1.2 Key Concepts
2 Methodology
  2.1 Mathematical Foundation
  2.2 Parallel Strategy
3 System Design and Implementation
  3.1 Design Overview
  3.2 Implementation Workflow
4 Experimental Setup and Observations
5 Evaluation and Insights
  5.1 Performance Trends
  5.2 Scalability Assessment
6 Conclusion and Future Work
7 References
1 Background and Motivation
1.1 Introduction
This work investigates the use of Message Passing Interface (MPI) in accelerating vector
computations. The primary focus is the dot product of two vectors, a key operation
in scientific and engineering workloads. By distributing computations among multiple
processes, we aim to demonstrate both performance gains and scalability.
1.2 Key Concepts
Dot Product: The dot product of two vectors produces a scalar and is widely used in
numerical simulations, machine learning, and data analysis.
Parallel Execution: To leverage modern multi-core and multi-node systems, the
operation is parallelized with MPI, which allows flexible communication and workload
distribution, even when partition sizes differ.
2 Methodology
2.1 Mathematical Foundation
For vectors A[i] and B[i] of length N,

    R = \sum_{i=1}^{N} A[i] \cdot B[i]        (2.1)

This direct computation is O(N).
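
For reference, a minimal serial C implementation of Equation (2.1) could look as follows; the vector length and test values are illustrative assumptions rather than the project's actual data.

#include <stdio.h>
#include <stdlib.h>

/* Serial dot product: accumulate A[i] * B[i] over all N elements (Equation 2.1). */
double dot_product(const double *A, const double *B, long N) {
    double R = 0.0;
    for (long i = 0; i < N; i++) {
        R += A[i] * B[i];
    }
    return R;
}

int main(void) {
    long N = 1000000;                         /* illustrative vector length */
    double *A = malloc(N * sizeof(double));
    double *B = malloc(N * sizeof(double));
    for (long i = 0; i < N; i++) {            /* simple test data */
        A[i] = 1.0;
        B[i] = 2.0;
    }
    printf("R = %f\n", dot_product(A, B, N));
    free(A);
    free(B);
    return 0;
}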
2.2 Parallel Strategy
The approach divides the data across MPI processes (a minimal sketch follows the list):
1. Initialize MPI and determine process ranks and total processes.
2. Partition arrays A and B into suitable chunks (not necessarily equal).
3. Each process computes its local dot product.
4. Results are combined at the root process using MPI_Reduce.
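
The following sketch illustrates these four steps, assuming for simplicity that N divides evenly among the processes and that each process initializes its own chunk; the uneven case, with the root distributing data via MPI_Scatterv, is shown in Section 3.

#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);                        /* Step 1: initialize MPI */
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    long N = 1000000;                              /* illustrative size, assumed divisible by size */
    long local_n = N / size;                       /* Step 2: equal partition per process */
    double *a = malloc(local_n * sizeof(double));
    double *b = malloc(local_n * sizeof(double));
    for (long i = 0; i < local_n; i++) {           /* illustrative local data */
        a[i] = 1.0;
        b[i] = 2.0;
    }

    double local_dot = 0.0;                        /* Step 3: local dot product */
    for (long i = 0; i < local_n; i++)
        local_dot += a[i] * b[i];

    double global_dot = 0.0;                       /* Step 4: combine at the root */
    MPI_Reduce(&local_dot, &global_dot, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)
        printf("Dot product = %f\n", global_dot);

    free(a);
    free(b);
    MPI_Finalize();
    return 0;
}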
3 System Design and Implementation
3.1 Design Overview
The design prioritizes load balance and efficient communication. Unequal data sizes are
supported via MPI_Scatterv, which assigns tailored portions of the vectors to each process.
3.2 Implementation Workflow
Initialization: MPI environment setup, rank identification, and allocation of local
arrays.
Distribution: The root process divides the input vectors and uses MPI_Scatterv for communication.
Local Computation: Each process computes the partial dot product for its assigned segment.
Aggregation: Local results are reduced to a single global result.
Termination: Root process reports execution time; memory is freed; MPI is finalized.
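
A condensed sketch of this workflow is shown below, assuming the uneven partition is computed from N and the process count; the variable names, the timing with MPI_Wtime, and the initialization of the vectors at the root are illustrative assumptions, not the project's exact code.

#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);                               /* Initialization */
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    long N = 250000000;                                   /* vector size used in the experiments */
    int *counts = malloc(size * sizeof(int));
    int *displs = malloc(size * sizeof(int));
    long base = N / size, rem = N % size, offset = 0;
    for (int p = 0; p < size; p++) {                      /* not-necessarily-equal chunk sizes */
        counts[p] = (int)(base + (p < rem ? 1 : 0));
        displs[p] = (int)offset;
        offset += counts[p];
    }

    double *A = NULL, *B = NULL;
    if (rank == 0) {                                      /* root owns the full vectors */
        A = malloc(N * sizeof(double));
        B = malloc(N * sizeof(double));
        for (long i = 0; i < N; i++) { A[i] = 1.0; B[i] = 2.0; }  /* illustrative data */
    }

    int local_n = counts[rank];
    double *a = malloc(local_n * sizeof(double));
    double *b = malloc(local_n * sizeof(double));

    double t0 = MPI_Wtime();
    /* Distribution: tailored portions of A and B to each process */
    MPI_Scatterv(A, counts, displs, MPI_DOUBLE, a, local_n, MPI_DOUBLE, 0, MPI_COMM_WORLD);
    MPI_Scatterv(B, counts, displs, MPI_DOUBLE, b, local_n, MPI_DOUBLE, 0, MPI_COMM_WORLD);

    double local_dot = 0.0;                               /* Local computation */
    for (int i = 0; i < local_n; i++)
        local_dot += a[i] * b[i];

    double global_dot = 0.0;                              /* Aggregation at the root */
    MPI_Reduce(&local_dot, &global_dot, 1, MPI_DOUBLE, MPI_SUM, 0, MPI_COMM_WORLD);

    if (rank == 0)                                        /* Termination: report and clean up */
        printf("Dot product = %f, time = %f s\n", global_dot, MPI_Wtime() - t0);

    free(a); free(b); free(counts); free(displs);
    if (rank == 0) { free(A); free(B); }
    MPI_Finalize();
    return 0;
}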
4 Experimental Setup and Observations
A test case with vector size 2.5 × 10^8 was executed under varying numbers of processes.
Table 4.1 summarizes recorded runtimes.
Table 4.1: Execution Time under Different Core Counts
Vector Size Processes Runtime (s)
250000000 1 0.6697
250000000 2 0.4596
250000000 3 0.2876
250000000 4 0.2502
250000000 5 0.1808
250000000 6 0.1404
250000000 7 0.1096
250000000 8 0.0853
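
For context, a run of this kind is typically compiled and launched with the standard MPI tool chain; the source file name dot_product.c below is an assumed placeholder, not the project's actual file name.

mpicc -O2 dot_product.c -o dot_product    # compile with the MPI C compiler wrapper
mpirun -np 8 ./dot_product                # launch the executable with 8 processes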
5 Evaluation and Insights
5.1 Performance Trends
Execution time consistently declined with additional processes, confirming the effectiveness of parallel decomposition. The flexibility of MPI_Scatterv ensured that uneven data partitioning did not hinder performance.
5.2 Scalability Assessment
The speedup S(n) is defined as

    S(n) = \frac{T(1)}{T(n)}        (5.1)

where T(1) is the runtime with one process and T(n) is the runtime with n processes. The experimental results demonstrate near-linear improvements up to 8 processes.
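
As a worked check against Table 4.1, the measured speedup at n = 8 is

    S(8) = \frac{T(1)}{T(8)} = \frac{0.6697}{0.0853} \approx 7.85

which is close to the ideal value of 8.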
6 Conclusion and Future Work
This project confirms that MPI can significantly accelerate vector operations while han-
dling both uniform and irregular data distribution effectively. Future studies could exam-
ine hybrid MPI+OpenMP implementations, adaptive data partitioning, or experiments
on distributed cluster environments.
7 References
[1] MPI Forum, MPI Standard Documentation, https://www.mpi-forum.org/docs/.
[2] M. J. Quinn, Parallel Programming in C with MPI and OpenMP.
[3] V. Eijkhout, Introduction to High-Performance Scientific Computing.