The purpose of this project is to explore matrix-matrix multiplication, an operation that is highly relevant in many contexts. Real-world applications often involve large matrices, which pose a significant challenge for computational efficiency: machines must perform a very large number of calculations, and the computation can be slow and demanding.
For these reasons, it is fundamental to manage large matrices and perform the multiplication between them in an intelligent way. This means exploring parallel programming, exploiting specialized libraries, and paying attention to the memory management of the system.
In this project we explored various well-known techniques used in matrix-matrix multiplication; we analyzed them individually as well as in combination, and finally we evaluated the results and performance. In particular, we used a SIMD approach, loop unrolling, OpenMP features, and cache-friendly memory management. To test them we used the Google Benchmark library and saved the results in report files.
We exclusively concentrated on square matrices containing elements of type float or double.
To obtain consistent results, the dimensions of the matrices should be a power of 2.
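As a rough illustration of the cache-friendly and OpenMP techniques mentioned above, here is a minimal, self-contained sketch of a blocked matrix-matrix multiplication. It is not the project's actual kernel: the function name, the flat row-major layout, and the block size are assumptions chosen for the example, and the block size is assumed to divide the (power-of-2) matrix dimension.

```cpp
// Minimal sketch (not the project's kernel): cache-blocked, OpenMP-parallel
// multiplication of square float matrices stored in row-major order.
#include <cstddef>
#include <iostream>
#include <vector>

constexpr std::size_t BLOCK = 64;  // tile edge; assumed to divide n

// C += A * B, all n x n, row-major; C is assumed zero-initialized.
void blocked_matmul(const std::vector<float>& A, const std::vector<float>& B,
                    std::vector<float>& C, std::size_t n) {
    // Each thread works on distinct (ii, jj) tiles of C, so there are no races.
    #pragma omp parallel for collapse(2) schedule(static)
    for (std::size_t ii = 0; ii < n; ii += BLOCK)
        for (std::size_t jj = 0; jj < n; jj += BLOCK)
            for (std::size_t kk = 0; kk < n; kk += BLOCK)
                // Multiply the (ii,kk) block of A by the (kk,jj) block of B.
                for (std::size_t i = ii; i < ii + BLOCK; ++i)
                    for (std::size_t k = kk; k < kk + BLOCK; ++k) {
                        const float a = A[i * n + k];  // reused across the j loop
                        for (std::size_t j = jj; j < jj + BLOCK; ++j)
                            C[i * n + j] += a * B[k * n + j];  // contiguous, SIMD-friendly
                    }
}

int main() {
    const std::size_t n = 256;  // power of 2, as assumed by the project
    std::vector<float> A(n * n, 1.0f), B(n * n, 2.0f), C(n * n, 0.0f);
    blocked_matmul(A, B, C, n);
    std::cout << "C[0] = " << C[0] << '\n';  // expected: 2 * n = 512
    return 0;
}
```

It can be compiled with, for example, `g++ -O3 -fopenmp example.cpp`; keeping the innermost loop over contiguous elements of B and C is what makes the kernel amenable to compiler auto-vectorization (SIMD).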
This project uses CMake as its build system; follow the steps below to build it. The prerequisites are:
- CMake (version 3.20 or higher)
- OpenMP
- OpenBLAS
- Clone the repository

      git clone https://github.com/AMSC22-23/Neural_Nets-Bettonte-Lacagnina-Lentini.git
      cd Neural_Nets-Bettonte-Lacagnina-Lentini

- Create a build directory

      mkdir build
      cd build

- Configure the project

  To build with the optimization flags -Ofast -march=native, configure with:

      cmake .. -DCMAKE_BUILD_TYPE=Release

  To build without optimization flags, configure with:

      cmake .. -DCMAKE_BUILD_TYPE=Debug

  The default build type is Release.

- Build the project

      make

- Run the executable

      ./Neural_Nets
Once the program is launched you will be asked to provide the following inputs:
- Dimension of the matrices.
- Type of the matrix elements: insert f for float or d for double.
- Number of test repetitions.
The number of test repetitions affects the output of the program; in particular, a single repetition does not produce aggregate statistics (mean, median, standard deviation, cv).
To save the results produced by the benchmark, add the flag --benchmark_out=<filename> when running the executable.
Note
Save the output files in the reports directory in order to plot them with the plotting.py script.
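For example, assuming the executable is run from the build directory, a benchmark of 1024x1024 float matrices could be saved with `./Neural_Nets --benchmark_out=../reports/report1024f.json` (the relative path and filename here are illustrative; the filename follows the convention described below).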
The JSON files in the reports directory contain the results produced by Google Benchmark coming from our testing.
The naming convention we adopted is
reportXYZ.json,
where:
- X is the matrix dimension
- Y is an abbreviation for the type of data contained in the matrices (in this case f for float and d for double)
- Z is either the string opt, indicating tests run with optimization flags, or an empty string.
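For example, a file named report1024dopt.json contains the results for 1024x1024 matrices of double elements obtained with optimization flags enabled.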
Warning
Using a different file naming scheme or a different file format will break the plotting script.
To plot the data contained in the report files, run the following command from the src directory:
python3 plotting.py
The plots shown will also be saved in the plots directory.
Note
Running the script requires python3 and the matplotlib library.
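If matplotlib is missing, it can typically be installed with `pip3 install matplotlib`.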