Dot Product

This is the official source code for my blog post on common pitfalls designing high performance mathematical routines on the GPU and CPU. It consists of multiple implementations of calculating a dot product for very large vectors.

Introduction

While graphics professing units have deservedly found astounding success in many general parallel programming applications in recent years, one must not become fixated by them and try to brute complicated solutions for problems that are better served by other approaches.

In the blog post accompanying this repository it is shown how a discrete GPU approach loses out to a vectorized CPU version in an almost embarassingly parallel task. In fact, it is investigated that on many computers the GPU version has no hope of catching up to the CPU approach due bandwith limits of PCI-Express itself. Nevertheless, the code in this repostiory serves as an approachable example that the expected results depend on the underlying hardware and proper analysis of all components is required to implement the right approach.

Contributing

Interested in making contributions to this project? Please review the guides below.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github		.github
LICENSES		LICENSES
src		src
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
REUSE.toml		REUSE.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Dot Product

Introduction

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

CowFreedom/gpu_dotproduct

Folders and files

Latest commit

History

Repository files navigation

Dot Product

Introduction

Contributing

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages