This repository contains CUDA- and PyTorch-based implementations of matrix operations and performance comparisons. It includes:
- Matrix multiplication implemented with CUDA cores.
- A comparison of GPU and CPU performance.
- A demonstration of CUDA kernels for parallel computation.
- A comparison of matrix multiplication on CUDA cores (FP32) and Tensor Cores (FP16).
- A demonstration of the performance benefit of Tensor Cores for half-precision computation.
- Result validation to check agreement between the FP32 and FP16 computations.
- NVIDIA GPU with CUDA support.
- CUDA Toolkit installed.
- C++ toolchain and `nvcc` (the CUDA compiler driver, included in the CUDA Toolkit).
- PyTorch installed with CUDA support.
- NVIDIA GPU with Tensor Core support (for FP16 acceleration).
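A quick way to confirm the PyTorch-side prerequisites is to query the CUDA device's compute capability (Tensor Cores first appeared in the Volta architecture, sm_70). A small sketch (the function name is illustrative, not part of this repository) that degrades gracefully when PyTorch or a GPU is missing:

```python
def describe_cuda_environment():
    """Return a one-line summary of CUDA/Tensor Core availability.

    Never raises: reports a missing PyTorch install or a missing
    CUDA device instead of failing.
    """
    try:
        import torch
    except ImportError:
        return "PyTorch not installed"
    if not torch.cuda.is_available():
        return "PyTorch installed, but no CUDA device detected"
    major, minor = torch.cuda.get_device_capability()
    # Tensor Cores require compute capability 7.0 (Volta) or newer.
    tensor_cores = "with" if major >= 7 else "without"
    return f"CUDA device sm_{major}{minor}, {tensor_cores} Tensor Cores"

print(describe_cuda_environment())
```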
- Compile and run with `nvcc`: `nvcc -o MatMul MatMul.cu`, then `./MatMul`
- Run the Python script: `python matmul_pytorch.py`
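The CPU/GPU timing comparison presumably follows the usual benchmark pattern: a warm-up run, then the best time over several repeats. Below is a minimal CPU-only sketch of that harness; NumPy's BLAS-backed `@` stands in for the fast path, and a naive triple loop stands in for the slow baseline. The real script would instead call `torch.matmul` on CUDA tensors and call `torch.cuda.synchronize()` before reading the clock, since GPU kernels launch asynchronously.

```python
import time
import numpy as np

def time_matmul(matmul, a, b, warmup=1, repeats=5):
    """Return the best wall-clock time (seconds) over `repeats` runs."""
    for _ in range(warmup):          # warm-up: caches, lazy initialization
        matmul(a, b)
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        matmul(a, b)
        best = min(best, time.perf_counter() - start)
    return best

def naive_matmul(a, b):
    """Triple-loop reference, analogous to an unoptimized CPU baseline."""
    n, k = a.shape
    m = b.shape[1]
    out = np.zeros((n, m), dtype=a.dtype)
    for i in range(n):
        for j in range(m):
            s = 0.0
            for p in range(k):
                s += a[i, p] * b[p, j]
            out[i, j] = s
    return out

rng = np.random.default_rng(0)
a = rng.standard_normal((64, 64)).astype(np.float32)
b = rng.standard_normal((64, 64)).astype(np.float32)
t_naive = time_matmul(naive_matmul, a, b, repeats=2)
t_fast = time_matmul(lambda x, y: x @ y, a, b)
```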
- Demonstrates GPU acceleration for matrix operations.
- Compares CPU and GPU performance for matrix multiplication.
- Highlights the use of Tensor cores for FP16 computations in PyTorch.
- Ensure your system has the required hardware and software for CUDA and PyTorch.
- The code includes result validation to ensure correctness of GPU computations.
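The FP32-vs-FP16 validation presumably compares the two products within a loose relative tolerance, since half precision accumulates noticeable rounding error. A minimal NumPy sketch of such a check (the function name, matrix size, and tolerance are illustrative, not taken from this repository):

```python
import numpy as np

def validate_fp16_against_fp32(n=256, rel_tol=5e-2, seed=0):
    """Multiply two random matrices in FP32 and FP16 and compare.

    Returns (rel_err, passed): the Frobenius-norm relative error of the
    FP16 product against the FP32 reference, and whether it falls within
    rel_tol. FP16 rounding makes only a loose tolerance realistic.
    """
    rng = np.random.default_rng(seed)
    a = rng.standard_normal((n, n)).astype(np.float32)
    b = rng.standard_normal((n, n)).astype(np.float32)

    ref = a @ b  # FP32 reference result
    approx = (a.astype(np.float16) @ b.astype(np.float16)).astype(np.float32)

    rel_err = float(np.linalg.norm(approx - ref) / np.linalg.norm(ref))
    return rel_err, rel_err < rel_tol
```

A norm-based error is used rather than an elementwise relative error because individual entries of the product can be near zero, which would inflate elementwise ratios arbitrarily.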