hip_cuda_examples

NVIDIA CUDA and AMD HIP implementations of matrix multiplication, vector add, reduction, and layernorm kernels. Each kernel is written for several data types, such as fp64, fp32, fp16 (half), and half2.
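When comparing the same kernel across data types, a plain host-side reference is useful for checking GPU results. The following is a minimal sketch under stated assumptions, not code taken from this repository: the function names (`vector_add_ref`, `reduce_sum_ref`, `layernorm_ref`) are hypothetical, the layernorm omits the learned scale and bias, and the default `eps` is an assumed value. It is ordinary C++, so it should build unchanged as host code in either a CUDA or a HIP project.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical reference for vector add: c[i] = a[i] + b[i].
// T is the storage type under test (e.g. float or double).
template <typename T>
std::vector<T> vector_add_ref(const std::vector<T>& a, const std::vector<T>& b) {
    assert(a.size() == b.size());
    std::vector<T> c(a.size());
    for (std::size_t i = 0; i < a.size(); ++i) {
        c[i] = a[i] + b[i];
    }
    return c;
}

// Hypothetical reference for a sum reduction.
// Accumulates in double so the reference stays accurate for narrow T.
template <typename T>
double reduce_sum_ref(const std::vector<T>& x) {
    double acc = 0.0;
    for (const T& v : x) {
        acc += static_cast<double>(v);
    }
    return acc;
}

// Hypothetical reference for layernorm over one non-empty row, without scale/bias:
// y[i] = (x[i] - mean) / sqrt(var + eps), using the biased variance.
template <typename T>
std::vector<T> layernorm_ref(const std::vector<T>& x, double eps = 1e-5) {
    assert(!x.empty());
    const double n = static_cast<double>(x.size());
    const double mean = reduce_sum_ref(x) / n;
    double var = 0.0;
    for (const T& v : x) {
        const double d = static_cast<double>(v) - mean;
        var += d * d;
    }
    var /= n;
    const double inv_std = 1.0 / std::sqrt(var + eps);
    std::vector<T> y(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        y[i] = static_cast<T>((static_cast<double>(x[i]) - mean) * inv_std);
    }
    return y;
}
```

A GPU result copied back to the host can then be compared element by element against these references, with a tolerance chosen per data type (looser for fp16 and half2 than for fp32 or fp64).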

Folders and files (107 commits):

amd_matrix_core
amd_sparse_matrix
amd_wmma
bkup
common
cuda
cuda_rt
hip-python
hip
hip_rt
others
rocblas
scripts
README.md
