Skip to content

Commit 9fa4241

Browse files
committed
Add native nvidia backend for flash attention.
1 parent f3f4bf1 commit 9fa4241

File tree

4 files changed

+544
-12
lines changed

4 files changed

+544
-12
lines changed

src/infinicore/ops/flash_attention/flash_attention.cc

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,7 @@
11
#include "infinicore/ops/flash_attention.hpp"
22

33
#include "../../utils.hpp"
4+
#include "infinicore/context/context.hpp"
45

56
namespace infinicore::op {
67

0 commit comments

Comments
 (0)