Commit 2fbd2b8
add diverse_stage2 add optimize diverse_stage1 (#1174)
Co-authored-by: wangzaijun <[email protected]>
1 parent f0481a8 commit 2fbd2b8
File tree
77 files changed, +1160 -189 lines changed
- lightllm/common
  - all_kernel_configs
    - _fwd_kernel_flash_decode_diverse_stage1:v2
    - _fwd_kernel_flash_decode_diverse_stage2:v1
  - basemodel
    - attention/triton
    - triton_kernel/att/decode_att/int8kv
  - triton_utils/autotune_kernel_configs/triton_3.5.1
    - NVIDIA_GeForce_RTX_4090_D/_fwd_kernel_flash_decode_diverse_stage2:v1
    - NVIDIA_GeForce_RTX_5090/_fwd_kernel_flash_decode_diverse_stage2:v1
- test
  - benchmark/static_inference
  - kernel
- unit_tests/common/basemodel/triton_kernel/att/decode_att/int8kv