Commit 2e04b4e

[Cherry-Pick][Optimization]Decode attention support(#5767) (#5833)
* [Optimizer] Support decode attention static c8 op
* [Feature] Support decode attention backend
* code style fix
1 parent 5210f8d commit 2e04b4e

28 files changed, +5416 -11 lines changed

.github/workflows/_logprob_test_linux.yml

Lines changed: 1 addition & 1 deletion
@@ -163,7 +163,7 @@ jobs:
 -d "{\"messages\": [{\"role\": \"user\", \"content\": \"1+1=?\"}], \"logprobs\": true}"
 set +e
 rm -rf ./baseline_output
-cp -r baseline/ERNIE-4.5-0.3B-Paddle ./baseline_output
+cp -r baseline_1131/ERNIE-4.5-0.3B-Paddle ./baseline_output
 LOGPROB_EXIT_CODE=0
 python3.10 lanucher.py --request_template TOKEN_LOGPROB --url http://localhost:${FD_API_PORT}/v1/chat/completions --case ./cases/demo.yaml --concurrency 1 --name demo --exe logprob || LOGPROB_EXIT_CODE=$?
 echo "LOGPROB_EXIT_CODE=${LOGPROB_EXIT_CODE}" > /workspace/exit_code.env
