
EMI-Group/openPangu-7B-math-consistency-evals


Requirements

You can install the required packages with the following command:

Nvidia

conda create -n pangu-dev python=3.12
conda activate pangu-dev
pip install -r requirements.txt

Ascend

Assuming vllm==0.9.2 and vllm-ascend==0.9.2rc1 are already installed, run:

pip install -r requirements-ascend.txt

Evaluation

# eval of openPangu (slow thinking) on amc23 for avg@16
ASCEND_RT_VISIBLE_DEVICES=0,1 ./ascend-eval.sh amc23 16 slow

# eval of another model (e.g. Qwen2.5-MATH-7B) on aime24 for avg@16
ASCEND_RT_VISIBLE_DEVICES=0,1 ./ascend-eval-other.sh Qwen/Qwen2.5-MATH-7B aime24 16

# Summarize eval metrics on amc23 and var_amc23:
python ./score_analysis.py --dataset amc23
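The avg@k metric referenced above (e.g. avg@16) is conventionally computed by sampling each problem k times, averaging per-sample correctness for each problem, and then averaging over problems. The sketch below illustrates that convention; the function name and data layout are assumptions, not this repo's actual API.

```python
# Minimal sketch of an avg@k computation, assuming
# per_problem_correct[i] holds k pass/fail flags for problem i.
def avg_at_k(per_problem_correct: list[list[bool]]) -> float:
    # Mean correctness per problem, then mean over all problems.
    per_problem = [sum(flags) / len(flags) for flags in per_problem_correct]
    return sum(per_problem) / len(per_problem)

# Two problems, k = 4 samples each: (3/4 + 1/4) / 2 = 0.5
print(avg_at_k([[True, True, False, True], [False, False, True, False]]))  # -> 0.5
```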

To create a new evaluation dataset:

python ./csv2json.py
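A csv2json-style conversion typically reads problems from a CSV and writes one JSON record per line. The sketch below shows that pattern; the column names ("question", "answer") and output schema are assumptions for illustration, not the actual fields csv2json.py uses.

```python
# Hypothetical CSV-to-JSONL conversion; field names are assumed.
import csv
import json

def csv_to_jsonl(csv_path: str, jsonl_path: str) -> int:
    """Convert a CSV of problems to JSON Lines; return the record count."""
    count = 0
    with open(csv_path, newline="", encoding="utf-8") as fin, \
         open(jsonl_path, "w", encoding="utf-8") as fout:
        for row in csv.DictReader(fin):
            record = {"question": row["question"], "answer": row["answer"]}
            fout.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count
```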

Acknowledgement

The codebase is adapted from math-evaluation-harness.

We would like to express our gratitude to the OpenPangu team for open-sourcing the OpenPangu-Embedded-7B-V1.1 model. Their contributions to the community have been instrumental in this evaluation project.
