Releases: InternLM/xtuner
Releases · InternLM/xtuner
XTuner Release V0.1.16
What's Changed
- set dev version by @LZHgrla in #487
- Fix type error when the visual encoder is not CLIP by @hhaAndroid in #496
- [Feature] Support Sequence parallel by @HIT-cwh in #456
- [Bug] Fix bugs in flash_attn1_pytorch by @HIT-cwh in #513
- [Fix] delete cat in varlen attn by @HIT-cwh in #508
- bump version to 0.1.16 by @HIT-cwh in #520
- [Improve] Add
generation_kwargsforEvaluateChatHookby @LZHgrla in #501 - [Bugs] Fix bugs when training in non-distributed env by @HIT-cwh in #522
- [Fix] Support transformers>=4.38 and require transformers>=4.36.0 by @HIT-cwh in #494
- [Fix] Fix throughput hook by @HIT-cwh in #527
- Update README.md by @JianxinDong in #528
- [Fix] dispatch internlm rote by @HIT-cwh in #530
- Limit transformers != 4.38 by @HIT-cwh in #531
New Contributors
- @hhaAndroid made their first contribution in #496
- @JianxinDong made their first contribution in #528
Full Changelog: v0.1.15...v0.1.16
XTuner Release V0.1.15
What's Changed
- set dev version by @LZHgrla in #437
- [Bugs] Fix bugs when using EpochBasedRunner by @HIT-cwh in #439
- [Feature] Support processing ftdp dataset and custom dataset offline by @HIT-cwh in #410
- Update prompt_template.md by @aJupyter in #441
- [Doc] Split finetune_custom_dataset.md to 6 parts by @HIT-cwh in #445
- [Improve] Add notes for demo_data examples by @LZHgrla in #458
- [Fix] Gemma prompt_template by @LZHgrla in #454
- [Feature] Add LLaVA-InternLM2-1.8B by @LZHgrla in #449
- show more info about datasets by @amulil in #464
- [Fix] write text with
encoding='utf-8'by @LZHgrla in #477 - support offline process llava data by @HIT-cwh in #448
- [Fix]
msagent_react_map_fnerror by @LZHgrla in #470 - [Improve] Reorg
xtuner/configs/llava/configs by @LZHgrla in #483 - limit pytorch version <= 2.1.2 as there may be some bugs in triton2… by @HIT-cwh in #452
- [Fix] fix batch sampler bs by @HIT-cwh in #468
- bump version to v0.1.15 by @LZHgrla in #486
New Contributors
Full Changelog: v0.1.14...v0.1.15
XTuner Release V0.1.14
What's Changed
- set dev version by @LZHgrla in #341
- [Feature] More flexible
TrainLoopby @LZHgrla in #348 - [Feature]Support CEPH by @pppppM in #266
- [Improve] Add
--repetition-penaltyforxtuner chatby @LZHgrla in #351 - [Feature] Support MMBench DDP Evaluate by @pppppM in #300
- [Fix]
KeyErrorofencode_fnby @LZHgrla in #361 - [Fix] Fix
batch_sizeof full fine-tuing LLaVA-InternLM2 by @LZHgrla in #360 - [Fix] Remove
systemforalpaca_map_fnby @LZHgrla in #363 - [Fix] Use
DEFAULT_IMAGE_TOKENinstead of'<image>'by @LZHgrla in #353 - [Feature] Support internlm sft by @HIT-cwh in #302
- [Fix] Add
attention_maskfordefault_collate_fnby @LZHgrla in #371 - [Fix] Update requirements by @LZHgrla in #369
- [Fix] Fix rotary_base, add
colors_map_fntoDATASET_FORMAT_MAPPINGand rename 'internlm_repo' to 'intern_repo' by @HIT-cwh in #372 - update by @HIT-cwh in #377
- Delete useless codes and refactor process_untokenized_datasets by @HIT-cwh in #379
- [Feature] support flash attn 2 in internlm1, internlm2 and llama by @HIT-cwh in #381
- [Fix] Fix installation docs of mmengine in
intern_repo_dataset.mdby @LZHgrla in #384 - [Fix] Update InternLM2
apply_rotary_pos_embby @LZHgrla in #383 - [Feature] support saving eval output before save checkpoint by @HIT-cwh in #385
- fix lr scheduler setting by @gzlong96 in #394
- [Fix] Remove pre-defined
systemofalpaca_zh_map_fnby @LZHgrla in #395 - [Feature] Support
Qwen1.5by @LZHgrla in #407 - [Fix] Fix no space in chat output using InternLM2. (#357) by @KooSung in #404
- [Fix] typo:
--system-promptto--system-templateby @LZHgrla in #406 - [Improve] Add
output_with_lossfor dataset process by @LZHgrla in #408 - [Fix] Fix dispatch to support transformers>=4.36 & Add USE_TRITON_KERNEL environment variable by @HIT-cwh in #411
- [Feature]Add InternLM2-Chat-1_8b full config by @KMnO4-zx in #396
- [Fix] Fix extract_json_objects by @fanqiNO1 in #419
- [Fix] Fix pth_to_hf error by @LZHgrla in #426
- [Feature] Support
Gemmaby @PommesPeter in #429 - add refcoco to llava by @LKJacky in #425
- [Fix] Inconsistent BatchSize of
LengthGroupedSamplerby @LZHgrla in #436 - bump version to v0.1.14 by @LZHgrla in #431
New Contributors
- @gzlong96 made their first contribution in #394
- @KooSung made their first contribution in #404
- @KMnO4-zx made their first contribution in #396
- @fanqiNO1 made their first contribution in #419
- @PommesPeter made their first contribution in #429
- @LKJacky made their first contribution in #425
Full Changelog: v0.1.13...v0.1.14
XTuner Release V0.1.13
What's Changed
- set dev version by @LZHgrla in #329
- [Docs] Add LLaVA-InternLM2 results by @LZHgrla in #332
- Update internlm2_chat template by @RangiLyu in #339
- [Fix] Fix examples demo_data configs by @LZHgrla in #334
- bump version to v0.1.13 by @LZHgrla in #340
New Contributors
Full Changelog: v0.1.12...v0.1.13
XTuner Release V0.1.12
What's Changed
- set dev version by @LZHgrla in #281
- [Fix] Update LLaVA results by @LZHgrla in #283
- [Fix] Update LLaVA results (based on VLMEvalKit) by @LZHgrla in #285
- [Fix] Fix filter bug for test data by @LZHgrla in #293
- [Fix] Fix
ConcatDatasetby @LZHgrla in #298 - [Improve] Redesign the
prompt_templateby @LZHgrla in #294 - [Fix] Fix errors about
stop_wordsby @LZHgrla in #313 - [Fix] Fix Mixtral LoRA setting by @LZHgrla in #312
- [Feature] Support DeepSeek-MoE by @LZHgrla in #311
- [Fix] Set
torch.optim.AdamWas the default optimizer by @LZHgrla in #318 - [FIx] Fix
pth_to_hffor LLaVA model by @LZHgrla in #316 - [Improve] Add
demo_dataexamples by @LZHgrla in #278 - [Feature] Support InternLM2 by @LZHgrla in #321
- [Fix] Fix the resume of seed by @LZHgrla in #309
- [Feature] Accelerate
xtuner xxxby @pppppM in #307 - [Fix] Fix InternLM2 url by @LZHgrla in #325
- [Fix] Limit the version of python,
>=3.8, <3.11by @LZHgrla in #327 - [Fix] Add
trust_remote_code=Truefor AutoModel by @LZHgrla in #328 - [Docs] Improve README by @LZHgrla in #326
- bump verion to v0.1.12 by @pppppM in #323
Full Changelog: v0.1.11...v0.1.12
XTuner Release V0.1.11
What's Changed
- [Docs] Update Mixtral 8x7b docs by @LZHgrla in #265
- [Bug] Fix bugs when chat with --lagent by @ooooo-create in #269
- [Feature] Support setting the random seed for
xtuner trainby @LZHgrla in #272 - [Fix] Update Mixtral-8x7b repo_id; Add mixtral template by @LZHgrla in #275
- [Feature] Add Qwen 72b config by @xiaohangguo in #254
- [Improve] Add notes for requirements; Improve badges by @LZHgrla in #277
- [Feature] Support LLaVA by @LZHgrla in #196
- [Feature] Add
warmupfor all configs by @LZHgrla in #274 - bump version to v0.1.11 by @LZHgrla in #280
New Contributors
- @ooooo-create made their first contribution in #269
Full Changelog: v0.1.10...v0.1.11
XTuner Release V0.1.10
What's Changed
- [Feature] Support for full-scale fine-tuning of large language models such as Llama2 70B. by @HIT-cwh in #231
- [Feature] Support to process internlm-style datasets by @HIT-cwh in #232
- [Fix] Fix bugs of llama dispatch by @LZHgrla in #229
- [Bug] Resolve the bug introduced by higher versions of DeepSpeed. by @HIT-cwh in #240
- [Doc] Add internlm dataset doc by @HIT-cwh in #242
- add
wizardcodertemplate by @xiaohangguo in #243 - [Feature] Filter negative labels by @xiaohangguo in #244
- [Bug] Support auto detect torch_dtype in chat.py by @HIT-cwh in #250
- [Feature] Add Qwen 1.8b config by @xiaohangguo in #252
- [Feature]Add Deepseekcoder config by @xiaohangguo in #253
- [Bug] Fix bugs when grad clip == 0 by @HIT-cwh in #262
- [Feature] Support Mixtral 8x7b by @pppppM in #263
- bump version to v0.1.10 by @pppppM in #264
New Contributors
- @xiaohangguo made their first contribution in #243
- @pppppM made their first contribution in #263
Full Changelog: v0.1.9...v0.1.10
XTuner Release V0.1.9
XTuner Release V0.1.8
What's Changed
- [Feature] Add mistral pretrain by @DumoeDss in #204
- [Feature] add yi-6b and yi-34b sft script by @amulil in #216
- [Docs] Add Introduction docs for config by @LZHgrla in #212
- [Fix] Fix MMLU evaluation by @LZHgrla in #208
- [Feature] Support ChatGLM3-6B by @LZHgrla in #222
- [Fix] Set default
eta_minto 0. by @LZHgrla in #223 - bump version to 0.1.8 by @LZHgrla in #224
New Contributors
Full Changelog: v0.1.7...v0.1.8
XTuner Release V0.1.7
What's Changed
- add zephyr config by @maxchiron in #188
- [Feature] Support "auto" fp16/bf16 for DeepSpeed by @LZHgrla in #195
- [Fix] Temporarily limit the version of
transformersby @LZHgrla in #200 - bump version to 0.1.7 by @LZHgrla in #201
New Contributors
- @maxchiron made their first contribution in #188
Full Changelog: v0.1.6...v0.1.7