Pull requests: huggingface/transformers

perf(qwen3_vl): replace Conv3d with F.linear in patch embed forward
#45771 opened May 4, 2026 by jashshah999 Contributor
3 tasks done
Unwrap text_config in AutoModelFor*.from_config
#45770 opened May 4, 2026 by jamesbraza Contributor
End-to-end test of Gemma 3 + FA2 construction
#45760 opened May 3, 2026 by jamesbraza Contributor
fix attribute access in PermuteForRope._apply
#45756 opened May 3, 2026 by CharlieKerfoot
Fix mps device check for moe histogram routing
#45754 opened May 3, 2026 by belamaran96-coder
3 of 6 tasks
Add Conformer model
#45751 opened May 3, 2026 by jonghwanhyeon Contributor
4 of 6 tasks
fix: correct spelling in continuous_api docstring
#45749 opened May 3, 2026 by Dhruv908615
6 tasks
Fix split batch size
#45747 opened May 2, 2026 by Prachi-kushwaha
6 tasks
Fix link to modular transformers documentation
#45746 opened May 2, 2026 by SangbumChoi Contributor
6 tasks
deepseek r1 distilled tokenizer fix for qwen2 mapping
#45741 opened May 2, 2026 by itazap Collaborator
DeepSeek OCR specifies an incorrect tokenizer class on the Hub
#45739 opened May 1, 2026 by hmellor Member
[CB] Fixes for SDPA and CPU offloading
#45733 opened May 1, 2026 by remi-or Collaborator
[skills] fine-tuning
#45732 opened Apr 30, 2026 by stevhliu Member Draft