
Add multi-batch inference support, fix hivemind dependency, and improve installation process #27

Merged
HaibaraAiChan merged 10 commits into ai-decentralized:main from JiuChen0:upload
Nov 1, 2025

Contributor

JiuChen0 commented Nov 1, 2025

Key Changes

  1. Multi-batch inference support
    Enables inference with a configurable batch size, so several sequences can be processed in parallel.

  2. Fix hivemind dependency installation issue
    Resolved issues with installing hivemind dependencies in certain environments.

  3. Remove hardcoded model restriction
    Removed the hardcoded limitation that only allowed loading LLaMA 7B, enabling support for other model sizes.

  4. Update installation instructions in README.md
    Simplified the installation process. The entire BloomBee environment can now be installed and configured with a single command:

    pip install -e .
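
The multi-batch change in item 1 can be illustrated with a minimal sketch. The `--batch_size` flag name comes from this PR's commit message; everything else here (the helper names `parse_args`, `iter_batches`, `pad_batch`, the default of 1, the pad id of 0, and the choice of left padding) is an assumption made for illustration and is not taken from the BloomBee code.

```python
import argparse
from typing import Iterator, List, Optional, Sequence, Tuple


def parse_args(argv: Optional[Sequence[str]] = None) -> argparse.Namespace:
    parser = argparse.ArgumentParser(description="Batched inference sketch")
    # Flag name taken from the PR; the default value is an assumption.
    parser.add_argument("--batch_size", type=int, default=1,
                        help="number of sequences processed together")
    return parser.parse_args(argv)


def iter_batches(items: Sequence[List[int]], batch_size: int) -> Iterator[List[List[int]]]:
    """Yield consecutive groups of at most `batch_size` sequences."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield list(items[start:start + batch_size])


def pad_batch(token_ids: Sequence[Sequence[int]],
              pad_id: int = 0) -> Tuple[List[List[int]], List[List[int]]]:
    """Left-pad every sequence to the longest one and build an attention mask.

    Left padding is an illustrative choice, not necessarily what BloomBee does.
    """
    width = max(len(ids) for ids in token_ids)
    padded = [[pad_id] * (width - len(ids)) + list(ids) for ids in token_ids]
    mask = [[0] * (width - len(ids)) + [1] * len(ids) for ids in token_ids]
    return padded, mask


if __name__ == "__main__":
    args = parse_args(["--batch_size", "2"])
    prompts = [[5, 6, 7], [8], [9, 10]]  # hypothetical token ids
    for batch in iter_batches(prompts, args.batch_size):
        ids, mask = pad_batch(batch)
        print(ids, mask)
```

Padding to a common length is what lets sequences of different lengths share one batch; the mask marks which positions hold real tokens.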

@HaibaraAiChan HaibaraAiChan merged commit 862bd3b into ai-decentralized:main Nov 1, 2025
JiuChen0 added a commit to JiuChen0/BloomBee that referenced this pull request Mar 22, 2026
…ve installation process (ai-decentralized#27)

* Add batch inference support and CPU compatibility

- Add --batch_size CLI argument for parallel sequence processing
- Add conditional CUDA stream creation for CPU-only mode
- Add device-aware ExecutionEnv and Policy resource distribution
- Fix MPS compatibility on macOS

* fix hardcode of model loading and support batch size

* Resolving dependency conflicts

* docs: refine README setup and usage sections for clarity and correctness

* Add batch size related updates

* delete debug output

* delete .id files

* fix max token size problem

* add prompt

* clear the debug print
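
The CPU-compatibility items in the first commit above (conditional CUDA stream creation, MPS handling) might look roughly like the following sketch. The function names `pick_device`, `maybe_create_stream`, and `stream_scope` are hypothetical and do not come from the BloomBee code. The availability flags and stream factories are passed in as arguments so the example runs without PyTorch installed.

```python
import contextlib
from typing import Any, Callable, ContextManager, Optional


def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Prefer CUDA, then MPS (macOS), then fall back to CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def maybe_create_stream(device: str, make_stream: Callable[[], Any]) -> Optional[Any]:
    """Create a stream only on CUDA; CPU and MPS runs get None."""
    return make_stream() if device == "cuda" else None


def stream_scope(stream: Optional[Any],
                 enter_stream: Callable[[Any], ContextManager]) -> ContextManager:
    """Return a context manager that is a no-op when there is no stream."""
    if stream is None:
        return contextlib.nullcontext()
    return enter_stream(stream)


if __name__ == "__main__":
    device = pick_device(cuda_available=False, mps_available=False)
    stream = maybe_create_stream(device, make_stream=lambda: object())
    with stream_scope(stream, enter_stream=lambda s: contextlib.nullcontext(s)):
        print("running on", device)
```

With PyTorch, the arguments would typically be `torch.cuda.is_available()`, `torch.backends.mps.is_available()`, `torch.cuda.Stream`, and `torch.cuda.stream`. Because the stream is created only on CUDA, the same code path can run on CPU-only machines.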

---------

Co-authored-by: Danny Willow Liu <dannywillowliu@uchicago.edu>