Skip to content

hfmohammed18/ArcticInference

 
 

Repository files navigation

License Apache 2.0 PyPI version

Latest news

ArcticInference

ArcticInference is a new library from Snowflake AI Research that contains current and future LLM inference optimizations developed at Snowflake. It is integrated with vLLM v0.8.1 using vLLM’s custom plugin feature, allowing us to develop and integrate inference optimizations quickly into vLLM and make them available to the community.

Once installed, ArcticInference automatically patches vLLM to use Arctic Ulysses and other optimizations implemented in ArcticInference, and users can continue to use their familiar vLLM APIs and CLI. It’s easy to get started!

Installation

$ pip install "git+https://github.com/snowflakedb/ArcticInference.git#egg=arctic-inference[vllm]

Projects

To better understand what features ArcticInference supports please refer to the following list of projects we have released under this framework:

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 83.4%
  • Python 15.8%
  • Other 0.8%