zzhang-fr

Follow

Zhen Zhang zzhang-fr

Follow

4 followers · 2 following

Popular repositories Loading

vllm-omni vllm-omni Public

Forked from vllm-project/vllm-omni

A framework for efficient model inference with omni-modality models

Python
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
ProxyAttn ProxyAttn Public

Forked from wyxstriker/ProxyAttn

Implementation of the paper "ProxyAttn: Guided Sparse Attention via Representative Heads".

Python
FastVideo FastVideo Public

Forked from hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation.

Python
RULER RULER Public

Forked from NVIDIA/RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python