workload-prediction

Here are 3 public repositories matching this topic...

Sakura66 / sagesched

SageSched: Intelligent LLM Request Scheduler with Workload Prediction — QoS-aware dual-queue scheduling for black-box LLM APIs (OpenAI/Azure/Doubao/Gemini)

api-gateway scheduler load-balancer openai qos faiss fastapi workload-prediction llm llm-inference llm-proxy gittins-index

Updated May 18, 2026
Python

belindanju / microservice-paper-readings

Star

Research paper and technical notes on the microservice ecosystem

microservice workload-prediction resourcemanagement

Updated May 22, 2023
HTML

LiuPengJugx / TORN-Join

Star

Self-adaptive data layout for distribute joins

data-layout workload-prediction distributed-joins horizontal-partition

Updated Oct 11, 2022
Jupyter Notebook

Improve this page

Add a description, image, and links to the workload-prediction topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the workload-prediction topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly