Skip to content

[Feature Request] Implement User Quotas and Token Limits for Enterprise Deployment #965

@DLM-unipd

Description

@DLM-unipd

Describe the solution you'd like

We'd like to have a way to control and limit the user interactions.
Some solutions could be: Token limits/quotas, hourly/daily/monthly token or cost limitations, definition of user quotas based on user group.

Why the solution needed

If the solution has to be deployed for a medium/large organization, we think that a user-based cost/usage control is a good thing to have.

Additional context

For each user we should monitor the current token consumption, and show the status to th user.

Implementation feasibility

Are you willing to collaborate with us to discuss the solution, decide on the approach, and assist with the implementation?

  • Yes, I am able to implement the feature and create a pull request.
  • No, I am unable to implement the feature, but I am open to discussing the solution.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions