GitHub / InseeFrLab / auto-tuning-vllm
Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
JSON API: https://repos.data.code.gouv.fr/api/v1/hosts/GitHub/repositories/InseeFrLab%2Fauto-tuning-vllm
Stars: 2
Forks: 0
Open issues: 11
License: apache-2.0
Language: Python
Size: 2.82 MB
Dependencies parsed at: Pending
Created at: 26 days ago
Updated at: 15 days ago
Pushed at: 4 days ago
Last synced at: 4 days ago
Readme
Loading...