GitHub / France-Travail / vllm-ft
A high-throughput and memory-efficient inference and serving engine for LLMs
JSON API: https://repos.ecosystem.code.gouv.fr/api/v1/hosts/GitHub/repositories/France-Travail%2Fvllm-ft
Stars: 7
Forks: 1
Open issues: 1
License: apache-2.0
Language: Python
Size: 67.3 MB
Dependencies parsed at: Pending
Created at: 8 months ago
Updated at: 17 days ago
Pushed at: 2 months ago
Last synced at: 6 days ago
Funding Links https://github.com/sponsors/vllm-project, https://opencollective.com/vllm
Loading...