asprenger/ray-vllm-inference
asprenger/ray-vllm-inference: A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. License: apache-2.0. Hugging Bay hosted release. Scan: pending.
- License
- apache-2.0
- Scan status
- pending
- Hosting status
- external
- Upstream
- asprenger/ray_vllm_inference
Open interactive artifact page