GPU on Manvendra Rajpoot

Recent content in GPU on Manvendra Rajpoot
https://blog.rajpoot.dev/tags/gpu/
https://blog.rajpoot.dev/img/personal/cover.png
Sun, 17 May 2026 17:50:46 +0530

Self-Hosted LLMs in 2026 — Ollama, vLLM, and When to Skip the API
https://blog.rajpoot.dev/posts/ai/self-hosted-llms-vllm-ollama-2026/
Tue, 28 Apr 2026 20:50:00 +0530
When to self-host LLMs in 2026: Ollama for development, vLLM and SGLang for production, model choice, hardware sizing, and the latency and cost tradeoffs versus hosted APIs.