AI/LLM Cheatsheet 11 — Cost Optimization

Cheatsheet: prompt caching, batching, model selection, output limits.

May 26, 2026 · 3 min · 427 words · Manvendra Rajpoot

AI/LLM Cheatsheet 12 — Local LLMs (Ollama, vLLM)

Cheatsheet: Ollama, vLLM, llama.cpp, when to self-host.

May 26, 2026 · 3 min · 506 words · Manvendra Rajpoot

AI/LLM Cheatsheet 13 — Fine-tuning

Cheatsheet: when to fine-tune, LoRA, QLoRA, OpenAI fine-tune.

May 26, 2026 · 2 min · 385 words · Manvendra Rajpoot

AI/LLM Cheatsheet 14 — Multimodal LLMs

Cheatsheet: vision LLMs, image inputs, audio, video.

May 26, 2026 · 3 min · 452 words · Manvendra Rajpoot

AI/LLM Cheatsheet 15 — Security and Prompt Injection

Cheatsheet: prompt injection, defenses, PII, jailbreaks.

May 26, 2026 · 3 min · 556 words · Manvendra Rajpoot

AI/LLM Cheatsheet 16 — Vector DBs Deep Dive

Cheatsheet: vector DBs, HNSW, hybrid search, sharding.

May 26, 2026 · 3 min · 553 words · Manvendra Rajpoot

AI/LLM Cheatsheet 17 — Observability for LLMs

Cheatsheet: logging, traces, metrics, evals in prod.

May 26, 2026 · 3 min · 485 words · Manvendra Rajpoot

AI/LLM Cheatsheet 18 — LLM Application Patterns

Cheatsheet: classification, extraction, summarization, routing, decomposition.

May 26, 2026 · 3 min · 525 words · Manvendra Rajpoot

AI/LLM Cheatsheet 19 — Building Chat UI

Cheatsheet: chat UI, streaming, markdown rendering, code blocks.

May 26, 2026 · 3 min · 624 words · Manvendra Rajpoot

AI/LLM Cheatsheet 20 — Production LLM App

Cheatsheet: full prod LLM app stack.

May 26, 2026 · 4 min · 752 words · Manvendra Rajpoot