AI/LLM Cheatsheet 13 — Fine-tuning

Cheatsheet: when to fine-tune, LoRA, QLoRA, OpenAI fine-tune.

May 26, 2026 · 2 min · 385 words · Manvendra Rajpoot

AI/LLM Cheatsheet 14 — Multimodal LLMs

Cheatsheet: vision LLMs, image inputs, audio, video.

May 26, 2026 · 3 min · 452 words · Manvendra Rajpoot

AI/LLM Cheatsheet 15 — Security and Prompt Injection

Cheatsheet: prompt injection, defenses, PII, jailbreaks.

May 26, 2026 · 3 min · 556 words · Manvendra Rajpoot

AI/LLM Cheatsheet 17 — Observability for LLMs

Cheatsheet: logging, traces, metrics, evals in prod.

May 26, 2026 · 3 min · 485 words · Manvendra Rajpoot

AI/LLM Cheatsheet 18 — LLM Application Patterns

Cheatsheet: classification, extraction, summarization, routing, decomposition.

May 26, 2026 · 3 min · 525 words · Manvendra Rajpoot

FastAPI Cheatsheet 18 — Streaming and LLM Integration

Cheatsheet: streaming Claude / GPT / vLLM tokens via SSE, tool-call loops, cancellation, prompt caching.

May 11, 2026 · 3 min · 470 words · Manvendra Rajpoot

AI/LLM Cheatsheet 19 — Building Chat UI

Cheatsheet: chat UI, streaming, markdown rendering, code blocks.

May 26, 2026 · 3 min · 624 words · Manvendra Rajpoot

AI/LLM Cheatsheet 20 — Production LLM App

Cheatsheet: full prod LLM app stack.

May 26, 2026 · 4 min · 752 words · Manvendra Rajpoot

Self-Hosting LLMs in 2026 — When the Math Actually Works

Practical LLM self-hosting math: GPU pricing, throughput per GPU, sustained load break-even, vLLM tuning, and when API still wins.

May 5, 2026 · 5 min · 881 words · Manvendra Rajpoot

Synthetic Data with LLMs in 2026 — Use Cases, Risks, and the Patterns That Work

Practical synthetic data: fine-tune training data, eval set generation, edge case enumeration, and the model-collapse / quality risks to watch.

May 5, 2026 · 5 min · 886 words · Manvendra Rajpoot