LLM | Manvendra Rajpoot

AI/LLM Cheatsheet 13 — Fine-tuning

Cheatsheet: when to fine-tune, LoRA, QLoRA, OpenAI fine-tune.

Cheatsheet: vision LLMs, image inputs, audio, video.

Cheatsheet: prompt injection, defenses, PII, jailbreaks.

Cheatsheet: logging, traces, metrics, evals in prod.

Cheatsheet: classification, extraction, summarization, routing, decomposition.

Cheatsheet: streaming Claude / GPT / vLLM tokens via SSE, tool-call loops, cancellation, prompt caching.

Cheatsheet: chat UI, streaming, markdown rendering, code blocks.

Cheatsheet: full prod LLM app stack.

Practical LLM self-hosting math: GPU pricing, throughput per GPU, sustained load break-even, vLLM tuning, and when API still wins.

Practical synthetic data: fine-tune training data, eval set generation, edge case enumeration, and the model-collapse / quality risks to watch.