AI/LLM Cheatsheet 13 — Fine-tuning
Cheatsheet: when to fine-tune, LoRA, QLoRA, OpenAI fine-tune.
Cheatsheet: when to fine-tune, LoRA, QLoRA, OpenAI fine-tune.
Cheatsheet: vision LLMs, image inputs, audio, video.
Cheatsheet: prompt injection, defenses, PII, jailbreaks.
Cheatsheet: logging, traces, metrics, evals in prod.
Cheatsheet: classification, extraction, summarization, routing, decomposition.
Cheatsheet: streaming Claude / GPT / vLLM tokens via SSE, tool-call loops, cancellation, prompt caching.
Cheatsheet: chat UI, streaming, markdown rendering, code blocks.
Cheatsheet: full prod LLM app stack.
Practical LLM self-hosting math: GPU pricing, throughput per GPU, sustained load break-even, vLLM tuning, and when API still wins.
Practical synthetic data: fine-tune training data, eval set generation, edge case enumeration, and the model-collapse / quality risks to watch.