Python Data Pipelines in 2026 — Polars, Ibis, DuckDB, and the Practical Stack

Practical Python data: Polars for in-memory DataFrames, DuckDB for SQL on files, Ibis for portable expression API, and how to compose them.

May 5, 2026 · 4 min · 780 words · Manvendra Rajpoot

Synthetic Data with LLMs in 2026 — Use Cases, Risks, and the Patterns That Work

Practical synthetic data: fine-tune training data, eval set generation, edge case enumeration, and the model-collapse / quality risks to watch.

May 5, 2026 · 5 min · 886 words · Manvendra Rajpoot

Argo Workflows vs Airflow vs Dagster vs Prefect in 2026

Picking a workflow orchestrator in 2026. Argo Workflows for Kubernetes-native; Airflow for mature ETL; Dagster for data-aware orchestration; Prefect for Python-first. The decision matrix.

April 30, 2026 · 3 min · 465 words · Manvendra Rajpoot