About
I'm E. Marlow. I've spent the last several years building and babysitting batch and streaming data pipelines — the unglamorous plumbing that moves rows from one place to another and, occasionally, loses a few of them along the way.
This is a personal notebook. I write things down here mostly so I stop re-solving the same problems: retries, ordering, partitioning, the eternal argument about exactly-once. If a note is useful to someone else, good. If it's wrong, tell me and I'll fix it.
Opinions here are my own and tend to change as I get burned by new things. Tools I reach for most often: Postgres, Kafka, Spark, Airflow, dbt, and a lot of plain SQL.
No comments, no newsletter, no tracking. Just the RSS feed.