Article
The Lean Table Format
Hudi, Iceberg, and Delta are the hottest jargon in modern data lake design. They bring impressive capabilities to cloud object storage such as ACID transactions, schema evolution, time…
Blog
Practical writing on simpler architectures, maintainable analytics systems, and choosing the right level of complexity for the problem in front of you.
Article
Hudi, Iceberg, and Delta are the hottest jargon in modern data lake design. They bring impressive capabilities to cloud object storage such as ACID transactions, schema evolution, time…
Article
Every week, I come across teams running massive PySpark clusters to process datasets that could easily fit on a single machine. The result? Bloated AWS bills and a false sense of “future…