About

October 13, 2022

Started as a place to collect useful information while putting together presentations on the importance of testing in modern data pipelines, TestLakeHaus aims to grow to be a useful resource for anyone looking to learn about Data Lakes, Lakehouses, Data Testing, and Pipeline orchestration.

We will, without a doubt, make forays into common software tools, and the basics of getting started with these tools. To give you an idea what to expect, here is a brief list of some tools we work with:

It would be impossible to talk about these tools without also talking about the underlying languages that drive the modern data stack, so we will have some content related to popular data languages:

We like to dabble in the equities markets, and lurk on @FinTwit so will be collecting some of the more sage wisdom that can be found in the bowels of twitter, along with some code samples, or library references that we find useful when analyzing market data.

Finally, shout out to the Hugo team for providing the platform so many quality bloggers, and companies use to get information out quickly in neat formatted fashion. If you aren’t familiar, Hugo is for people who want to hand code their own website without worrying about setting up complicated runtimes, dependencies and databases.

Websites built with Hugo are extremely fast, secure and can be deployed anywhere including Cloudflare, AWS, GitHub Pages, Heroku, Netlify and any other hosting provider.

Learn more and contribute on Hugo GitHub.