Blog Post

Book Review – Deciphering Data Architectures

I recently purchased and read the book Deciphering Data Architectures – Choosing Between a Modern Data Warehouse, Data Fabric, Data Lakehouse, and Data Mesh by James Serra. James, who has worked at Microsoft for quite some time now, has an interesting blog on data architecture and data warehousing, and I had been looking forward to reading this book ever since it was announced. I have to tell you, it did not disappoint.

The strength of this book is that it is quite short. At 242 pages, it is definitely not the biggest book in my library. But James succeeds in giving a decent overview of the relevant data architectures, explaining them in just enough detail that you understand what their purpose is, what their advantages and disadvantages are, and when to use them. Of course, you won’t get the same level of detail as in other data architecture books, for example The Data Warehouse Toolkit by Ralph Kimball, but that’s not the point of this book. He does go to great lengths, though, to say that data mesh probably isn’t useful for most companies 🙂

In my opinion, everyone working with data should read this book. That goes especially for people working with analytical data (which is the main focus), but also for, say, a DBA supporting an operational system. It will broaden your perspective and you’ll learn what all those buzzwords mean that the business (or the consultants) are throwing around. Even non-technical people can read this book to familiarize themselves with the concepts. The book doesn’t go into technology (aside from a few sections in the last chapter that talk about Hadoop, Snowflake and Databricks). Even though James works for Microsoft, it’s not a book praising the Azure ecosystem or anything like that.

The book does a great job of describing the different data architectures, but it also talks about data modelling (e.g. normalization and denormalization) and people & processes, and it has a great chapter on architecture design sessions. I don’t agree 100% with everything (and James explicitly states this is OK: nothing is set in stone and discussion is always possible). For example, James states that Inmon is more commonly used than Kimball, and I tend to disagree. This might be a geographical thing or something more anecdotal, but I’ve seen way more Kimball-style data warehouse implementations than Inmon-style ones. There’s only one error (that I’m aware of), and that is in the description of a type 3 slowly changing dimension. James says this is a dimension where change is kept for every attribute. That is still a type 2 dimension though, where each change adds a new row. A type 3 dimension keeps the previous value in an extra column instead. For example, one column stores the current e-mail address and another column stores the previous e-mail address.
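To make the type 2 versus type 3 distinction a bit more concrete, here is a minimal Python sketch. It is not taken from the book; the class names, column names and helper functions (CustomerType2, CustomerType3, apply_type2, apply_type3) are all made up for illustration, and a real warehouse would of course do this in SQL or an ETL tool.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class CustomerType2:
    """Type 2 (hypothetical layout): one row per version of the customer."""
    customer_key: int         # surrogate key, unique per version
    customer_id: str          # business key, shared by all versions
    email: str
    valid_from: date
    valid_to: Optional[date]  # None marks the current version


@dataclass(frozen=True)
class CustomerType3:
    """Type 3 (hypothetical layout): one row per customer, extra column for the prior value."""
    customer_id: str
    current_email: str
    previous_email: Optional[str]


def apply_type2(rows: List[CustomerType2], customer_id: str,
                new_email: str, change_date: date) -> List[CustomerType2]:
    """Close the current row for this customer and append a new version row."""
    result = []
    for row in rows:
        if row.customer_id == customer_id and row.valid_to is None:
            row = replace(row, valid_to=change_date)  # end-date the old version
        result.append(row)
    next_key = max((r.customer_key for r in rows), default=0) + 1
    result.append(CustomerType2(next_key, customer_id, new_email, change_date, None))
    return result


def apply_type3(row: CustomerType3, new_email: str) -> CustomerType3:
    """Overwrite in place, shifting the current value into the 'previous' column."""
    return CustomerType3(row.customer_id, new_email, row.current_email)
```

If you apply two e-mail changes to the same customer, the type 2 table ends up with three rows (the full history), while the type 3 row still is a single row that only remembers the current and the previous address. The oldest address is gone, which is exactly why type 3 only gives you limited history.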

The only disadvantage of this book is, in my opinion, the price. It has a list price of $79.99, which seems a bit excessive for 242 pages of content. The figures aren’t even printed in color. This reminds me of the data vault book, which also had a ridiculously high price (although that one was printed in color). However, don’t let this stop you from buying this book. It is definitely an asset, and you might get it at a discount anyway.

Conclusion: read this book. It will make you a better data practitioner.

The post Book Review – Deciphering Data Architectures first appeared on Under the kover of business intelligence.
