Last week we had some performance issues in a bitemporal model, which by the looks of it was the result of a poorly selected execution plan in SQL Server. The reasoning behind this conclusion was that if parts of the query were first run separately with results stored in temp tables, and these later used, the issues were gone. This had me thinking though: Could something be done in order to get a better plan through the point-in-time views?| Anchor Modeling
I have been thinking about emergent properties lately, given abilities large language models exhibit that are hard to explain from the nature of their architecture and training alone. To experiment with such properties, we will be using a simpler device than the typical neuron found in artificial neural networks, which in turn are simplifications of those found in biological … Continue reading The Resilience of Emergence| Anchor Modeling
This article was written in its entirety by Bing Chat. I wanted to entertain the idea of a bot endorsing the JBOT (just a bunch of tables) style data lake. Its technology might be used to circumvent many of the numerous issues that follow from such a style of data management. Lars Rönnbäck Data lakes … Continue reading The Intelligent Lake| Anchor Modeling
Christian Kaul and Lars Rönnbäck Simplicity — the art of maximizing the amount of work not done — is essential. Mike Beedle, Arie van Bennekum, Alistair Cockburn, Ward Cunningham, Martin Fowler, Jim Highsmith, Andrew Hunt, Ron Jeffries, Jon Kern, Brian Marick, Robert C. Martin, Ken Schwaber, Jeff Sutherland, Dave Thomas, Principles behind the Agile Manifesto (2001). Part … Continue reading Towards a Model-Driven Organization (Part 2)| Anchor Modeling
We have just released the very first version of unstruct, v0.1.3. Unstruct is an Open Source program that parses simple XML files into text files, suitable for bulk inserts into a relational database. It is written in Rust and the goal is to be more performant than loading XML into the database and doing the … Continue reading Unstruct| Anchor Modeling
Back in 2012 data modelers were fighting back an invasion of JBOTs, and I was reporting back from the front lines at various conferences. JBOTs were rapidly taking over our data warehouses, destroying them to the point where they had to be rebuilt from scratch. The average lifetime of a data warehouse was becoming shorter … Continue reading The Return of the JBOT| Anchor Modeling
We are bringing back public models to the Anchor modeling tool. This is still in test and models are loaded from GitHub gists. If you put your model XML (with .xml file extension) in a public gist and make a note of the gist ID, you can call the modeling tool with this ID as … Continue reading Public Models| Anchor Modeling
In this article, the first in a series, we briefly describe the issues resulting from this disconnect and their origins within a traditional organization. We then suggest a radical shift, to a model-driven organization, where all applications work towards a single data platform with a unified model. Instead of creating models that mirror the existing organization and its dysfunctions, we suggest first creating a unified model based on the goals of the organization, and thereafter derive the o...| Anchor Modeling
Quoting the video description: The Data Vault approach gives the data modelers a lot of options to choose from: how many satellites to create, how to connect hubs with links, what historicity to use, which field to use as a business key. Such flexibilites leaves a lot of options for inoptimal modeling decisions. I want … Continue reading Large Scale Anchor Modeling| Anchor Modeling
We failed. I recently attended the Knowledge Gap conference, where we had several discussions related to data modeling. We all agreed that we are in a distressful situation both concerning the art as a whole but also its place in modern architectures, at least when it comes to integrated data models. As an art, we … Continue reading Atomic Data| Anchor Modeling