definition – Page 3

What is a Snowflake Schema?

March 9, 2026January 29, 2026 by Ian

In relational databases, a snowflake schema is a variation of the star schema where dimension tables are normalized into multiple related tables. Instead of keeping all dimensional attributes in a single wide table, you break them down into a hierarchy of tables that branch out like (you guessed it) a snowflake.

It’s basically what happens when you apply database normalization principles to a star schema’s dimension tables while keeping the central fact table intact.

What is Database as a Service (DBaaS)?

March 2, 2026January 26, 2026 by Ian

Database as a Service (DBaaS) is a cloud computing model where a provider hosts and manages your database for you. Instead of installing database software on your own servers, configuring it, and handling ongoing maintenance, you access a fully managed database over the internet. The provider takes care of all the technical heavy lifting while you focus on using the database for your applications.

What is a Distributed Database?

February 2, 2026January 23, 2026 by Ian

A distributed database is a database system where data is stored across multiple physical locations. This could be either on different servers in the same data center or across multiple data centers in different geographic regions. Instead of all your data residing on a single machine, it’s spread across several computers that work together as a unified system.

From an application’s perspective, a distributed database often looks like a single database. You connect to it and run queries as usual. Behind the scenes, however, the database system coordinates multiple servers to store data, process queries, and maintain consistency across all locations.

What is an Attribute in a Database?

January 21, 2026 by Ian

In database terminology, an attribute is a piece of information that describes an entity. If an entity is a thing you want to store data about, attributes are the specific details you’re storing. They’re the characteristics, properties, or facts that define what you know about each entity.

When you create a relational database, attributes become the columns in your tables. Each attribute represents one type of information you’re tracking.

What is a Database Entity?

January 21, 2026January 20, 2026 by Ian

In database design, an entity is something you want to store information about. It’s a person, place, thing, event, or concept that matters to your application and has data associated with it that you need to track.

Entities are the building blocks of database design. Before you create tables, write queries, or think about indexes, you need to identify what entities exist in your domain and what information you need to store about them.

What is a Missing Index in SQL Server?

January 17, 2026 by Ian

SQL Server has a concept of “missing indexes”. And no, it’s not referring to an index that used to be there but has now disappeared. Rather, the missing index concept is designed to help us improve the performance of our database.

A missing index is an index that doesn’t yet exist on your table but probably should. SQL Server actually tracks queries that would benefit from indexes and stores suggestions in a set of dynamic management views (DMVs) called the missing index DMVs.

When you run a query and the optimizer thinks that this would be way faster with an index on those columns, it logs that suggestion. Over time, these suggestions accumulate, giving you a prioritized list of indexes that could improve your database’s performance.

What is a Slowly Changing Dimension?

March 9, 2026January 14, 2026 by Ian

In data warehousing, a slowly changing dimension (SCD) is a dimension table where the attributes change over time, but not very frequently. The term “slowly changing” refers to the fact that these changes happen occasionally (perhaps days, weeks, or months apart) rather than constantly like transactional data.

The challenge is figuring out how to handle these changes while maintaining accurate historical analysis. When a customer moves to a new state or a product gets recategorized, you need a strategy that preserves the integrity of your historical data.

What is Polyglot Persistence?

February 2, 2026January 13, 2026 by Ian

Polyglot persistence is an architectural approach where an application uses multiple different database technologies, each chosen for its specific strengths and matched to particular data storage needs. Instead of forcing all your data into one database system, you use the right database for each job. For example, a relational database for transactional data, a document store for flexible content, a cache for session data, and a graph database for relationships.

What is a CRDT?

January 10, 2026 by Ian

A CRDT (Conflict-Free Replicated Data Type) is a special type of data structure designed for distributed systems that guarantees multiple replicas of data will eventually converge to the same state without requiring coordination between nodes. Even when different users simultaneously modify the same data in different locations, CRDTs automatically resolve conflicts in a mathematically consistent way that ensures all replicas eventually agree.

The main insight behind CRDTs is that certain operations can be designed to be commutative, meaning the order in which you apply them doesn’t matter. If operation A followed by operation B produces the same result as operation B followed by operation A, you can apply updates in any order and still reach the same final state. This property eliminates the need for complex conflict resolution logic.

What is Eventual Consistency?

January 10, 2026January 8, 2026 by Ian

Eventual consistency is a consistency model used in distributed databases where data updates don’t immediately propagate to all copies, but given enough time without new updates, all copies will eventually become identical. When you write data to one location in the system, other locations might temporarily see old data, but they’ll all catch up eventually. This usually happens within milliseconds or seconds (although it can be longer during network issues or node failures).

This approach contrasts with strong consistency, where every read is guaranteed to return the most recent write immediately. With eventual consistency, the system prioritizes availability and performance over immediate accuracy. You’re accepting that different parts of your database might temporarily disagree about the current state of the data in exchange for faster operations and better fault tolerance.