What is Collation in Databases?

In database systems, Collation specifies how data is sorted and compared in a database. Collation provides the sorting rules, case, and accent sensitivity properties for the data in the database.

For example, when you run a query using the ORDER BY clause, collation determines whether or not uppercase letters and lowercase letters are treated the same.

Collation is also used to determine how accents are treated, as well as character width and Japanese kana characters. Collation can also be used to distinguish between various ideographic variation selectors in certain collations (such as the Japanese_Bushu_Kakusu_140 and Japanese_XJIS_140 collations that were introduced in SQL Server 2017).

Different database management systems will provide different collation options. Depending on the DBMS, collation can be specified at the server level, the database level, the table level, and the column level. Collations can also be specified at the expression level (so you can specify which collation to use when you run a query), and at the identifier level.
Continue reading

“This Data” vs “These Data” – Which is Correct?

If you happen to read a lot of data-related material, you might occasionally find the word “data” being treated in different ways. In some cases you’ll see “this data is…” and in other cases “these data are…”.  You might even think “they obviously made a mistake with their grammar”.

Not so fast!

Continue reading

What is OLAP?

OLAP (Online Analytical Processing) is a category of database processing that facilitates business intelligence.

OLAP provides analysts, managers, and executives with the information they need to make effective decisions about an organization’s strategic directions. OLAP can provide valuable insights into how their business is performing, as well as how they can make improvements.

Continue reading

What is Normalization?

Normalization is the process of organizing a database to reduce redundancy and improve data integrity.

Normalization also simplifies the database design so that it achieves the optimal structure composed of atomic elements (i.e. elements that cannot be broken down into smaller parts).

Also referred to as database normalization or data normalization, normalization is an important part of relational database design, as it helps with the speed, accuracy, and efficiency of the database.

Continue reading

RDBMS vs DBMS – What’s the Difference?

There’s a lot of confusion regarding the difference between an RDBMSs and a DBMS. I’ve even seen “RDBMS vs DBMS” forum posts where the accepted answer outlines the differences between RDBMSs and DBMSs, as though they were two distinct and different things.

However, this can be misleading.

The fact is, an RDBMS is a DBMS. But a DBMS is not always an RDBMS (but it often is).

So, is there a difference between an RDBMS and a DBMS or not? Is “RDBMS vs DBMS” the right way of looking at it, or is there more to it?

Continue reading

What is a Document Store Database?

A document store database (also known as a document-oriented database, aggregate database, or simply document store or document database) is a database that uses a document-oriented model to store data.

Document store databases store each record and its associated data within a single document. Each document contains semi-structured data that can be queried against using various query and analytics tools of the DBMS.

Continue reading

What is a Key-Value Database?

A key-value database (also known as a key-value store and key-value store database) is a type of NoSQL database that uses a simple key/value method to store data.

The key-value part refers to the fact that the database stores data as a collection of key/value pairs. This is a simple method of storing data, and it is known to scale well.

Continue reading

What is NoSQL?

NoSQL is a term that refers loosely to a particular type of database model, or database management system (DBMS).

NoSQL is a very broad term that doesn’t refer to one particular database model. Rather, it refers to a whole variety of different models that don’t fit into the relational model.

Although NoSQL databases have been around since the 1960s, it wasn’t until the early 2000s that the NoSQL approach started to pick up steam, and a whole new generation of NoSQL systems began to hit the market.

Continue reading