Understanding EXTRACT() in DuckDB

DuckDB offers a variety of date functions, one of which is the extract() function. This function is designed to retrieve a specific date part from a date or timestamp value, and it can also be applied to intervals.

In this article, we’ll take a closer look at the extract() function and provide some straightforward examples to illustrate how it works.

What is the extract() Function?

The extract() function extracts a specific part of a date/time such as the year, month, day, hour, minute, or second. It can also extract various components from an interval value.

The extract() function does the same thing that the date_part() function does, but with a different syntax.

Syntax

The syntax for the extract() function is as follows:

extract(part from date)
  • part: This is a string that specifies the date part to extract. Some commonly used values include 'year', 'month', 'day', 'hour', 'minute', 'second', etc. See the tables at the end of this article for a full list of values that can be used here.
  • date: This is the date, timestamp, or interval value from which you want to extract the specified part.

The function returns the extracted part as a numeric value.

Example

Here’s a simple example that illustrates how it works:

SELECT extract( 'year' FROM DATE '2025-07-18') AS 'year';

Result:

+------+
| year |
+------+
| 2025 |
+------+

So we pass the date part that we want to extract, followed by the FROM keyword, followed by the date itself.

Here it is with some more date parts specified:

SELECT 
    extract( 'day' FROM DATE '2025-07-18' ) AS 'day',
    extract( 'dayofweek' FROM DATE '2025-07-18') AS 'dayofweek',
    extract( 'dayofmonth' FROM DATE '2025-07-18') AS 'dayofmonth',
    extract( 'dayofyear' FROM DATE '2025-07-18') AS 'dayofyear',
    extract( 'month' FROM DATE '2025-07-18') AS 'month',
    extract( 'year' FROM DATE '2025-07-18') AS 'year',
    extract( 'century' FROM DATE '2025-07-18') AS 'century';

Result:

+-----+-----------+------------+-----------+-------+------+---------+
| day | dayofweek | dayofmonth | dayofyear | month | year | century |
+-----+-----------+------------+-----------+-------+------+---------+
| 18 | 5 | 18 | 199 | 7 | 2025 | 21 |
+-----+-----------+------------+-----------+-------+------+---------+

Compared to date_part() and datepart()

DuckDB also has a date_part() function, as well as its synonym datepart(). These functions do the same thing that extract() does, but with a slightly different syntax. Here they all are, side by side:

SELECT 
    extract( 'year' FROM DATE '2025-07-18') AS 'extract',
    date_part( 'year', DATE '2025-07-18') AS 'date_part',
    datepart( 'year', DATE '2025-07-18') AS 'datepart';

Result:

+---------+-----------+----------+
| extract | date_part | datepart |
+---------+-----------+----------+
| 2025 | 2025 | 2025 |
+---------+-----------+----------+

Both date_part()and datepart() require a comma instead of the FROM keyword. Other than that, everything else is pretty much the same.

Timestamp Values

The previous examples use a DATE value, but we can also use a TIMESTAMP value. We can use it to get the date parts or the time parts:

SELECT 
    extract( 'year' FROM TIMESTAMP '2025-07-18 18:09:39.123456' ) AS 'year',
    extract( 'hour' FROM TIMESTAMP '2025-07-18 18:09:39.123456') AS 'hour',
    extract( 'minute' FROM TIMESTAMP '2025-07-18 18:09:39.123456') AS 'minute',
    extract( 'second' FROM TIMESTAMP '2025-07-18 18:09:39.123456') AS 'second',
    extract( 'millisecond' FROM TIMESTAMP '2025-07-18 18:09:39.123456') AS 'millisecond',
    extract( 'microsecond' FROM TIMESTAMP '2025-07-18 18:09:39.123456') AS 'microsecond';

Result:

+------+------+--------+--------+-------------+-------------+
| year | hour | minute | second | millisecond | microsecond |
+------+------+--------+--------+-------------+-------------+
| 2025 | 18 | 9 | 39 | 39123 | 39123456 |
+------+------+--------+--------+-------------+-------------+

Interval Values

We can also use the extract() function on intervals:

SELECT 
    extract( 'year' FROM INTERVAL '10 years 5 months 4 days 3 hours 45 minutes') AS 'years',
    extract( 'months' FROM INTERVAL '10 years 5 months 4 days 3 hours 45 minutes') AS 'months',
    extract( 'days' FROM INTERVAL '10 years 5 months 4 days 3 hours 45 minutes') AS 'days',
    extract( 'hours' FROM INTERVAL '10 years 5 months 4 days 3 hours 45 minutes') AS 'hours',
    extract( 'minutes' FROM INTERVAL '10 years 5 months 4 days 3 hours 45 minutes') AS 'minutes';

Result:

+-------+--------+------+-------+---------+
| years | months | days | hours | minutes |
+-------+--------+------+-------+---------+
| 10 | 5 | 4 | 3 | 45 |
+-------+--------+------+-------+---------+

There are quite a few date specifiers that can be used with the extract() function, but not all of them can be used on interval values. See the table later in this article for a list of specifiers that can be used with intervals.

Database Examples

Suppose we have the following table and data:

-- Create the 'sales' table
CREATE TABLE sales (
    sale_id INTEGER PRIMARY KEY,
    sale_date TIMESTAMP,
    amount DECIMAL(10, 2)
);

-- Insert sample data into the 'sales' table
INSERT INTO sales (sale_id, sale_date, amount) VALUES
(1, '2024-01-15 09:30:45', 100.50),
(2, '2024-01-20 14:15:10', 200.75),
(3, '2024-01-25 11:45:30', 150.00),
(4, '2024-02-10 16:20:55', 300.25),
(5, '2024-02-12 08:05:20', 250.60),
(6, '2024-03-05 12:35:40', 175.90),
(7, '2024-03-18 18:50:15', 400.00),
(8, '2024-03-22 10:10:10', 225.30),
(9, '2024-04-01 13:25:50', 275.45),
(10, '2024-04-05 20:40:05', 500.75),
(11, '2024-04-10 07:55:25', 125.80),
(12, '2024-05-03 22:00:00', 350.90);

-- Query to verify the data
SELECT * FROM sales;

Output:

+---------+---------------------+--------+
| sale_id | sale_date | amount |
+---------+---------------------+--------+
| 1 | 2024-01-15 09:30:45 | 100.50 |
| 2 | 2024-01-20 14:15:10 | 200.75 |
| 3 | 2024-01-25 11:45:30 | 150.00 |
| 4 | 2024-02-10 16:20:55 | 300.25 |
| 5 | 2024-02-12 08:05:20 | 250.60 |
| 6 | 2024-03-05 12:35:40 | 175.90 |
| 7 | 2024-03-18 18:50:15 | 400.00 |
| 8 | 2024-03-22 10:10:10 | 225.30 |
| 9 | 2024-04-01 13:25:50 | 275.45 |
| 10 | 2024-04-05 20:40:05 | 500.75 |
| 11 | 2024-04-10 07:55:25 | 125.80 |
| 12 | 2024-05-03 22:00:00 | 350.90 |
+---------+---------------------+--------+

Example 1: Extract the Month from a Date

Let’s start by extracting the month from the sale_date column:

SELECT 
    sale_id,
    extract('month' FROM sale_date) AS sale_month
FROM sales;

Output:

+---------+------------+
| sale_id | sale_month |
+---------+------------+
| 1 | 1 |
| 2 | 1 |
| 3 | 1 |
| 4 | 2 |
| 5 | 2 |
| 6 | 3 |
| 7 | 3 |
| 8 | 3 |
| 9 | 4 |
| 10 | 4 |
| 11 | 4 |
| 12 | 5 |
+---------+------------+

Example 2: Filter Data by Month

If we want to filter sales that occurred in a specific month, such as April (month 4), we can use the extract() function in the WHERE clause:

SELECT *
FROM sales
WHERE extract('month' FROM sale_date) = 4;

Output:

+---------+---------------------+--------+
| sale_id | sale_date | amount |
+---------+---------------------+--------+
| 9 | 2024-04-01 13:25:50 | 275.45 |
| 10 | 2024-04-05 20:40:05 | 500.75 |
| 11 | 2024-04-10 07:55:25 | 125.80 |
+---------+---------------------+--------+

Example 3: Group the Sales by Month

In this example we use the GROUP BY clause to group the sales by month:

SELECT
    extract('month' FROM sale_date) AS sale_month,
    SUM(amount) AS total_sales
FROM sales
GROUP BY sale_month
ORDER BY sale_month;

Output:

+------------+-------------+
| sale_month | total_sales |
+------------+-------------+
| 1 | 451.25 |
| 2 | 550.85 |
| 3 | 801.20 |
| 4 | 902.00 |
| 5 | 350.90 |
+------------+-------------+

In this case we also used the SUM() function to tally up the sales for the respective month.

More Functionality with strftime()

We’ve already seen that the date_part() and datepart() functions can do exactly what extract() does. But DuckDB also provides other functions that can do the same thing, and more in some cases.

In particular, the strftime() function can be used to format a whole date into a specified format. It returns its output as a string, and so we can have string characters in the result. This can be handy if we want to get the day or month name from the date.

Let’s use strftime() in order to get the month name instead of the month number:

SELECT
    strftime(sale_date, '%B') AS sale_month,
    SUM(amount) AS total_sales
FROM sales
GROUP BY sale_month
ORDER BY sale_month;

Result:

+------------+-------------+
| sale_month | total_sales |
+------------+-------------+
| April | 902.00 |
| February | 550.85 |
| January | 451.25 |
| March | 801.20 |
| May | 350.90 |
+------------+-------------+

Bear in mind that doing this changes the ordering of the results. Ordering by month name will not be the same as ordering by month number. So that’s something to be mindful of.

See Formatting Dates with strftime() for more information and examples.

Supported Date Parts

The two tables below contain the various date parts that can be specified when using the extract() function. Some of these can be used with both dates and intervals, while others can only be used with dates.

Date Part Specifiers and in Intervals

The following specifiers can be used in dates/timestamps and also with intervals:

SpecifierDescriptionSynonymsExample
centuryGregorian centurycentcenturiesc21
dayGregorian daydaysddayofmonth3
decadeGregorian decadedecdecadesdecs202
hourHourshrhourshrsh11
microsecondsSub-minute microsecondsmicrosecondususecusecsuseconduseconds44123456
millenniumGregorian millenniummilmillenniumsmilleniamilsmillenium3
millisecondsSub-minute millisecondsmillisecondmsmsecmsecsmsecondmseconds44123
minuteMinutesminminutesminsm59
monthGregorian monthmonmonthsmons8
quarterQuarter of the year (1-4)quarters3
secondSecondssecsecondssecss44
yearGregorian yearyryyearsyrs2021

Date Part Specifiers Only

The following specifiers can only be used with date/time values (i.e. they can’t be used with intervals):

SpecifierDescriptionSynonymsExample
dayofweekDay of the week (Sunday = 0, Saturday = 6)weekdaydow2
dayofyearDay of the year (1-365/366)doy215
epochSeconds since 1970-01-01 1627991984
eraGregorian era (CE/AD, BCE/BC) 1
isodowISO day of the week (Monday = 1, Sunday = 7) 2
isoyearISO Year number (Starts on Monday of week containing Jan 4th) 2021
timezone_hourTime zone offset hour portion 0
timezone_minuteTime zone offset minute portion 0
timezoneTime zone offset in seconds 0
weekWeek numberweeksw31
yearweekISO year and week number in YYYYWW format 202131

The above tables are sourced from the DuckDB documentation.

If you can’t find a specifier for the job you need to perform, see the list of format specifiers that can be used with the strftime() function. These include format specifiers that do things like, return the day or month name, return a zero padded day or month number, etc.