Published: August 24, 2023·Updated: May 8, 2025

Categories:

Written by:
Nathan Rosidi

Showing you what the cumulative sum is and how to calculate it in SQL. We’ll go through three distinct methods, so you can use whichever method you like best.

Cumulative sum is one of the common places in data analysis. SQL is also one of the common tools in extracting data from databases and analyzing it. From this, it follows that calculating cumulative sum is common in SQL. Yes, it is. We can confirm that!

Luckily, SQL allows you to calculate the cumulative sum relatively easily. Unfortunately for you, it’s not that easy if you don’t know how!

We’ll guide you through the logic of calculating the cumulative sum and how to apply this logic to SQL. Then we’ll go through several ways of computing the cumulative sum in SQL and show you how it’s done on one of our interview questions.

Let’s start with the foundations.

What is the Cumulative Sum in SQL?

A cumulative sum is calculated by adding all previous values in a sequence to the current value. The sequence is usually a date or time, so the cumulative sum gives you a sum at a certain time.

In SQL, this means accessing all the previous rows, summing them, and adding the sum to the current row’s value.

Imagine you’re working with the table showing the daily sales.

Example to understand what is Cumulative Sum in SQL

The cumulative sum for January 1 is 4,578.00, i.e., the same as the sales value for that date. The reason is there are no dates before that.

For January 2, the cumulative sum is the sum of January 1 and January 2 sales: 4,578.00 + 6,486.00 = 11,064.00.

You follow the same way until you reach the end of the table, as shown below.

Understanding the Importance of Cumulative Sums in Data Analysis

SQL cumulative sums play an essential role in data analysis. It allows you to track the accumulation of values over time.

This is helpful when wanting to identify trends and monitor growth or decline for various values important to businesses.

Based on that, you can also build different analytical views and data visualization. That way, you’re helping businesses make better decisions.

The cumulative sums also contribute to the creation of different analytical views of data and can provide insights for business decisions.

Step-by-Step Guide: Performing Cumulative Sum Calculations in SQL

There are several methods for calculating cumulative sums in SQL. We'll go through the three most common techniques.

Cumulative Sum Using Self Join

The self join method involves joining a table with itself. When doing that, you’re giving the table two different aliases. That way, you’re able to join the table with itself like it’s two different tables.

It’s also important to note that self join is not a special type of join – any type of join in SQL can be used for self joining the table.

We won’t go any further into explaining self joins, as you can learn more about it in our illustrated guide to self joins.

We’ll focus here on how to use self join to calculate the cumulative sum in SQL.

Before going into an actual example, we’ll show you a generic syntax in three SQL dialects: PostgreSQL, MySQL, and SQL Server.

Syntax of Cumulative Sum in SQL: Self Join Method

When using self join to calculate the cumulative sum, the syntax is the same in all three dialects.

Here’s the approach when you’re absolutely sure that dates are unique.

SELECT a.date_column,
       a.value_column,
       SUM(b.value_column) AS cumulative_sum
FROM table_name a
JOIN table_name b 
ON a.ordering_column >= b.ordering_column
GROUP BY a.date_column, a.value_column
ORDER BY a.date_column;

However, to make your code more robust, include a unique row identifier, e.g., the ID column. Otherwise, if you have multiple entries per date, the code will collapse multiple rows into one per date-value combination and give you wrong cumulative sums.

SELECT a.row_id,
 a.date_column,
       a.value_column,
       SUM(b.value_column) AS cumulative_sum
FROM table_name a
JOIN table_name b 
ON a.ordering_column >= b.ordering_column
GROUP BY a.row_id, a.date_column, a.value_column
ORDER BY a.date_column;

Example

We’ll use the question from Meta/Facebook to show you how this method works. However, we’ll change the question requirements.

Instead of calculating the cumulative sum over all three continents, we’ll do that only for Europe.

Cum Sum Energy Consumption

Last Updated: April 2020

Cumulative Sum Using Correlated Subquery

A subquery in SQL is a type of query that is written inside the other query. That’s why the subquery is also called an inner or nested query, and the query in which it is embedded is called an outer or main query.

A correlated subquery is a special type of subquery. Its specialness lies in the fact that it relies on the result returned by the main query. Also, it means that the subquery is evaluated repeatedly, once for each row returned by the main query.

Syntax of Cumulative Sum in SQL: Correlated Subquery Method

The syntax is again the same in all three SQL dialects.

This is the approach when you have unique dates in the table.

SELECT a.date_column,
       a.value_column,
      (SELECT SUM(b.value_column)
       FROM table_name b
       WHERE a.ordering_column >= b.ordering_column
      ) AS cumulative_sum
FROM table_name a
ORDER BY a.date_column;

The more robust version for multiple values per date is this.

SELECT a.row_id,
       a.date_column,
       a.value_column,
      (SELECT SUM(b.value_column)
       FROM table_name b
       WHERE a.ordering_column >= b.ordering_column
      ) AS cumulative_sum
FROM table_name a
ORDER BY a.date_column;

Following that logic, the query from the previous example can be rewritten like this, so it uses the correlated subquery.

SELECT eu1.recorded_date,
       eu1.consumption,
      (SELECT SUM(eu2.consumption)
       FROM fb_eu_energy eu2
       WHERE eu1.recorded_date >= eu2.recorded_date
      ) AS cumulative_consumption
FROM fb_eu_energy eu1
ORDER BY eu1.recorded_date;

The logic is similar as earlier, only this time we don’t use self join. However, we still use the same table twice.

The first time, it’s in the main query.

The second time, it’s in a subquery. This subquery uses SUM() to calculate the consumption sum. Then, WHERE uses the same condition we used in the ON clause in the previous example. Same as there, it looks for the dates in one table that are equal to or greater than the dates in the second table. This gives us cumulative, not the ‘regular’ sum.

Now that you know the logic of using a correlated subquery to get a cumulative sum, let’s try to solve the question without changing the requirements.

Example

It’s the same question from Meta/Facebook, only we’ll now do everything it asks.

Cum Sum Energy Consumption

Last Updated: April 2020

Cumulative Sum Using OVER and ORDER BY CLAUSE

The easiest and, at the same time, the most complex way of calculating cumulative sum is by using the window function.

It’s the easiest because the code will have fewer lines. It’s the most complex because it requires, well, knowing window functions – they are hard before you learn them. Isn’t that the case with everything?

The window functions are functions that are applied to the rows that are somehow related to the current row. We call these rows a window, hence the window functions. If you feel your knowledge of the window functions is lacking, please read more in The Ultimate Guide to SQL Window Functions.

One of the important characteristics of the window functions is that they don’t collapse the individual rows when aggregating data. In other words, it allows us to show the individual and aggregated values at the same time. This important feature makes calculating cumulative sum much more efficient than in the previous examples.

We’ll use the SUM() aggregate window functions for our calculations. Yes, it sums values, just like a regular aggregate function. But with the window function features, it gets much more possibilities.

Syntax of Cumulative Sum in SQL: Using OVER and ORDER BY

The two important clauses in window functions in general are OVER() and ORDER BY.

OVER() is a mandatory clause that creates a window function. In our case, the regular aggregate SUM() function now becomes the aggregate window function, thus being given more features.

ORDER BY is considered an optional clause. However, in our calculation of the cumulative sum, it is mandatory in a way. It will sort rows in the window frame – a set of rows related to the current row – or partitions, if they are defined. This sounds similar to ORDER BY in SELECT, but it’s not; it’s used for sorting the output.

Here’s the cumulative sum syntax using OVER and ORDER BY, i.e., a window function.

SELECT a.date_column,
       a.value_column,
       SUM(a.value_column) OVER (ORDER BY a.ordering_column) AS cumulative_sum
FROM table_name a
ORDER BY a.date_column;

ORDER BY is not necessary, as databases will typically emit the rows in the same order they processed them. However, this is not guaranteed by the SQL standard, so it’s better to always include that ORDER BY in the last code line and guarantee your output is sorted ascendingly by date. Better safe than sorry!

In case there are duplicate dates in your data and there’s a unique row identifier, use this more robust approach.

SELECT a.row_id,
       a.date_column,
       a.value_column,
       SUM(a.value_column) OVER (ORDER BY a.ordering_column) AS cumulative_sum
FROM table_name a
ORDER BY a.date_column;

Note: Depending on your task, you might also need to use PARTITION BY, another interesting clause in window functions. It is used when you want to partition a window into smaller groups. For example, if you had a table showing consumption in Europe and several of its cities, PARTITION BY would allow you to calculate the cumulative sum for each city, not only for the whole continent. We have a nice explanation of PARTITION BY in our SQL cheat sheet.

Now, using the syntax above, we can go back to our interview questions and calculate the cumulative sum in Europe like this.

SELECT recorded_date, 
       consumption,
       SUM(consumption) OVER (ORDER BY recorded_date ASC) AS cumulative_consumption
FROM fb_eu_energy
ORDER BY recorded_date;

Instead of self joining or writing a subquery, we calculate the cumulative consumption much easier this way. So, we apply the SUM() function to the consumption column. Then we use the OVER() clause to turn it into a window function.

In OVER(), we use ORDER BY so that the cumulative sum is calculated from the oldest to the latest date. Which is exactly how it should be done. Following earlier advice, we also add ORDER BY at the end of the query. It’s to make sure the output, too, is sorted ascendingly by date.

Example

We’ll use the same Meta/Facebook interview question as earlier and solve it as it is.

Cum Sum Energy Consumption

Last Updated: April 2020

Optimizing Performance: Tips and Tricks for Efficient Cumulative Sum Computations in SQL

For large datasets, the performance of SQL cumulative sum calculations can become a concern. Here are some tips:

Use window functions: As shown above, window functions tend to be more efficient than self joins or subqueries.
Index your data: Properly indexing your data can speed up the computations.
Break down complex queries: If your cumulative sum involves complex conditions, breaking it down into smaller parts can help optimize the process.

Real-World Use Cases and Applications of Cumulative Sums in SQL

SQL cumulative sum is extensively used in various fields for different types of data analysis. Here are some real-world applications.

1. Finance and Accounting – cumulative sums of transactions, sales, costs, or other financial data over a certain period. They can help identify trends, such as sales growth or decline, over time. Depending on the decision, these trends can be monitored, for example, daily, monthly, or annually to check the performance and adjust strategies.

2. Inventory Management – tracking the quantity of inventory over time. By recording each addition or subtraction from inventory, a running total provides the current inventory level at any point in time.

3. Performance Monitoring – tracking the total number of errors logged in a system, the total downtime of a service over a period of time, or the number of hours worked. This can help in identifying patterns and making improvements.

4. Sports and Games – tracking the total points scored by a player or team over a season or career or ranking the players based on their cumulative scores.

5. Healthcare – the total number of cases of a disease over time in medical research or public health monitoring. It can be crucial in identifying and responding to outbreaks. Or counting the number of people infected by COVID-19; that was not that long ago.

6. Telecommunications – for calculating the total data used in a certain period, the total minutes a customer spent on calls, or the number of SMS they sent.

7. Website Analytics – for tracking the total number of page views, clicks, or other user interactions over a period.

8. Weather data analysis – used for calculating the cumulative rainfall in a particular region during a certain period, which can be useful in climate studies or planning agricultural activities.

Cumulative Sum Versus Running Total

While cumulative sum and running total might sound similar, they have slightly different applications.

Both calculations will return a cumulative result for a certain period. However, here’s the difference – defining the period.

The cumulative sum shows the cumulative result from the first date until today. As you go on, the time frame increases. For example, if you’re calculating daily cumulative revenue, with each day, your time frame increases by one day.

The running total, however, has a fixed size of a frame. However, the frame moves with each new period. Hence, ‘running’ in running total. In other words, running total looks at a certain time frame in history. For example, a 3-day running total revenue means that the running total will include the revenues of the last two days plus the current day. The next day, it will again include the same number of days. But they won’t be the same days as yesterday, as the time frame moves (not increases, moves!) along with the current date.

You can see how this works in the GIF below.

Yes, the first three running totals will be the same as the cumulative sum. But don’t be mistaken that this GIF is the same as the previous one!

After three days, the running total still keeps taking into account only the last three days, while the cumulative cum includes all the previous days.

Conclusion

The cumulative sum as a mathematical concept is rather simple. However, calculating it in SQL requires some advanced SQL techniques. It can seem complicated because of that. But, with a solid understanding of the language's concepts and the right techniques, you can easily translate the cumulative sum logic to SQL.

There are three distinct ways of calculating the cumulative sum in SQL, and they involve three important SQL topics: (self) joins, (correlated) subqueries, and window functions.

These concepts have a much broader application in SQL overall, so knowing them will help you in many more tasks other than the cumulative sum. Virtually no intermediate or advanced SQL queries can be written without these three concepts. Plenty of our interview questions require this knowledge, so we recommend using them for code practice.

Computing Cumulative Sum in SQL Made Easy

What is the Cumulative Sum in SQL?

Understanding the Importance of Cumulative Sums in Data Analysis

Step-by-Step Guide: Performing Cumulative Sum Calculations in SQL

Cumulative Sum Using Self Join

Syntax of Cumulative Sum in SQL: Self Join Method

Example

Cum Sum Energy Consumption

Cumulative Sum Using Correlated Subquery

Syntax of Cumulative Sum in SQL: Correlated Subquery Method

Example

Cum Sum Energy Consumption

Cumulative Sum Using OVER and ORDER BY CLAUSE

Syntax of Cumulative Sum in SQL: Using OVER and ORDER BY

Example

Cum Sum Energy Consumption

Optimizing Performance: Tips and Tricks for Efficient Cumulative Sum Computations in SQL

Real-World Use Cases and Applications of Cumulative Sums in SQL

Cumulative Sum Versus Running Total

Conclusion

Latest Posts:

Unsupervised Clustering: Methods, Examples, and When to Use

A Guide to Master Machine Learning Modeling from Scratch

How to Create a Bubble Plot with Python and Matplotlib?