Module 6: Window Operations•20 min

Comparing Adjacent Rows

Progress Tracking

Accessing Previous Rows with .shift()

Python

df = orders.sort_values("order_date")
df["prev_date"] = df["order_date"].shift(1)
df[["order_date", "prev_date"]].head()

Table: amazon_transactions

id	user_id	item	created_at	revenue
1	109	milk	2020-03-03	123
2	139	biscuit	2020-03-18	421
3	120	milk	2020-03-18	176
4	108	banana	2020-03-18	862
5	130	milk	2020-03-28	333

Tables: amazon_transactions

Sort First!

.shift() operates on row position. If your data isn’t sorted, you’ll compare to the wrong row. Always .sort_values() before .shift().

Now extend the pattern: calculate the gap, then use it to flag interesting rows.

Tables: amazon_transactions

Tables: amazon_transactions

Table: sf_transactions

id	created_at	value	purchase_id
1	2019-01-01	172692	43
2	2019-01-05	177194	36
3	2019-01-09	109513	30
4	2019-01-13	164911	30
5	2019-01-17	198872	39

Tables: sf_transactions

.shift(1) = LAG (previous row); .shift(-1) = LEAD (next row).
Always sort before shifting — position-based, not value-based.
Use within .groupby() to shift within groups: df.groupby("col")["val"].shift(1).
First row in each group gets NaN (no previous row exists).
Subtract shifted values for period-over-period change.

Next: running totals, cumulative counts, and percentage-of-total calculations using .cumsum(), .expanding(), and .rolling().