
I want to understand the performance implications of elementwise transformations on rolling window aggregation. Consider the following two versions of a rolling aggregation (of floating values):

I)

X = frame.rolling(index_column="date", group_by="group", period="360d").agg(
    pl.col("value").sin().sum().alias("sin(value)"),
    pl.col("value").cos().sum().alias("cos(value)"),
    pl.col("value").sum()
)

II)

Y = frame.with_columns(
    pl.col("value").sin().alias("sin(value)"),
    pl.col("value").cos().alias("cos(value)")
).rolling(index_column="date", group_by="group", period="360d").agg(
    pl.col("sin(value)").sum(),
    pl.col("cos(value)").sum(),
    pl.col("value").sum())

Naively I'd expect the second version to be universally faster than the first, since by design it avoids redundantly recomputing sin(value) and cos(value) for every window (and group).
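To make that intuition concrete, here is a back-of-the-envelope count of elementwise calls (a sketch with made-up numbers; `naive_sin_calls` and the average-window-size parameter are my own illustration, not part of the experiment):

```python
def naive_sin_calls(n_rows: int, avg_window: int) -> tuple[int, int]:
    # Version I: if sin were truly re-evaluated inside every window,
    # each of the n_rows windows would apply it to ~avg_window rows.
    version1 = n_rows * avg_window
    # Version II: sin is materialized once per row up front.
    version2 = n_rows
    return version1, version2

print(naive_sin_calls(10_000, 360))  # (3600000, 10000)
```

Under this naive model version II should do orders of magnitude less elementwise work, which is why the near-identical timings are surprising.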

I was, however, surprised to find that both versions have almost identical runtimes across different sizes of the group and time dimensions. How is that possible? Is Polars automagically pushing the elementwise transformations (sin and cos) out of the rolling window aggregation?

In addition, for a large number of dates the second version can even be slower than the first, cf. the image below.

Can anyone help me understand what is going on here?

Full code for the experiment is below:

import datetime
import itertools
import time

import numpy as np
import polars as pl
import polars.testing

def run_experiment():
    start = datetime.date.fromisoformat("1991-01-01")
    result = {"num_dates": [], "num_groups": [], "version1": [], "version2": [], }
    for n_dates in [1000, 2000, 5000, 10000]:
        end = start + datetime.timedelta(days=(n_dates - 1))
        dates = pl.date_range(start, end, eager=True)
        for m_groups in [10, 20, 50, 100, 200, 500, 1000]:
            groups = [f"g_{i + 1}" for i in range(m_groups)]
            groups_, dates_ = list(zip(*itertools.product(groups, dates)))

            frame = pl.from_dict({"group": groups_, "date": dates_, "value": np.random.rand(n_dates * m_groups)})

            t0 = time.time()
            X = frame.rolling(index_column="date", group_by="group", period="360d").agg(
                pl.col("value").sin().sum().alias("sin(value)"),
                pl.col("value").cos().sum().alias("cos(value)"),
                pl.col("value").sum()
            )
            t1 = time.time() - t0

            t0 = time.time()
            Y = frame.with_columns(
                pl.col("value").sin().alias("sin(value)"),
                pl.col("value").cos().alias("cos(value)")
            ).rolling(index_column="date", group_by="group", period="360d").agg(
                pl.col("sin(value)").sum(),
                pl.col("cos(value)").sum(),
                pl.col("value").sum()
            )
            t2 = time.time() - t0
            polars.testing.assert_frame_equal(X, Y)

            result["num_dates"].append(n_dates)
            result["num_groups"].append(m_groups)
            result["version1"].append(t1)
            result["version2"].append(t2)

    return pl.from_dict(result)
