python - What's the fastest way of skipping tuples with a certain structure in a itertool product? - Stack Overflow

IT技术

更新时间：2025-02-024

admin管理员组
文章数量:1295066

I have to process a huge number of tuples made by k integers, each ranging from 1 to Max_k.

Each Max can be different. I need to skip the tuples where an element has reached is max value, in that case keeping only the tuple with "1" in the remaining position. The max is enforced by design, so it cannot be that some item is > of its max For example, if the max of the second element of a triple is 4, i need to keep (1,4,1) but skip (1,4,2) , (1,4,3) ... (2,4,1) etc.

I am pretty sure I am missing a much faster way to do that. My typical scenario is tuples with 16 to 20 elements, with maxes in the 50-70 mark. What would be the recommended approach ?

In Python, as a toy example with hardcoded Maxes (5,4,2), is the following:

from itertools import *

def filter_logic(y):
    if y[0]==5:
        if y[1] > 1 or y[2] >1:
            return True
    if y[1]==4:
        if y[0] > 1 or y[2] >1:
            return True
    if y[2]==2:
        if y[0] > 1 or y[1] >1:
            return True
    return False
   
def tuples_all(max_list):
    my_iterables = []
    for limit in max_list:
        my_iterables.append(range(1, limit+1))
    return product(*my_iterables)

def tuples_filtered(max_list):
    return filterfalse(filter_logic, tuples_all(max_list))

max_list = [5,4,2]

print("Original list")
for res in tuples_all(max_list):
    print(res)

print("After filtering")
for fil in tuples_filtered(max_list):
    print(fil)

Output of the filtered tuples:

After filtering
(1, 1, 1)
(1, 1, 2)
(1, 2, 1)
(1, 3, 1)
(1, 4, 1)
(2, 1, 1)
(2, 2, 1)
(2, 3, 1)
(3, 1, 1)
(3, 2, 1)
(3, 3, 1)
(4, 1, 1)
(4, 2, 1)
(4, 3, 1)
(5, 1, 1)

I have to process a huge number of tuples made by k integers, each ranging from 1 to Max_k.

I am pretty sure I am missing a much faster way to do that. My typical scenario is tuples with 16 to 20 elements, with maxes in the 50-70 mark. What would be the recommended approach ?

In Python, as a toy example with hardcoded Maxes (5,4,2), is the following:

from itertools import *

def filter_logic(y):
    if y[0]==5:
        if y[1] > 1 or y[2] >1:
            return True
    if y[1]==4:
        if y[0] > 1 or y[2] >1:
            return True
    if y[2]==2:
        if y[0] > 1 or y[1] >1:
            return True
    return False
   
def tuples_all(max_list):
    my_iterables = []
    for limit in max_list:
        my_iterables.append(range(1, limit+1))
    return product(*my_iterables)

def tuples_filtered(max_list):
    return filterfalse(filter_logic, tuples_all(max_list))

max_list = [5,4,2]

print("Original list")
for res in tuples_all(max_list):
    print(res)

print("After filtering")
for fil in tuples_filtered(max_list):
    print(fil)

Output of the filtered tuples:

After filtering
(1, 1, 1)
(1, 1, 2)
(1, 2, 1)
(1, 3, 1)
(1, 4, 1)
(2, 1, 1)
(2, 2, 1)
(2, 3, 1)
(3, 1, 1)
(3, 2, 1)
(3, 3, 1)
(4, 1, 1)
(4, 2, 1)
(4, 3, 1)
(5, 1, 1)

Share Improve this question edited Jan 25 at 2:29 no comment 10.1k5 gold badges20 silver badges40 bronze badges asked Jan 23 at 16:26 user58327 212 bronze badges

1 If you have 16 elements all we with max 50, aren't you keeping all tuples with values 1 to 49? That's over 10^27. Infeasible. – no comment Commented Jan 23 at 16:36
Can you clarify the scenario? If an element at index n in the tuple has reached its max AND any element at index m>n is greater than 1 - then skip the entire tuple? And the tuple can have k integers to check? How do we know what the max value at any given index is? – Vegard Commented Jan 23 at 16:38
Shouldn't == be >=? – Barmar Commented Jan 23 at 17:58
I added a working example. – user58327 Commented Jan 24 at 12:25
If an element has reached it max, "all" other elements in the list are only allowed to be 1. The tuple is built in a way that the a element cannot be more than its max. Each element can range from 1 to its max inclusive. All elements are integer. I don't need to exhaust all the list, but only to find a tuple that once inserted in a further function will give a result higher than a certain High score number. – user58327 Commented Jan 24 at 12:33

| Show 10 more comments

2 Answers 2

Sorted by: Reset to default 0

Since you commented that order between tuples isn't important, we can simply produce the tuples with max value and then the tuples without max value:

from itertools import *


def tuples_direct(max_list):
    n = len(max_list)

    # Special case
    if 1 in max_list:
        yield (1,) * n
        return

    # Tuples with a max.
    for i, m in enumerate(max_list):
        yield (1,) * i + (m,) + (1,) * (n-i-1)

    # Tuples without a max.
    yield from product(*(range(1, m) for m in max_list))


max_list = [5,4,2]
for tup in tuples_direct(max_list):
    print(tup)

Output (Attempt This Online!):

(5, 1, 1)
(1, 4, 1)
(1, 1, 2)
(1, 1, 1)
(1, 2, 1)
(1, 3, 1)
(2, 1, 1)
(2, 2, 1)
(2, 3, 1)
(3, 1, 1)
(3, 2, 1)
(3, 3, 1)
(4, 1, 1)
(4, 2, 1)
(4, 3, 1)

Assumptions about the rules for discarding a tuple:

Any value >= the max value for its index, AND
Any other value (meaning not the value that satisfied #1) >= 1

Two or more values >= the max value for their respective indices

such that if either rule A or rule B is hit, the tuple is tossed.

If this ruleset describes your problem, the solution is quite managable. From my basic testing, it's relatively performant even in native python - but I've also included a numpy solution which should hypothetically be orders of magnitude faster if you have so much data data that native python is noticeably slow.

import itertools
import numpy as np

def should_discard(values, max_values):
    any_max_val    = False
    any_gt_one_val = False

    for val, max_val in zip(values, max_values): # zip truncates automatically
        #print(f"Val: {val} < {max_val}: {max_val >= val}") # Manual inspection
        if val >= max_val:
            if any_max_val:
                any_gt_one_val = True
            any_max_val = True
        elif val > 1:
            any_gt_one_val = True

        if any_max_val and any_gt_one_val: 
            return True

    return False

def should_discard_numpy(values, max_values):
    values = np.array(values)
    max_values = np.array(max_values[:len(values)])  # Truncate max_values

    ge_max_arr = values >= max_values

    if np.sum(ge_max_arr) >= 2:
        return True

    any_ge_max = np.any(ge_max_arr)
    any_gt_one = np.any((values > 1) & ~ge_max_arr)  # A.2 - compare for >1 but exclude the val(s) that hit A.1

    if any_ge_max and any_gt_one:
        return True

    return False

def filter_tuples(func, tuples, max_values):
    return itertools.filterfalse(lambda t: func(t, max_values), tuples)

vals = [
    (1, 4, 1), # Keep
    (1, 4, 2), # Discard - rule A
    (2, 4, 1), # Discard - rule A
    (0, 4, 1), # Keep
    (1, 3, 3), # Discard - rule A
    (2, 1, 3), # Discard - rule A OR rule B
]
max_vals = (2, 4, 3)

vals2 = [
    (1, 2, 1, 5, 3, 1, 6, 1, 0, 1, 1, 1, 1, 2, 0, 1, 0, 0, 0, 7), # Discard - rule A
    (2, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1), # Discard - rule B
    (0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 5), # Keep
    (1, 1, 0, 4, 3, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1), # Keep
    (1, 1, 0, 4, 3, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 1), # Discard - rule A
]
max_vals2 = (2, 2, 2, 5, 4, 1, 6, 1, 1, 1, 2, 1, 1, 2, 1, 1, 1, 2, 1, 7)

print("Native python:", list(filter_tuples(should_discard,       vals,  max_vals)))
print("Numpy        :", list(filter_tuples(should_discard_numpy, vals,  max_vals)))
print("Native python:", list(filter_tuples(should_discard,       vals2, max_vals2)))
print("Numpy        :", list(filter_tuples(should_discard_numpy, vals2, max_vals2)))

本文标签：

版权声明：本文标题：python - What's the fastest way of skipping tuples with a certain structure in a itertool product? - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1738481974a2089195.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

发表评论

全部评论 0

暂无评论

编程频道|软件玩家 - 软件改变生活！

python - What&#39;s the fastest way of skipping tuples with a certain structure in a itertool product? - Stack Overflow

2 Answers 2

更多相关文章

node.js - How to log an error&#39;s message without stack trace in JavaScript? - Stack Overflow

javascript - what is CSRF check failed when going on a website which doesn&#39;t require login? - Stack Overflow

javascript - putting php codes on onclick event of a button - Stack Overflow

c# - Multi tenancy support for Single database shared schema - Stack Overflow

Return numbers which appear only once (JavaScript) - Stack Overflow

php - how to pass checkbox value to javascript function - Stack Overflow

url rewriting - Remove wp-admin from the URL

javascript - Displaying Datepicker when i click the button - Stack Overflow

javascript - Implementing and working with ace for the first time - Stack Overflow

javascript - Block scroll when Lightbox appears - Stack Overflow

visual studio code - VSCode Misinterprets SASS Interpolation Syntax with `@apply`, Showing False Validation Errors - Stack Overf

javascript - google maps not applying style - Stack Overflow

Download PDF from javascript blob, without replacing the open page - Stack Overflow

java - What is the minimum configuration to define a REST service on Apache Camel running on Spring Boot? - Stack Overflow

wp insert post - wp_insert_post() crashing website

javascript - What&#39;s the best solution for storing a users id? - Stack Overflow

javascript - CasperJS : Why does my url change to about:blank when my page is loaded? - Stack Overflow

javascript - How to handle null response in fetch api - Stack Overflow

block editor - Gutenberg element: How to usw Rangecontrol with two values to imitate a rangeslider?

reactjs - Failed to fetch loading &quot;GET&quot; in my Next.js 15 app - Stack Overflow

发表评论

推荐文章

javascript - Istanbul gives me coverage but ends output with an error - Stack Overflow

mysql - Laravel Database Connections Not Closing with Apache2 on AWS ALB (EC2 Target Group) - Stack Overflow

multithreading - Managing Worker Thread Wake-Up in C++: Separate vs. Shared Condition Variables? - Stack Overflow

javascript - Cache mapbox Tile images - Stack Overflow

javascript - $().countdown is not a function - Stack Overflow

热门文章

Why does inspect identify decorated methods as functions instead of methods in Python? - Stack Overflow

sql server - Default parameter in SQL where clause - Stack Overflow

google analytics - VIEW_ITEM event is not triggered on Android devices - Stack Overflow

javascript - jquery datepicker not working in IE7 and IE8 - Stack Overflow

PHPJavascript Session Timeout with warning - Stack Overflow

Correct way to expand custom WordPress plugin functions

javascript - Refetching content with nuxt content using $fetch or useFetch with watch - Stack Overflow

javascript - How can I reduce excess space on the left in react-native-pickerpicker? - Stack Overflow

Delete all users with Editor role and their content mysql

Why can I log into wp-login.php and not wp-admin.php?

最新文章

Win7各正式版下载地址和SHA验证

怎么样把中文版的Windows7改成英文版的Windows7

Win7系统笔记本蓝牙打开指南：详细步骤助你轻松连接

win7开机弹计算机,win7开机弹出Windows Installer窗口的解决方法

windows7虚拟机安装vmtools方法

javascript - Why can&#39;t I use jQuery to fire an AJAX request from an unload event handler? - Stack Overflow

object - Javascript constructor function to count the number of instances - Stack Overflow

php - Xero API PKCE giving Invalid_client error - Stack Overflow

javascript - TypeScript Array&lt;T&gt; inheritance - Stack Overflow

conditional tags - How to determine whether we are in add New pagepostCPT or in edit pagepostCPT in wordpress admin?

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

python - What's the fastest way of skipping tuples with a certain structure in a itertool product? - Stack Overflow

node.js - How to log an error's message without stack trace in JavaScript? - Stack Overflow

javascript - what is CSRF check failed when going on a website which doesn't require login? - Stack Overflow

javascript - What's the best solution for storing a users id? - Stack Overflow

reactjs - Failed to fetch loading "GET" in my Next.js 15 app - Stack Overflow

javascript - Why can't I use jQuery to fire an AJAX request from an unload event handler? - Stack Overflow

javascript - TypeScript Array<T> inheritance - Stack Overflow