filter - Execution of complex filtering procedures in PySpark - Stack Overflow

IT技术

更新时间：2025-04-144

admin管理员组
文章数量:1391991

Currently I'm trying to execute some filtering procedures in PySpark (educational purposes).

I'm new to PySpark, so decided to ask for a help.

My dataframe look like this:

ID     ApplicationDate  Loansum Company Decision
ID1    2020-06-01       100     B       Negative
ID1    2020-06-04       50      M       Positive
ID1    2020-06-05       50      M       Positive

ID1    2020-06-10       10      M       Positive

ID1    2020-06-15       60      B       Negative
ID1    2020-07-15       40      B       Positive
ID1    2020-06-22       20      M       Positive

ID1    2020-07-01       100     B       Negative
ID1    2020-07-02       40      B       Positive
ID1    2020-07-03       70      M       Positive

ID1    2020-08-01       100     B       Negative
ID1    2020-08-01       40      B       Positive
ID1    2020-08-02       100     M       Positive

ID2    2020-10-01       100     B       Negative
ID2    2020-10-04       50      M       Positive
ID2    2020-10-05       50      M       Positive

ID2    2020-10-10       10      M       Positive

ID2    2020-10-15       60      B       Negative
ID2    2020-10-15       40      B       Positive
ID2    2020-10-22       20      M       Positive

ID2    2020-10-01       100     B       Negative
ID2    2020-10-02       40      B       Positive
ID2    2020-10-03       70      M       Positive

My goal is to filter my dataframe is such a way so for each ID I should find and extract all the cases where:

The ApplicationDate between the first Loansum issued by Company "B" and the next nearest Loansums issued by Company "M" should not exceed 5 days;
The Loansums of all "Positive" issued loans should not be 20% more than a Lonasum of a loan with "Negative" Decision.

My expected result:

ID     ApplicationDate  Loansum Company Decision
ID1    2020-06-01       100     B       Negative
ID1    2020-06-04       50      M       Positive
ID1    2020-06-05       50      M       Positive

ID1    2020-07-01       100     B       Negative
ID1    2020-07-02       40      B       Positive
ID1    2020-07-03       70      M       Positive

ID2    2020-10-01       100     B       Negative
ID2    2020-10-04       50      M       Positive
ID2    2020-10-05       50      M       Positive

Any help is highly appreciated!

本文标签： filterExecution of complex filtering procedures in PySparkStack Overflow

版权声明：本文标题：filter - Execution of complex filtering procedures in PySpark - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1744604880a2615291.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

filter - Execution of complex filtering procedures in PySpark - Stack Overflow

更多相关文章

filter - Execution of complex filtering procedures in PySpark - Stack Overflow

发表评论

推荐文章

javascript - How to use the :first-of-type rule inside a styled-componentsemotion partial? - Stack Overflow

How to export bbPress (forums, topics, replies) and all users?

javascript - Clear BehaviorSubject of queued events as each is addressed in Angular2 with Observables - Stack Overflow

javascript - Fields breaking after Upgrade to JQuery 3.5.1 - Stack Overflow

powerbi - Power BI table filtering with Slicers - Stack Overflow

热门文章

javascript - Jquery input text allow decimal and value between 0.00 and 100 - Stack Overflow

javascript - jQuery bug? .appendTo() not working in IE7 - Stack Overflow

javascript - DisplayHide a child component in React - Stack Overflow

javascript - batch file to open multiple URLs - Stack Overflow

javascript - How do I properly map attributes of relations in sequelize.js? - Stack Overflow

javascript - Getting "TransformError: Invalid call" when trying to use require with string concatenation - Sta

javascript - Mark bookedreserved time slots - Stack Overflow

php - error_log() output for print_r() appearing on page

svelte - Passing props to `{@render children()}` - Stack Overflow

windows - VBA - Microsoft Purview - Sensitivity Labels - detecting label event and determining what label's text should

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

javascript - Query junction table without getting both associations in Sequelize - Stack Overflow

How to create a link to jump to "Leave a comment" part?

javascript - Place markers from JSON data for Google MAPS API v3 - Stack Overflow

jquery - How to read alert message with javascript? - Stack Overflow

javascript - Does linking to JQuery CDN slow my site down? - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

编程频道|软件玩家 - 软件改变生活！

filter - Execution of complex filtering procedures in PySpark - Stack Overflow

更多相关文章

filter - Execution of complex filtering procedures in PySpark - Stack Overflow

发表评论

推荐文章

javascript - How to use the :first-of-type rule inside a styled-componentsemotion partial? - Stack Overflow

How to export bbPress (forums, topics, replies) and all users?

javascript - Clear BehaviorSubject of queued events as each is addressed in Angular2 with Observables - Stack Overflow

javascript - Fields breaking after Upgrade to JQuery 3.5.1 - Stack Overflow

powerbi - Power BI table filtering with Slicers - Stack Overflow

热门文章

javascript - Jquery input text allow decimal and value between 0.00 and 100 - Stack Overflow

javascript - jQuery bug? .appendTo() not working in IE7 - Stack Overflow

javascript - DisplayHide a child component in React - Stack Overflow

javascript - batch file to open multiple URLs - Stack Overflow

javascript - How do I properly map attributes of relations in sequelize.js? - Stack Overflow

javascript - Getting &quot;TransformError: Invalid call&quot; when trying to use require with string concatenation - Sta

javascript - Mark bookedreserved time slots - Stack Overflow

php - error_log() output for print_r() appearing on page

svelte - Passing props to `{@render children()}` - Stack Overflow

windows - VBA - Microsoft Purview - Sensitivity Labels - detecting label event and determining what label&#39;s text should

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

javascript - Query junction table without getting both associations in Sequelize - Stack Overflow

How to create a link to jump to &quot;Leave a comment&quot; part?

javascript - Place markers from JSON data for Google MAPS API v3 - Stack Overflow

jquery - How to read alert message with javascript? - Stack Overflow

javascript - Does linking to JQuery CDN slow my site down? - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

javascript - Getting "TransformError: Invalid call" when trying to use require with string concatenation - Sta

windows - VBA - Microsoft Purview - Sensitivity Labels - detecting label event and determining what label's text should

How to create a link to jump to "Leave a comment" part?