graphics - Why does early fragment tests need to be specified in shader if I write to a storage buffer? - Stack Overflow

IT技术

更新时间：2025-03-071

admin管理员组
文章数量:1278984

In my Vulkan shader I get around 500 frames per second. Now if I write to a storage buffer the frame rate drops to 200 fps. I discovered that what it's doing is disabling the early fragment tests. I know this because if I place:

layout(early_fragment_tests) in; // Forces early depth tests

At the top of the file the frame rate goes back up to 500 fps.

I'm wondering, is this normal? So early fragment tests seem to be enabled, and then upon writing to a storage buffer it disabled it automatically, and I have to enable it manually with that line, or "force" it, so to speak.

Really interestingly writing to gl_FragDepth.z didn't have the same effect. So writing to a storage buffer disabled the early fragment tests automatically but writing to gl_FragDepth.z didn't, which is strange, because writing to gl_FragDepth.z is the one that I thought was supposed to disable the early fragment tests.

layout(early_fragment_tests) in; // Forces early depth tests

At the top of the file the frame rate goes back up to 500 fps.

Share Improve this question edited Feb 27 at 22:29 solidpixel 12.2k1 gold badge22 silver badges37 bronze badges asked Feb 24 at 3:04 Zebrafish 14.3k3 gold badges64 silver badges152 bronze badges

Add a comment |

1 Answer 1

Sorted by: Reset to default 1

The short answer is that yes, this is expected. To understand why, let's dive into what's actually happening.

Graphics APIs are specified to process fragment shading in application primitive order. This ordering guarantee is needed to ensure that there is a sensible programmer's model for order-dependent things such as blending, or other side-effects such as writes to memory.

Graphics APIs are also specified to do ZS testing after fragment shading (i.e. late ZS). Doing it after fragment shading is the only point in the pipeline where we can guarantee that we can do it correctly, because the fragment processing might change the depth value or have some other user-visible side effect that we need to process.

The entire concept of early ZS testing is an optimization, allowing hardware to completely skip running fragments. However, the implementation can only automatically use an early test in the subset of cases where it can unambiguously prove that running the fragment is not necessary, and killing it does not change the application-visible behaviour when compared to doing a late test.

In your first case, you cannot use early ZS because the shader must run enough to create the gl_FragDepth.z value needed by the ZS testing.

In your second case, you cannot use early ZS because you have a user-visible side-effect writing to memory outside of the framebuffer. This must get written to memory because the programmer's model says that ZS testing happens after fragment shading, and skipping the write would change the application-visible behavior.

Framebuffer writes are very special in enabling early ZS testing by default, unless the shader modifies its own depth value, because there are strict rules about how the framebuffer is used and when the values in it are visible to the application. Generic memory writes to storage buffers, images, or atomics, don't give any of the necessary guarantees to allow early ZS by default.

In both cases specifying layout(early_fragment_tests) is a way of the application programmer providing a promise to the implementation that it is algorithmically safe to kill the fragments, so you explicitly allow the change in application-visible behavior.

It's worth noting that this type of "optimizations can't change the programmer's model" logic also applies to other types of fragment optimization, such as vendor-specific hidden surface removal algorithms. Memory side-effects outside of the framebuffer tend to disable most of these too ...

本文标签：

版权声明：本文标题：graphics - Why does early fragment tests need to be specified in shader if I write to a storage buffer? - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1741295980a2370820.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

发表评论

全部评论 0

暂无评论

编程频道|软件玩家 - 软件改变生活！

graphics - Why does early fragment tests need to be specified in shader if I write to a storage buffer? - Stack Overflow

1 Answer 1

更多相关文章

javascript - HTML5 Video &quot;Black Screen&quot; on iPad - Stack Overflow

javascript - Image onclick not working - Stack Overflow

javascript - screen.lockOrientation is not a function - Stack Overflow

javascript - Convert JSON Serialized String Containing HTML Entities into object - Stack Overflow

javascript - Vuejs - Cannot read property &#39;_withTask&#39; of undefined - Stack Overflow

javascript - range input disable does not work in chrome - Stack Overflow

winforms - How to move an object to the mouse with a set max speed in Windows Forms using C# - Stack Overflow

how to get html select to show selected value with javascript - Stack Overflow

WooCommerce Registration redirect based on page ID

javascript - Stealing session id cookies - counter measures - Stack Overflow

arm - Soft system restart with use of EWI - early wakeup interrupt - Stack Overflow

javascript - How to reset selected items from the list? - Stack Overflow

javascript - Failed to execute &#39;getImageData&#39; - The canvas has been tainted by cross-origin data - Stack Overflo

flask - 302 error on redirection, unable to redirect to next page while deploying on Railway - Stack Overflow

Replacing part of a string with javascript? - Stack Overflow

Using a custom plugin to capture input data via Ajax and PHP

javascript - &#39;stepUp&#39; called on an object that does not implement interface HTMLInputElement - Stack Overflow

Use gutenberg block editor on plugin page (outside of a post)

vhdl - Vivado: Differing behavior between Behavioral and Post-Synthesis Functional Simulation - Stack Overflow

How to format multiple footnotes on a single sentence in Quarto reveal.js? - Stack Overflow

发表评论

推荐文章

javascript - RequireJS: when to use &#39;paths&#39; versus &#39;packages&#39; - Stack Overflow

javascript - Value of &#39;e&#39; may be overwritten in IE 8 and earlier - Stack Overflow

python logging use two different loggers with different formatting - Stack Overflow

How to Run Code Before a New Site is Created on MultiSite for Validation

Order custom post type by taxonomy

热门文章

javascript - How to redirect user to another page after login that base on JWT-token? - Stack Overflow

Call a javascript function at distinct time intervals - Stack Overflow

javascript - Howwhy is “ *[attribute^=&quot;string&quot; ” a valid querySelector? (JS bug?) - Stack Overflow

PowerBI DAX: Formula for an &quot;Initial Transaction Flag&quot; - Stack Overflow

javascript - js class static method refer to self (like self in php) - Stack Overflow

javascript - Handsontable - Change row height - Stack Overflow

How to detect HTML email input validation with JavaScript? - Stack Overflow

javascript - Open lightgallery.js programmatically - Stack Overflow

How to play audio only from youtube videos using HTML and javascript - Stack Overflow

javascript - error Filters are deprecated how to solve this error in vue.js 3 - Stack Overflow

最新文章

Win7各正式版下载地址和SHA验证

怎么样把中文版的Windows7改成英文版的Windows7

Win7系统笔记本蓝牙打开指南：详细步骤助你轻松连接

win7开机弹计算机,win7开机弹出Windows Installer窗口的解决方法

windows7虚拟机安装vmtools方法

JavaScript: How would I reverse ONLY the words in a string - Stack Overflow

javascript - JSONP To Acquire JSON From HTTPS Protocol with JQuery - Stack Overflow

How to format multiple footnotes on a single sentence in Quarto reveal.js? - Stack Overflow

javascript - redux-persist - When to persist reducer? - Stack Overflow

.editor-styles-wrapper overriding my block styles in Gutenberg

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价

javascript - HTML5 Video "Black Screen" on iPad - Stack Overflow

javascript - Vuejs - Cannot read property '_withTask' of undefined - Stack Overflow

javascript - Failed to execute 'getImageData' - The canvas has been tainted by cross-origin data - Stack Overflo

javascript - 'stepUp' called on an object that does not implement interface HTMLInputElement - Stack Overflow

javascript - RequireJS: when to use 'paths' versus 'packages' - Stack Overflow

javascript - Value of 'e' may be overwritten in IE 8 and earlier - Stack Overflow

javascript - Howwhy is “ *[attribute^="string" ” a valid querySelector? (JS bug?) - Stack Overflow

PowerBI DAX: Formula for an "Initial Transaction Flag" - Stack Overflow