I am using Greenplum Database (GPDB) for OLAP workloads, and I have several tables where the primary key is defined as SERIAL8. Since SERIAL8 automatically creates a unique SEQUENCE for each table, I assumed this would work efficiently for parallel inserts. However, I’ve read that in Greenplum, SEQUENCE is managed globally by the master node, which could create performance bottlenecks when multiple segments request new IDs in parallel. Given that Greenplum is an MPP system, I’m concerned that heavy parallel inserts (INSERT INTO ... SELECT ...) could slow down due to SEQUENCE contention.
My questions are:
Does using SERIAL8 (or BIGSERIAL) in Greenplum actually cause performance issues during large-scale parallel inserts?
If yes, what is the best way to optimize it? Should I manually create a SEQUENCE with INCREMENT to distribute IDs across segments, or is there another recommended approach?
Would setting DISTRIBUTED BY (id) help in any way, or does it only affect data distribution and not SEQUENCE contention?
Are there alternative strategies (e.g., UUIDs, pre-generated IDs in ETL, etc.) that are better suited for high-performance OLAP inserts?
I want to ensure that my ID generation strategy does not become a bottleneck as data volume grows. Any insights from experienced Greenplum users would be greatly appreciated!
Asked Feb 20 at 12:35 by Nika

1 Answer
If it turns out to be a problem, you can increase the sequence cache setting:

alter sequence s1 cache 10;

That way each concurrent worker needs to touch the sequence object only once every 10 nextval() requests, because it gets 10 pre-allocated values at once.
The price is that it increases the number of gaps in the sequence: all pre-allocated but unused values are discarded; they do not go back to be re-used by another session. Mind you, even with cache 1 you should not rely on a serial column being gapless, nor should you rely on the insertion order following the order of numbers returned by the sequence. A sequence loses values any time something takes a nextval() and then rolls back, or upserts with insert .. on conflict.
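As a sketch (the sequence name, table name, and cache size here are illustrative, not from the question), the cache can also be set when the sequence is created, and the sequence attached to the table explicitly:

```sql
-- Illustrative names; CACHE 100 pre-allocates 100 values per session,
-- so each worker contacts the coordinator once per 100 nextval() calls.
CREATE SEQUENCE order_id_seq CACHE 100;

CREATE TABLE orders (
    id bigint NOT NULL DEFAULT nextval('order_id_seq'),
    payload text
) DISTRIBUTED BY (id);
```

Note that DISTRIBUTED BY only controls which segment each row lands on; it does not change how sequence values are handed out, so it does not by itself remove sequence contention.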
You can also side-step the problem entirely:
create table t1(
id uuid primary key default gen_random_uuid()
,created_at timestamptz default now()
);
By design, gen_random_uuid() produces identifiers that are unique for all practical purposes, and it doesn't matter where the identifier is generated, which removes the need for clients to share a sequence object. If you need some sort of insertion-order information, an actual timestamptz is more reliable than a sequence.
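For example, with the t1 table defined above, insertion order can be approximated from the timestamp column rather than from any id ordering:

```sql
-- Rough insertion order comes from the timestamp, not the uuid.
SELECT id, created_at
FROM t1
ORDER BY created_at;
```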
If gen_random_uuid() isn't available in your version of Greenplum, you can use the uuid-ossp extension instead.
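A minimal sketch of that fallback, assuming the uuid-ossp extension can be installed in your cluster (uuid_generate_v4() is its random-UUID generator):

```sql
CREATE EXTENSION IF NOT EXISTS "uuid-ossp";

CREATE TABLE t1 (
    id uuid PRIMARY KEY DEFAULT uuid_generate_v4(),
    created_at timestamptz DEFAULT now()
);
```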
Comment by Zegarek (Feb 20 at 21:13): pgbench