Avoid duplicate messages for topics across different clusters

We have a Spring Boot application with multiple pods deployed on Kubernetes. It consumes events from a topic in one cluster, C1, transforms the messages, and pushes the transformed data to a topic in a different cluster using the KafkaTemplate class.

Is it possible to deduplicate events using the exactly-once semantics configuration, given that two clusters are involved?

If not, what options are available so that no events get duplicated even if a pod restarts or pods are added/removed dynamically?
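
For concreteness, here is a minimal sketch of the setup (the broker addresses, topic names, group id, and the transform are placeholders, not our real values): the listener is bound to cluster C1 and the KafkaTemplate to the second cluster.

    import java.util.HashMap;
    import java.util.Map;

    import org.apache.kafka.clients.consumer.ConsumerConfig;
    import org.apache.kafka.clients.consumer.ConsumerRecord;
    import org.apache.kafka.clients.producer.ProducerConfig;
    import org.apache.kafka.common.serialization.StringDeserializer;
    import org.apache.kafka.common.serialization.StringSerializer;
    import org.springframework.context.annotation.Bean;
    import org.springframework.context.annotation.Configuration;
    import org.springframework.kafka.annotation.KafkaListener;
    import org.springframework.kafka.config.ConcurrentKafkaListenerContainerFactory;
    import org.springframework.kafka.core.ConsumerFactory;
    import org.springframework.kafka.core.DefaultKafkaConsumerFactory;
    import org.springframework.kafka.core.DefaultKafkaProducerFactory;
    import org.springframework.kafka.core.KafkaTemplate;
    import org.springframework.stereotype.Component;

    @Configuration
    class TwoClusterConfig {

        // Consumer side: bound to cluster C1.
        @Bean
        ConsumerFactory<String, String> c1ConsumerFactory() {
            Map<String, Object> props = new HashMap<>();
            props.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "c1-broker:9092");
            props.put(ConsumerConfig.GROUP_ID_CONFIG, "transformer-group");
            props.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class);
            props.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG, StringDeserializer.class);
            return new DefaultKafkaConsumerFactory<>(props);
        }

        @Bean
        ConcurrentKafkaListenerContainerFactory<String, String> c1ListenerFactory() {
            ConcurrentKafkaListenerContainerFactory<String, String> factory =
                    new ConcurrentKafkaListenerContainerFactory<>();
            factory.setConsumerFactory(c1ConsumerFactory());
            return factory;
        }

        // Producer side: bound to the second cluster.
        @Bean
        KafkaTemplate<String, String> c2Template() {
            Map<String, Object> props = new HashMap<>();
            props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "c2-broker:9092");
            props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class);
            props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class);
            return new KafkaTemplate<>(new DefaultKafkaProducerFactory<>(props));
        }
    }

    @Component
    class Transformer {

        private final KafkaTemplate<String, String> c2Template;

        Transformer(KafkaTemplate<String, String> c2Template) {
            this.c2Template = c2Template;
        }

        // Consume from C1, transform, produce to the other cluster.
        @KafkaListener(topics = "source-topic", containerFactory = "c1ListenerFactory")
        void onMessage(ConsumerRecord<String, String> record) {
            String transformed = record.value().toUpperCase(); // stand-in transform
            c2Template.send("target-topic", record.key(), transformed);
        }
    }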



1 Answer


You will need to use an intermediate KV store, such as Redis, Mongo, Postgres, Elasticsearch, etc. Kafka itself will never know there are duplicates (yes, even compacted topics can contain duplicate keys), even within one cluster.

You'd insert/query every event against your database to know whether it's been seen before, as in the sketch below.
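
Here's a hedged sketch of that insert/query step, using Redis SETNX semantics via Spring Data Redis. The key prefix, the 7-day TTL, and the assumption that the record key uniquely identifies an event are all illustrative choices, not requirements.

    import java.time.Duration;

    import org.apache.kafka.clients.consumer.ConsumerRecord;
    import org.springframework.data.redis.core.StringRedisTemplate;
    import org.springframework.kafka.annotation.KafkaListener;
    import org.springframework.kafka.core.KafkaTemplate;
    import org.springframework.stereotype.Component;

    @Component
    class DedupingTransformer {

        private final StringRedisTemplate redis;
        private final KafkaTemplate<String, String> c2Template;

        DedupingTransformer(StringRedisTemplate redis, KafkaTemplate<String, String> c2Template) {
            this.redis = redis;
            this.c2Template = c2Template;
        }

        @KafkaListener(topics = "source-topic", containerFactory = "c1ListenerFactory")
        void onMessage(ConsumerRecord<String, String> record) {
            String eventId = record.key(); // assumes the key uniquely identifies the event
            // SET ... NX: true only for the first pod/attempt that claims this id,
            // so redeliveries after a rebalance or pod restart are dropped here.
            Boolean firstSeen = redis.opsForValue()
                    .setIfAbsent("dedup:" + eventId, "1", Duration.ofDays(7));
            if (Boolean.TRUE.equals(firstSeen)) {
                c2Template.send("target-topic", eventId, transform(record.value()));
            }
        }

        private String transform(String value) {
            return value.toUpperCase(); // stand-in for the real transformation
        }
    }

Note the trade-off in ordering: if a pod crashes between the Redis write and the send, that event is dropped rather than duplicated; marking the id only after a successful send flips the failure mode back to at-least-once.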

Look up two-phase commit (2PC) patterns for more ideas around this concept.

Depending on your use case, you could also use a workflow framework like Temporal to handle such distributed transactions.
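
A rough sketch of the Temporal idea with its Java SDK (the workflow and activity names here are invented for illustration): Temporal refuses to start a second workflow with the same workflow id, so using the event id as the workflow id pushes the dedup into the framework.

    import java.time.Duration;

    import io.temporal.activity.ActivityInterface;
    import io.temporal.activity.ActivityOptions;
    import io.temporal.workflow.Workflow;
    import io.temporal.workflow.WorkflowInterface;
    import io.temporal.workflow.WorkflowMethod;

    @WorkflowInterface
    interface PublishWorkflow {
        @WorkflowMethod
        void publish(String payload);
    }

    @ActivityInterface
    interface PublishActivities {
        // Would wrap the KafkaTemplate.send(...) to the second cluster.
        void sendToSecondCluster(String payload);
    }

    class PublishWorkflowImpl implements PublishWorkflow {

        private final PublishActivities activities = Workflow.newActivityStub(
                PublishActivities.class,
                ActivityOptions.newBuilder()
                        .setStartToCloseTimeout(Duration.ofSeconds(30))
                        .build());

        @Override
        public void publish(String payload) {
            // Temporal retries the activity on failure without re-running completed steps.
            activities.sendToSecondCluster(payload);
        }
    }

The consumer would then start one workflow per event via WorkflowOptions.newBuilder().setWorkflowId(eventId), so a duplicate delivery fails to start a second workflow instead of producing twice.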
