Description
https://www.confluent.io/blog/transactions-apache-kafka/
The key to fencing out zombies properly is to ensure that the input topics and partitions in the read-process-write cycle are always the same for a given transactional.id. If this isn't true, then it is possible for some messages to leak through the fencing provided by transactions.
For instance, in a distributed stream processing application, suppose topic-partition tp0 was originally processed by transactional.id T0. If, at some later point, it were remapped to another producer with transactional.id T1, there would be no fencing between T0 and T1. So it would be possible for messages from tp0 to be reprocessed, violating the exactly-once processing guarantee.
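For context, a minimal sketch of the transactional read-process-write cycle this relies on, using the Java client. The transactional.id T0, the topic names, and the bootstrap address are illustrative; the point is that initTransactions() bumps the epoch for that id, fencing out any zombie producer still using it:

```java
import java.time.Duration;
import java.util.List;
import java.util.Map;
import java.util.Properties;

import org.apache.kafka.clients.consumer.*;
import org.apache.kafka.clients.producer.*;
import org.apache.kafka.common.KafkaException;
import org.apache.kafka.common.TopicPartition;
import org.apache.kafka.common.errors.ProducerFencedException;

public class TransactionalCopy {
    public static void main(String[] args) {
        Properties pp = new Properties();
        pp.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        pp.put(ProducerConfig.TRANSACTIONAL_ID_CONFIG, "T0"); // must stay fixed for tp0
        pp.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG,
               "org.apache.kafka.common.serialization.StringSerializer");
        pp.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG,
               "org.apache.kafka.common.serialization.StringSerializer");

        Properties cp = new Properties();
        cp.put(ConsumerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        cp.put(ConsumerConfig.GROUP_ID_CONFIG, "copy-app");
        cp.put(ConsumerConfig.ISOLATION_LEVEL_CONFIG, "read_committed");
        cp.put(ConsumerConfig.ENABLE_AUTO_COMMIT_CONFIG, "false");
        cp.put(ConsumerConfig.KEY_DESERIALIZER_CLASS_CONFIG,
               "org.apache.kafka.common.serialization.StringDeserializer");
        cp.put(ConsumerConfig.VALUE_DESERIALIZER_CLASS_CONFIG,
               "org.apache.kafka.common.serialization.StringDeserializer");

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(pp);
             KafkaConsumer<String, String> consumer = new KafkaConsumer<>(cp)) {

            TopicPartition tp0 = new TopicPartition("input", 0);
            consumer.assign(List.of(tp0));

            // Registers T0 with the transaction coordinator and bumps its
            // epoch, fencing any zombie producer still using the same id.
            producer.initTransactions();

            while (true) {
                ConsumerRecords<String, String> records = consumer.poll(Duration.ofMillis(100));
                if (records.isEmpty()) continue;

                try {
                    producer.beginTransaction();
                    for (ConsumerRecord<String, String> rec : records) {
                        producer.send(new ProducerRecord<>("output", rec.key(), rec.value()));
                    }
                    // Commit input offsets inside the same transaction so the
                    // whole read-process-write cycle is atomic.
                    List<ConsumerRecord<String, String>> batch = records.records(tp0);
                    long next = batch.get(batch.size() - 1).offset() + 1;
                    producer.sendOffsetsToTransaction(
                            Map.of(tp0, new OffsetAndMetadata(next)),
                            consumer.groupMetadata());
                    producer.commitTransaction();
                } catch (ProducerFencedException e) {
                    // A newer producer with transactional.id T0 took over; this
                    // instance is the zombie and must stop.
                    return;
                } catch (KafkaException e) {
                    producer.abortTransaction();
                }
            }
        }
    }
}
```

The fencing only holds because "T0" is tied to tp0; if another instance processed tp0 under a different id, its transactions would commit unchecked.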
Practically, one would either have to store the mapping between input partitions and transactional.ids in an external store, or have some static encoding of it. Kafka Streams opts for the latter approach to solve this problem.
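As a sketch of what a static encoding might look like, one can derive the transactional.id as a pure function of the input partition, in the spirit of Kafka Streams' scheme of combining the application.id with the task id. The helper name and id format below are hypothetical, not the Streams implementation:

```java
import org.apache.kafka.common.TopicPartition;

public final class TransactionalIds {
    // Hypothetical static encoding: because the id is a pure function of the
    // input partition, whichever instance picks up tp0 after a rebalance or
    // restart uses the same transactional.id and is fenced against zombies,
    // with no external store needed to track the mapping.
    public static String forPartition(String applicationId, TopicPartition tp) {
        return applicationId + "-" + tp.topic() + "-" + tp.partition();
    }
}
```

For example, forPartition("copy-app", new TopicPartition("input", 0)) always yields "copy-app-input-0", so the partition-to-id mapping can never drift the way an externally stored mapping could.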