Should I call
addSink()? Even when I add a Kafka source or sink?
Sources and sinks are also operators, although they are—as such—not listed in the Flink documentation. Sources and sinks may also be stateful operators. In this case, a Kafka source (consumer) is storing its partition offsets and an at-least-once or exactly-once Kafka sink (producer) is storing information on Kafka transactions in state.
To make sure your job can recover from a savepoint smoothly, even after job topology changes, we thus recommend to always set a
addSink(), for example:
|Tip: We recommend to set a