We can recover lost events if the data source lives in the stack’s persistence layer.
In terms of ingestion latency, increasing CPU resources and fine-tuning batch sizes yield a significant increase in throughput, reducing total lag from several hours to a few minutes.
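As a minimal sketch of the consumer-side tuning described above: the option names are standard Kafka consumer configuration keys, but the specific values are illustrative assumptions, not the ones used in the original deployment.

```python
# Illustrative batch-tuning knobs for a Kafka consumer; the values here
# are assumptions for demonstration, not production recommendations.
consumer_tuning = {
    "max.poll.records": 2000,      # larger batches per poll -> higher throughput
    "fetch.min.bytes": 1_048_576,  # wait for ~1 MB of data before answering a fetch
    "fetch.max.wait.ms": 500,      # ...but never wait longer than 500 ms
}

def polls_per_million_records(records_per_poll: int) -> int:
    """Rough illustration: bigger batches mean fewer poll round-trips."""
    return 1_000_000 // records_per_poll
```

For example, at 2,000 records per poll, a million records need only 500 poll round-trips, which is where most of the throughput gain comes from.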
Most importantly, we can also make changes on the Kafka side to address this problem:
- Changing the Kafka offset reset policy (auto.offset.reset) in several applications to earliest.
For example, suppose ingestion lag is observed again due to a spike in data traffic.
In that case, we attempt to load the oldest available record in Kafka to reduce the blast radius of data loss.
This is especially useful in services where auto commits are disabled and idempotency is ensured.
- Extending the Kafka retention policy by several days to allow the system to re-stabilize and to give development teams enough time to work on fixes without losing data.
Once the system is stable, the retention policy can, of course, be reverted to its default or custom setting.
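The two changes above can be sketched as follows. The consumer option names are standard Kafka configuration keys; the broker address, topic name, and the 7-day retention value are placeholders, not values from the original setup.

```python
# Consumer-side recovery settings described above; key names follow the
# standard Kafka consumer configuration.
recovery_settings = {
    "auto.offset.reset": "earliest",  # fall back to the oldest available record
    "enable.auto.commit": False,      # commit manually so replays stay idempotent
}

# Extending topic retention happens on the broker side, e.g. with the
# stock kafka-configs tool (7 days shown here, value in milliseconds;
# broker address and topic name are placeholders):
#
#   kafka-configs.sh --bootstrap-server localhost:9092 \
#     --entity-type topics --entity-name events \
#     --alter --add-config retention.ms=604800000
#
# Running the same command with --delete-config retention.ms reverts
# the topic to the cluster default.
SEVEN_DAYS_MS = 7 * 24 * 3600 * 1000  # 604_800_000
```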
Each station in Memphis uses a stream file to store the station's messages.
The user can choose whether that stream is held in memory or persisted to a file on disk, from which the messages can later be retrieved.
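To make the memory-versus-file choice concrete, here is a toy sketch of the idea. This is a self-contained illustration, not the Memphis API: the class name, methods, and file path are all hypothetical.

```python
import os
import tempfile
from typing import List, Optional

class StationStore:
    """Toy sketch (not the Memphis API) of a station that keeps its
    message stream either in memory or in a file on disk."""

    def __init__(self, storage_type: str = "file", path: Optional[str] = None):
        if storage_type not in ("memory", "file"):
            raise ValueError("storage_type must be 'memory' or 'file'")
        self.storage_type = storage_type
        self._memory: List[str] = []
        # Hypothetical default location for the on-disk stream file.
        self._path = path or os.path.join(tempfile.gettempdir(), "station.stream")
        if storage_type == "file":
            open(self._path, "w").close()  # start with an empty stream file

    def produce(self, message: str) -> None:
        if self.storage_type == "memory":
            self._memory.append(message)  # fast, but lost on restart
        else:
            with open(self._path, "a") as f:
                f.write(message + "\n")   # durable across restarts

    def retrieve(self) -> List[str]:
        if self.storage_type == "memory":
            return list(self._memory)
        with open(self._path) as f:
            return f.read().splitlines()
```

In-memory storage trades durability for speed, while file storage survives restarts; the retrieval path is the same either way, which mirrors how the user-facing choice is just a storage setting.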