Skip to content

Grafana Agent with remote write receiver #538

Answered by rfratto
naveen-adisesha asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @naveen-adisesha, sorry for the long delay on getting back to you. The downtime tolerance depends on how frequently the WAL is truncated. As of 0.13.0, our default is 60 minutes, which gives you up to 2/3rds of that in downtime tolerance. I'm working with the Prometheus team on changing this, because it's confusing. The doc of my proposal (which explains more about downtime tolerances) is here.

If you want to tolerate 24 hours of downtime, you'll probably want to set your wal_truncate_frequency in the Agent to 48h. How much storage that'll use depends on your metrics throughput, but it would likely take up more storage than Prometheus TSDB over an equivalent time range since the TSDB s…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@naveen-adisesha
Comment options

Answer selected by naveen-adisesha
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants