wal-bytes-per-sync和bytes-per-sync

TiDBer_jYQINSnf · 2021 年11 月 18 日 04:04

TiKV:4.0.8
这两个参数的作用是什么?影响是什么?
sync-log是true的情况下，wal-bytes-per-sync这个参数有用吗？

TiDBer_jYQINSnf · 2021 年11 月 18 日 06:15

sync-log控制同步刷盘，如果wal-bytes-per-sync设置了512k,这是个什么单位? 每秒512k的wal? 如果写入很多的话，岂不是这个会拦住写? 具体怎么控制的速率?

TiDBer_jYQINSnf · 2021 年11 月 18 日 06:23

官网文档和github的toml文件中只有一句话，不是特别理解，想省个事儿直接问一波点透。对写入的过程中到底哪个参数能影响写入速度想了解清楚些，有相关文档就非常好了

Lucien-卢西恩 · 2021 年11 月 18 日 06:29

这个参数默认不需要调整，具体要看一下监控，如果监控里面 WAL 写入瓶颈，同时I/O 负载没有上线情况下，可以调整一下。

TiDBer_jYQINSnf · 2021 年11 月 18 日 06:34

如图WAL的监控算慢吗?
k8s operator部署的grafana上有磁盘的IO监控吗？没有node_exporter。

Lucien-卢西恩 · 2021 年11 月 18 日 06:46

算，很慢了。这个是 raft 的监控吗？

TiDBer_jYQINSnf · 2021 年11 月 18 日 06:48

是的，昨天的监控是这样的，上午这样调了下参数

[rocksdb.defaultcf]
write-buffer-size = "512MB"

[raftstore]
apply-pool-size = 4
store-pool-size = 4


[rocksdb]
wal-bytes-per-sync = "1M"

对wal-bytes-per-sync的参数工作原理不是特别了解，就来问下。

TiDBer_jYQINSnf · 2021 年11 月 18 日 06:53

你这里面的两个链接我看了，很清楚。不过具体的wal-bytes-per-sync的工作原理还没解答。
我的疑问是:wal-bytes-per-sync这个参数难道是为了控制磁盘IO压力而设置的吗？如果这样的话，那岂不是wal写不下去，wal写不下去的话，整个数据库的写就卡住了。还不如直接让磁盘io打到满为止。

所以这个参数如果真是为了应对一个机器上多个rocksdb的时候，各个实例平等的利用io，那这个参数是有用的。

所以对这个参数的真是作用是不是上面我理解的，希望有了解的同学给个答案

Lucien-卢西恩 · 2021 年11 月 18 日 09:26

可以用 metrictool 导出一下完整的 tikv-details 监控，我们一起看一下吧

TiDBer_jYQINSnf · 2021 年11 月 18 日 09:58

tidb_metric.rar (1.6 MB)
请帮忙看下，谢谢。这个场景主要是load data，导数据，导数据的时候查询很少，就是想导数快一些。

Meditator · 2021 年11 月 24 日 01:44

这个参数是wal的刷盘策略，wal默认有三种刷盘策略：
1）sync模式，每个写请求都会刷盘，最终调用fsync刷盘，这种模式最安全，但效率最低；
2）wal-bytes-per-sync模式，如果设置为1M，则认为是写满1M数据后就刷盘，最多丢失1MB的数据；
3）default模式，交给OS来刷盘，安全性最低，丢失数据可能最多。

至于sync-log参数，有一段解释是这样的：
TiDB is based on the Raft consensus algorithm, it needs to make sure every raft log is persisted to the majority’s disk before committing them. To make sure a log is persisted, we need two steps:

write(log_fd, log)
fsync(log_fd)

When sync-log=false , TiDB skips fsync , it helps performance and is safe if there is no power failure. Set it to true, your data is always intact even if there is a power failure

What is the relationship between `sync-log` and `wal-bytes-per-sync`

github.com/tikv/tikv

What is the relationship between `sync-log` and `wal-bytes-per-sync`

已打开 08:04AM - 04 Jun 19 UTC

已关闭 07:09AM - 11 Jun 19 UTC

Yriuns

type/question

## What is the relationship between `sync-log` and `wal-bytes-per-sync` I kno…w that each `TiKV` instance contains two `RocksDB` instances, one is `Raft RocksDB`, stores `raft log` and the other stores `key-value`. I also know that in a standard raft write process, the raft cluster should persist log before respond `success`. My question are: 1. If `wal-bytes-per-sync` is not `0`, when we calls something like `save_to_raft_RocksDB(log)`, the log will be put in the `memtable` of `RocksDB` and return `ok` to the `RocksDB client`. But the log might be waiting to be persisted into the `WAL` in the background. So if the power is cut off at this time, will the log be lost? 2. The introduction of `sync-log` says ```toml ## Whether to force to flush logs. ## Set to `true` (default) for best reliability, which prevents data loss when there is a power ## failure. sync-log = true ``` I wonder if it means that, if I set `sync-log = true`, `Raft RocksDB` will save every write into `WAL`? In other words, it will overwrite `wal-bytes-per-sync` (set it to 0)?

Meditator · 2021 年11 月 24 日 01:58

bytes-per-sync
这个参数，官网解释的笼统，不知确切含义

Meditator · 2021 年11 月 24 日 02:14

DBOptions()::bytesPerSync()

Allows OS to incrementally sync files to disk while they are being written, asynchronously, in the background.

这里的files是指什么？

TiDBer_jYQINSnf · 2021 年11 月 24 日 02:59

所以，按照这个解释，还是说只要开了sync-log=true，另一个参数没用了对吗？那另一个参数的意义是什么？好像还是没有得到答案。等有空我翻翻代码吧。

Meditator · 2021 年11 月 24 日 03:32

是的，按上面的理解是这样的

system · 2022 年10 月 31 日 19:16

此话题已在最后回复的 1 分钟后被自动关闭。不再允许新回复。

wal-bytes-per-sync和bytes-per-sync

What is the relationship between sync-log and wal-bytes-per-sync

What is the relationship between `sync-log` and `wal-bytes-per-sync`