==============================
Deadline IO scheduler tunables
==============================

This little file attempts to document how the deadline io scheduler works.
In particular, it will clarify the meaning of the exposed tunables that may be
of interest to power users.

Selecting IO schedulers
-----------------------
Refer to Documentation/block/switching-sched.rst for information on
selecting an io scheduler on a per-device basis.

------------------------------------------------------------------------------

read_expire (in ms)
-----------------------

The goal of the deadline io scheduler is to attempt to guarantee a start
service time for a request. As we focus mainly on read latencies, this is
tunable. When a read request first enters the io scheduler, it is assigned
a deadline that is the current time + the read_expire value in units of
milliseconds.


write_expire (in ms)
-----------------------

Similar to read_expire mentioned above, but for writes.


fifo_batch (number of requests)
------------------------------------

Requests are grouped into ``batches`` of a particular data direction (read or
write) which are serviced in increasing sector order. To limit extra seeking,
deadline expiries are only checked between batches.
fifo_batch controls the maximum number of requests per batch.

This parameter tunes the balance between per-request latency and aggregate
throughput. When low latency is the primary concern, smaller is better (where
a value of 1 yields first-come first-served behaviour). Increasing fifo_batch
generally improves throughput, at the cost of latency variation.


writes_starved (number of dispatches)
--------------------------------------

When we have to move requests from the io scheduler queue to the block
device dispatch queue, we always give a preference to reads. However, we
don't want to starve writes indefinitely either. So writes_starved controls
how many times we give preference to reads over writes. When that has been
done writes_starved number of times, we dispatch some writes based on the
same criteria as reads.


front_merges (bool)
----------------------

Sometimes it happens that a request enters the io scheduler that is contiguous
with a request that is already on the queue. Either it fits in the back of that
request, or it fits at the front. That is called either a back merge candidate
or a front merge candidate. Due to the way files are typically laid out,
back merges are much more common than front merges. For some workloads, you
may even know that it is a waste of time to spend any time attempting to
front merge requests. Setting front_merges to 0 disables this functionality.
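As a sketch of how these tunables are typically inspected and adjusted at
runtime via sysfs (the device name ``sda`` is an illustrative assumption,
not taken from this document; substitute your own block device)::

```shell
# "sda" is a hypothetical example device; substitute your own.
DEV=sda

# Show the current values of all exposed deadline tunables.
grep . /sys/block/$DEV/queue/iosched/*

# Trade latency variation for throughput by allowing larger batches.
echo 32 > /sys/block/$DEV/queue/iosched/fifo_batch

# Skip front merge lookups for a back-merge-dominated workload.
echo 0 > /sys/block/$DEV/queue/iosched/front_merges
```

Writes to these attributes take effect immediately and require root
privileges; the values are per-device, not global.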
Front merges may still occur due to the cached last_merge hint, but since
that comes at basically 0 cost we leave that on. We simply disable the
rbtree front sector lookup when the io scheduler merge function is called.


Nov 11 2002, Jens Axboe <jens.axboe@oracle.com>