 4c6f8a79ae
			
		
	
	
			Split that into a separate file, put under "features". Cc: Yong Huang <yong.huang@smartx.com> Reviewed-by: Cédric Le Goater <clg@redhat.com> Link: https://lore.kernel.org/r/20240109064628.595453-8-peterx@redhat.com Signed-off-by: Peter Xu <peterx@redhat.com>
		
			
				
	
	
		
			72 lines
		
	
	
		
			3.6 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
			
		
		
	
	
Dirty limit
===========

The dirty limit, short for dirty page rate upper limit, is a capability
introduced in the QEMU 8.1 release. It uses an algorithm based on the KVM
dirty ring to throttle the guest during live migration.

The algorithm framework is as follows:

::

  ------------------------------------------------------------------------------
  main   --------------> throttle thread ------------> PREPARE(1) <--------
  thread  \                                                |              |
           \                                               |              |
            \                                              V              |
             -\                                        CALCULATE(2)       |
               \                                           |              |
                \                                          |              |
                 \                                         V              |
                  \                                    SET PENALTY(3) -----
                   -\                                      |
                     \                                     |
                      \                                    V
                       -> virtual CPU thread -------> ACCEPT PENALTY(4)
  ------------------------------------------------------------------------------

When the QMP command qmp_set_vcpu_dirty_limit is called for the first time,
the QEMU main thread starts the throttle thread. Once launched, the throttle
thread executes a loop that consists of three steps:

  - PREPARE (1)

     As the name implies, PREPARE (1) gathers the inputs for the second
     stage, CALCULATE (2): the dirty page rate value and the corresponding
     upper limit of the VM. The dirty page rate is calculated via the KVM
     dirty ring mechanism, which tells QEMU how many pages a virtual CPU
     has dirtied since the last KVM_EXIT_DIRTY_RING_FULL exit. The dirty
     page rate upper limit is specified by the caller, so it is fetched
     directly.

  - CALCULATE (2)

     Calculate a suitable sleep period for each virtual CPU, which will be
     used to determine the penalty for the target virtual CPU. The
     computation must be done carefully in order to bring the dirty page
     rate progressively down to the upper limit without oscillation. To
     achieve this, two strategies are provided. The first adds or subtracts
     sleep time based on the ratio of the current dirty page rate to the
     limit, and is used when the current dirty page rate is far from the
     limit. The second adds or subtracts a fixed time, and is used when
     the current dirty page rate is close to the limit.

  - SET PENALTY (3)

     Set the sleep time for each virtual CPU that should be penalized,
     based on the results of the calculation in step CALCULATE (2).

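The two strategies described in CALCULATE (2) can be sketched as follows. This is a minimal illustration and not QEMU's implementation: the function name ``adjust_sleep_us``, the 25% threshold that separates "far" from "close", the step sizes, and the upper cap on the sleep time are all assumptions chosen for readability, because the text above does not give concrete values.

```python
# Hypothetical tuning values; the document does not specify the real ones.
NEAR_LIMIT_RATIO = 0.25        # within 25% of the limit counts as "close"
PROPORTIONAL_BASE_US = 10_000  # scale factor for the ratio-based step
FIXED_STEP_US = 1_000          # fixed step used close to the limit
MAX_SLEEP_US = 500_000         # assumed cap so a vCPU is never stalled forever


def adjust_sleep_us(sleep_us: int, rate: float, limit: float) -> int:
    """Return the next sleep time for one virtual CPU.

    rate and limit are dirty page rates in the same unit (for example MB/s).
    """
    if limit <= 0:
        raise ValueError("limit must be positive")
    if rate == limit:
        return sleep_us  # already on target, leave the penalty unchanged

    gap = abs(rate - limit) / limit
    if gap > NEAR_LIMIT_RATIO:
        # Far from the limit: the step grows with the relative distance.
        step = int(PROPORTIONAL_BASE_US * gap)
    else:
        # Close to the limit: a small fixed step avoids overshooting.
        step = FIXED_STEP_US

    # Dirtying too fast means more sleep; dirtying too slowly means less.
    if rate > limit:
        sleep_us += step
    else:
        sleep_us -= step
    return max(0, min(sleep_us, MAX_SLEEP_US))
```

Under these assumed constants, a virtual CPU dirtying at twice its limit gets a large first step (``adjust_sleep_us(0, 200.0, 100.0)`` returns ``10000``), while one that is only 10% over the limit is nudged by the fixed step (``adjust_sleep_us(10000, 110.0, 100.0)`` returns ``11000``).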
After completing these three stages, the throttle thread loops back to
step PREPARE (1) until the dirty limit is reached.

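The shape of this loop can be illustrated with a toy simulation. Everything in it is hypothetical: the ``ToyGuest`` class, the one-second period, the 4 KiB page size, the fixed 50 ms step, and the assumption that time spent asleep reduces the number of dirtied pages linearly. It only shows how the three steps feed into each other, not how QEMU measures or throttles a real guest.

```python
from dataclasses import dataclass

PAGE_SIZE = 4096        # bytes, assumed guest page size
PERIOD_US = 1_000_000   # assumed length of one loop iteration
STEP_US = 50_000        # assumed fixed increase of the penalty per round


@dataclass
class ToyGuest:
    demand_pages: int   # pages dirtied per period when never put to sleep
    sleep_us: int = 0   # penalty currently applied to the virtual CPU

    def pages_dirtied(self) -> int:
        """Toy model: dirtying scales with the fraction of time spent awake."""
        awake_us = max(0, PERIOD_US - self.sleep_us)
        return self.demand_pages * awake_us // PERIOD_US


def throttle_loop(guest: ToyGuest, limit_mbps: float, max_rounds: int = 100):
    """Run PREPARE, CALCULATE and SET PENALTY until the limit is reached.

    Returns (rounds_needed, final_rate_mbps).
    """
    rate = 0.0
    for round_no in range(max_rounds):
        # PREPARE (1): convert the page count of this period into MB/s.
        rate = guest.pages_dirtied() * PAGE_SIZE / (1024 * 1024)
        if rate <= limit_mbps:
            return round_no, rate
        # CALCULATE (2): simplest possible policy, a fixed step.
        new_sleep_us = min(guest.sleep_us + STEP_US, PERIOD_US)
        # SET PENALTY (3): publish the value for the virtual CPU thread.
        guest.sleep_us = new_sleep_us
    return max_rounds, rate
```

In this model a guest that would dirty 51200 pages per second (200 MB/s) against a limit of 100 MB/s needs ten rounds, ending with a 500 ms penalty per period: ``throttle_loop(ToyGuest(demand_pages=51200), 100.0)`` returns ``(10, 100.0)``.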
Meanwhile, each virtual CPU thread reads its sleep duration and sleeps in
the path of the KVM_EXIT_DIRTY_RING_FULL exit handler; this is
ACCEPT PENALTY (4). Virtual CPUs running write-heavy processes fill their
dirty rings, take this exit path, and get penalized, whereas virtual CPUs
running read-mostly processes do not.

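The asymmetry between writers and readers can be sketched with a small model of ACCEPT PENALTY (4). The ``ThrottledVcpu`` class and its fields are hypothetical, the sleep is only accounted for rather than performed, and every write is assumed to touch a fresh page so that it consumes one dirty ring entry.

```python
class ThrottledVcpu:
    """Toy virtual CPU with a fixed-size dirty ring (illustrative only)."""

    def __init__(self, ring_size: int, sleep_us: int):
        self.ring_size = ring_size
        self.sleep_us = sleep_us   # value published by SET PENALTY (3)
        self.ring_used = 0         # entries currently in the dirty ring
        self.ring_full_exits = 0   # number of ring-full exits taken
        self.total_slept_us = 0    # accumulated penalty, instead of sleeping

    def guest_read(self) -> None:
        # Reads do not dirty pages, so the ring never fills up.
        pass

    def guest_write(self) -> None:
        # Assumption: each write dirties a new page and uses one ring entry.
        self.ring_used += 1
        if self.ring_used == self.ring_size:
            self._handle_ring_full()

    def _handle_ring_full(self) -> None:
        # ACCEPT PENALTY (4): read the sleep duration and "sleep".
        self.ring_full_exits += 1
        self.total_slept_us += self.sleep_us
        self.ring_used = 0         # the ring is harvested and reset


writer = ThrottledVcpu(ring_size=4, sleep_us=1000)
reader = ThrottledVcpu(ring_size=4, sleep_us=1000)
for _ in range(10):
    writer.guest_write()
    reader.guest_read()
```

After ten operations the writer has taken two ring-full exits and accumulated 2000 microseconds of penalty, while the reader has taken none, even though both were given the same sleep duration.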
In summary, thanks to the KVM dirty ring, the dirty limit algorithm
restricts virtual CPUs only as needed to keep their dirty page rate within
the limit. This leads to steadier read performance during live migration
and can improve the responsiveness of large guests.
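To try the mechanism, a management client has to issue the QMP command that starts the throttle thread. The sketch below only builds the JSON request and does not talk to a QEMU instance. It assumes that the wire-level command is named ``set-vcpu-dirty-limit`` and takes a ``dirty-rate`` argument in MB/s plus an optional ``cpu-index``; these names are not stated in this document, so check the QAPI schema of the QEMU version in use before relying on them. The helper name ``build_set_vcpu_dirty_limit`` is hypothetical.

```python
import json
from typing import Optional


def build_set_vcpu_dirty_limit(dirty_rate_mbps: int,
                               cpu_index: Optional[int] = None) -> str:
    """Build a QMP request that sets the dirty page rate upper limit.

    Assumed schema: "dirty-rate" is in MB/s; omitting "cpu-index"
    applies the limit to all virtual CPUs.
    """
    if dirty_rate_mbps < 0:
        raise ValueError("dirty rate must not be negative")
    arguments = {"dirty-rate": dirty_rate_mbps}
    if cpu_index is not None:
        arguments["cpu-index"] = cpu_index
    return json.dumps({"execute": "set-vcpu-dirty-limit",
                       "arguments": arguments})


# Limit virtual CPU 1 to 200 MB/s.
request = build_set_vcpu_dirty_limit(200, cpu_index=1)
```

The resulting string can be written to a QMP socket after the usual capabilities negotiation; that part is omitted here because it depends on how the monitor is exposed.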