
* Update to QEMU v9.0.0 --------- Signed-off-by: Peter Maydell <peter.maydell@linaro.org> Signed-off-by: Fabiano Rosas <farosas@suse.de> Signed-off-by: Peter Xu <peterx@redhat.com> Signed-off-by: Thomas Huth <thuth@redhat.com> Signed-off-by: Cédric Le Goater <clg@redhat.com> Signed-off-by: Zheyu Ma <zheyuma97@gmail.com> Signed-off-by: Ido Plat <ido.plat@ibm.com> Signed-off-by: Ilya Leoshkevich <iii@linux.ibm.com> Signed-off-by: Markus Armbruster <armbru@redhat.com> Signed-off-by: Marc-André Lureau <marcandre.lureau@redhat.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com> Signed-off-by: Vladimir Sementsov-Ogievskiy <vsementsov@yandex-team.ru> Signed-off-by: David Hildenbrand <david@redhat.com> Signed-off-by: Kevin Wolf <kwolf@redhat.com> Signed-off-by: Thomas Lamprecht <t.lamprecht@proxmox.com> Signed-off-by: Fiona Ebner <f.ebner@proxmox.com> Signed-off-by: Gregory Price <gregory.price@memverge.com> Signed-off-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> Signed-off-by: Philippe Mathieu-Daudé <philmd@linaro.org> Signed-off-by: Lorenz Brun <lorenz@brun.one> Signed-off-by: Yao Xingtao <yaoxt.fnst@fujitsu.com> Signed-off-by: Arnaud Minier <arnaud.minier@telecom-paris.fr> Signed-off-by: Inès Varhol <ines.varhol@telecom-paris.fr> Signed-off-by: BALATON Zoltan <balaton@eik.bme.hu> Signed-off-by: Igor Mammedov <imammedo@redhat.com> Signed-off-by: Akihiko Odaki <akihiko.odaki@daynix.com> Signed-off-by: Richard Henderson <richard.henderson@linaro.org> Signed-off-by: Sven Schnelle <svens@stackframe.org> Signed-off-by: Daniel Henrique Barboza <dbarboza@ventanamicro.com> Signed-off-by: Christian Schoenebeck <qemu_oss@crudebyte.com> Signed-off-by: Jason Wang <jasowang@redhat.com> Signed-off-by: Helge Deller <deller@gmx.de> Signed-off-by: Nicholas Piggin <npiggin@gmail.com> Signed-off-by: Benjamin Gray <bgray@linux.ibm.com> Signed-off-by: Avihai Horon <avihaih@nvidia.com> Signed-off-by: Michael Tokarev <mjt@tls.msk.ru> Signed-off-by: Joonas Kankaala <joonas.a.kankaala@gmail.com> Signed-off-by: Marcin Juszkiewicz <marcin.juszkiewicz@linaro.org> Signed-off-by: Stefan Weil <sw@weilnetz.de> Signed-off-by: Zhao Liu <zhao1.liu@intel.com> Signed-off-by: Glenn Miles <milesg@linux.ibm.com> Signed-off-by: Oleg Sviridov <oleg.sviridov@red-soft.ru> Signed-off-by: Artem Chernyshev <artem.chernyshev@red-soft.ru> Signed-off-by: Yajun Wu <yajunw@nvidia.com> Signed-off-by: Stefan Hajnoczi <stefanha@redhat.com> Signed-off-by: Mark Cave-Ayland <mark.cave-ayland@ilande.co.uk> Signed-off-by: Pierre-Clément Tosi <ptosi@google.com> Signed-off-by: Lei Wang <lei4.wang@intel.com> Signed-off-by: Wei Wang <wei.w.wang@intel.com> Signed-off-by: Martin Hundebøll <martin@geanix.com> Signed-off-by: Michael S. Tsirkin <mst@redhat.com> Signed-off-by: Manos Pitsidianakis <manos.pitsidianakis@linaro.org> Signed-off-by: Wafer <wafer@jaguarmicro.com> Signed-off-by: Yuxue Liu <yuxue.liu@jaguarmicro.com> Signed-off-by: Gerd Hoffmann <kraxel@redhat.com> Signed-off-by: Nguyen Dinh Phi <phind.uet@gmail.com> Signed-off-by: Zack Buhman <zack@buhman.org> Signed-off-by: Keith Packard <keithp@keithp.com> Signed-off-by: Yuquan Wang wangyuquan1236@phytium.com.cn Signed-off-by: Matheus Tavares Bernardino <quic_mathbern@quicinc.com> Signed-off-by: Cindy Lu <lulu@redhat.com> Co-authored-by: Peter Maydell <peter.maydell@linaro.org> Co-authored-by: Fabiano Rosas <farosas@suse.de> Co-authored-by: Peter Xu <peterx@redhat.com> Co-authored-by: Thomas Huth <thuth@redhat.com> Co-authored-by: Cédric Le Goater <clg@redhat.com> Co-authored-by: Zheyu Ma <zheyuma97@gmail.com> Co-authored-by: Ido Plat <ido.plat@ibm.com> Co-authored-by: Ilya Leoshkevich <iii@linux.ibm.com> Co-authored-by: Markus Armbruster <armbru@redhat.com> Co-authored-by: Marc-André Lureau <marcandre.lureau@redhat.com> Co-authored-by: Paolo Bonzini <pbonzini@redhat.com> Co-authored-by: Vladimir Sementsov-Ogievskiy <vsementsov@yandex-team.ru> Co-authored-by: David Hildenbrand <david@redhat.com> Co-authored-by: Kevin Wolf <kwolf@redhat.com> Co-authored-by: Stefan Reiter <s.reiter@proxmox.com> Co-authored-by: Fiona Ebner <f.ebner@proxmox.com> Co-authored-by: Gregory Price <gregory.price@memverge.com> Co-authored-by: Lorenz Brun <lorenz@brun.one> Co-authored-by: Yao Xingtao <yaoxt.fnst@fujitsu.com> Co-authored-by: Philippe Mathieu-Daudé <philmd@linaro.org> Co-authored-by: Arnaud Minier <arnaud.minier@telecom-paris.fr> Co-authored-by: BALATON Zoltan <balaton@eik.bme.hu> Co-authored-by: Igor Mammedov <imammedo@redhat.com> Co-authored-by: Akihiko Odaki <akihiko.odaki@daynix.com> Co-authored-by: Richard Henderson <richard.henderson@linaro.org> Co-authored-by: Sven Schnelle <svens@stackframe.org> Co-authored-by: Daniel Henrique Barboza <dbarboza@ventanamicro.com> Co-authored-by: Helge Deller <deller@kernel.org> Co-authored-by: Harsh Prateek Bora <harshpb@linux.ibm.com> Co-authored-by: Benjamin Gray <bgray@linux.ibm.com> Co-authored-by: Nicholas Piggin <npiggin@gmail.com> Co-authored-by: Avihai Horon <avihaih@nvidia.com> Co-authored-by: Michael Tokarev <mjt@tls.msk.ru> Co-authored-by: Joonas Kankaala <joonas.a.kankaala@gmail.com> Co-authored-by: Marcin Juszkiewicz <marcin.juszkiewicz@linaro.org> Co-authored-by: Stefan Weil <sw@weilnetz.de> Co-authored-by: Dayu Liu <liu.dayu@zte.com.cn> Co-authored-by: Zhao Liu <zhao1.liu@intel.com> Co-authored-by: Glenn Miles <milesg@linux.vnet.ibm.com> Co-authored-by: Artem Chernyshev <artem.chernyshev@red-soft.ru> Co-authored-by: Yajun Wu <yajunw@nvidia.com> Co-authored-by: Mark Cave-Ayland <mark.cave-ayland@ilande.co.uk> Co-authored-by: Pierre-Clément Tosi <ptosi@google.com> Co-authored-by: Wei Wang <wei.w.wang@intel.com> Co-authored-by: Martin Hundebøll <martin@geanix.com> Co-authored-by: Michael S. Tsirkin <mst@redhat.com> Co-authored-by: Manos Pitsidianakis <manos.pitsidianakis@linaro.org> Co-authored-by: Wafer <wafer@jaguarmicro.com> Co-authored-by: lyx634449800 <yuxue.liu@jaguarmicro.com> Co-authored-by: Gerd Hoffmann <kraxel@redhat.com> Co-authored-by: Nguyen Dinh Phi <phind.uet@gmail.com> Co-authored-by: Zack Buhman <zack@buhman.org> Co-authored-by: Keith Packard <keithp@keithp.com> Co-authored-by: Yuquan Wang <wangyuquan1236@phytium.com.cn> Co-authored-by: Matheus Tavares Bernardino <quic_mathbern@quicinc.com> Co-authored-by: Cindy Lu <lulu@redhat.com>
139 lines
5.8 KiB
ReStructuredText
139 lines
5.8 KiB
ReStructuredText
Mapped-ram
|
|
==========
|
|
|
|
Mapped-ram is a new stream format for the RAM section designed to
|
|
supplement the existing ``file:`` migration and make it compatible
|
|
with ``multifd``. This enables parallel migration of a guest's RAM to
|
|
a file.
|
|
|
|
The core of the feature is to ensure that RAM pages are mapped
|
|
directly to offsets in the resulting migration file. This enables the
|
|
``multifd`` threads to write exclusively to those offsets even if the
|
|
guest is constantly dirtying pages (i.e. live migration). Another
|
|
benefit is that the resulting file will have a bounded size, since
|
|
pages which are dirtied multiple times will always go to a fixed
|
|
location in the file, rather than constantly being added to a
|
|
sequential stream. Having the pages at fixed offsets also allows the
|
|
usage of O_DIRECT for save/restore of the migration stream as the
|
|
pages are ensured to be written respecting O_DIRECT alignment
|
|
restrictions (direct-io support not yet implemented).
|
|
|
|
Usage
|
|
-----
|
|
|
|
On both source and destination, enable the ``multifd`` and
|
|
``mapped-ram`` capabilities:
|
|
|
|
``migrate_set_capability multifd on``
|
|
|
|
``migrate_set_capability mapped-ram on``
|
|
|
|
Use a ``file:`` URL for migration:
|
|
|
|
``migrate file:/path/to/migration/file``
|
|
|
|
Mapped-ram migration is best done non-live, i.e. by stopping the VM on
|
|
the source side before migrating.
|
|
|
|
Use-cases
|
|
---------
|
|
|
|
The mapped-ram feature was designed for use cases where the migration
|
|
stream will be directed to a file in the filesystem and not
|
|
immediately restored on the destination VM [#]_. These could be
|
|
thought of as snapshots. We can further categorize them into live and
|
|
non-live.
|
|
|
|
- Non-live snapshot
|
|
|
|
If the use case requires a VM to be stopped before taking a snapshot,
|
|
that's the ideal scenario for mapped-ram migration. Not having to
|
|
track dirty pages, the migration will write the RAM pages to the disk
|
|
as fast as it can.
|
|
|
|
Note: if a snapshot is taken of a running VM, but the VM will be
|
|
stopped after the snapshot by the admin, then consider stopping it
|
|
right before the snapshot to take benefit of the performance gains
|
|
mentioned above.
|
|
|
|
- Live snapshot
|
|
|
|
If the use case requires that the VM keeps running during and after
|
|
the snapshot operation, then mapped-ram migration can still be used,
|
|
but will be less performant. Other strategies such as
|
|
background-snapshot should be evaluated as well. One benefit of
|
|
mapped-ram in this scenario is portability since background-snapshot
|
|
depends on async dirty tracking (KVM_GET_DIRTY_LOG) which is not
|
|
supported outside of Linux.
|
|
|
|
.. [#] While this same effect could be obtained with the usage of
|
|
snapshots or the ``file:`` migration alone, mapped-ram provides
|
|
a performance increase for VMs with larger RAM sizes (10s to
|
|
100s of GiBs), specially if the VM has been stopped beforehand.
|
|
|
|
RAM section format
|
|
------------------
|
|
|
|
Instead of having a sequential stream of pages that follow the
|
|
RAMBlock headers, the dirty pages for a RAMBlock follow its header
|
|
instead. This ensures that each RAM page has a fixed offset in the
|
|
resulting migration file.
|
|
|
|
A bitmap is introduced to track which pages have been written in the
|
|
migration file. Pages are written at a fixed location for every
|
|
ramblock. Zero pages are ignored as they'd be zero in the destination
|
|
migration as well.
|
|
|
|
::
|
|
|
|
Without mapped-ram: With mapped-ram:
|
|
|
|
--------------------- --------------------------------
|
|
| ramblock 1 header | | ramblock 1 header |
|
|
--------------------- --------------------------------
|
|
| ramblock 2 header | | ramblock 1 mapped-ram header |
|
|
--------------------- --------------------------------
|
|
| ... | | padding to next 1MB boundary |
|
|
--------------------- | ... |
|
|
| ramblock n header | --------------------------------
|
|
--------------------- | ramblock 1 pages |
|
|
| RAM_SAVE_FLAG_EOS | | ... |
|
|
--------------------- --------------------------------
|
|
| stream of pages | | ramblock 2 header |
|
|
| (iter 1) | --------------------------------
|
|
| ... | | ramblock 2 mapped-ram header |
|
|
--------------------- --------------------------------
|
|
| RAM_SAVE_FLAG_EOS | | padding to next 1MB boundary |
|
|
--------------------- | ... |
|
|
| stream of pages | --------------------------------
|
|
| (iter 2) | | ramblock 2 pages |
|
|
| ... | | ... |
|
|
--------------------- --------------------------------
|
|
| ... | | ... |
|
|
--------------------- --------------------------------
|
|
| RAM_SAVE_FLAG_EOS |
|
|
--------------------------------
|
|
| ... |
|
|
--------------------------------
|
|
|
|
where:
|
|
- ramblock header: the generic information for a ramblock, such as
|
|
idstr, used_len, etc.
|
|
|
|
- ramblock mapped-ram header: the information added by this feature:
|
|
bitmap of pages written, bitmap size and offset of pages in the
|
|
migration file.
|
|
|
|
Restrictions
|
|
------------
|
|
|
|
Since pages are written to their relative offsets and out of order
|
|
(due to the memory dirtying patterns), streaming channels such as
|
|
sockets are not supported. A seekable channel such as a file is
|
|
required. This can be verified in the QIOChannel by the presence of
|
|
the QIO_CHANNEL_FEATURE_SEEKABLE.
|
|
|
|
The improvements brought by this feature apply only to guest physical
|
|
RAM. Other types of memory such as VRAM are migrated as part of device
|
|
states.
|