Virtualsmp #41

udosteinberg · 2014-01-15T10:20:08Z

The following series of patches adds the necessary synchronization in the VMM to support multiple virtual CPUs on different physical cores. A description of the concepts and mechanisms can be found in Markus' thesis titled "Adding SMP Support to a User-Level VMM"

This is needed for: - luring the VCPU recall context into a blocking semaphore to pause its execution at end of migration. - unlocking the VCPU before boot to let it run into recall context and restoring it to exact state like on the source migration host.

This is needed for live migration. The general semantic works like this: - The caller will ask for some RW-mapped memory range - This range will be found in the host op - It will be remapped as read-only and reported back to the caller This routine uses a pointer which is moved round-robin through the guest memory range. WARNING: Assumes NOVA as underlying kernel. Was not ported to UNIX.

Added a CpuMessage to add arbitrary offsets to the VCPU's timestamp counter. This is needed for live migration.

Devices will be attached to this. The migration code uses this to to communicate with classes of devices. Devices can write their state into restore messages and also read it back to restore.

…eval. The live migration module needs this to tell the target host what kind of VMM has to be started.

…which was in use in the Vancouver project on NUL to do host app networking.

This is only compiled into the project and not in use, yet. The next commit will embedd these mechanisms into main.cc

…w live migration code.

…d backspace key press. As this migrates to a hard coded destination host, this could also be done more elegant: - By a VMCALL from the VM, carrying a magic number in the eax register and the destination host in the ebx register. - By some VM manager application, triggering the migration event via some IPC event. - By a fancy ncurses menu, prompting the user for the destination host IP.

ACPI events can be rised with this, fixed and GP events.

…ling code.

The restore procedure does automatically propagate its new position within the LAN.

… methods. This has been done in the Migration class constructor, but this was too early after reordering VMM parameters for live migration retrieval.

Users of the memory bus can now determine if they are working with actual guest-physmem.

From now on, only actual guest-physmem will be tracked.

It will now only check memory ranges which are actual guest-physmem.

In general the transfer has demonstrated to be errorfree. However, checksumming is useful to find out if changes on the tracking mechanism etc. provoke data corruption.

The last resend round did tend to be uncomplete in the scenario of both slow ethernet and large Writeable Working Sets.

This commit adds hooks to the DBus infrastructure to install a proxy between message senders and receivers. In addition to the ReceiveFunction, a similar EnqueueFunction is provided which gets called upon message sending. An I/O thread can then register enqueue callbacks for the respective message type and manage sending on the caller's behalf. If the callback returns false or is not present, everything works as before. Another callback (named "claim") can be used to configure messages to bypass the I/O thread and hence being sent by the issuing thread directly. All information needed to process the message identically as before is encoded into a new message type MessageIOThread. This information includes the send mode (FIFO, LIFO, early out, round-robin), if the message should be sent synchronously (i.e., the caller has to wait until the request is completed) and which vCPU was the parent of the bus, if applicable.

This commit adds a reference implementation for the unix frontend to show how an I/O thread could be implemented. Note that the global lock is still in place to allow for easily disabling the I/O thread. Performance may suffer, but the unix frontend is proof-of-concept only, anyway. The I/O thread can be disabled by commenting out the #define USE_IOTHREAD line in unix/main.cc Access to guest RAM is bypassing the I/O thread because it is synchronized by the operating system.

The vCPU threads are now pinned to consecutive physical cores, starting at the one following the original core that vancouver was started on. As a first simple solution, every physical core gets assigned a dedicated timeout object (i.e., a timer session). Later on, this could be restricted to the actual cores the instance runs on.

This commit ports the reference implementation of the I/O thread found in the unix frontend to NRE. It places the I/O thread worker on the CPU assigned to Vancouver, leaving the vCPUs on the following CPUs.

Sporadic event handlers should have higher priority. In the case of the I/O thread, this is important when it is colocated with another vCPU (which is not advised). For timers, this can help avoid timing issues when the VM does busy waiting on timer events.

Because the synchronization is now provided by the I/O thread, it is safe to remove the global lock.

To help modify vCPU and (LA)PIC subsystem to use atomic operations or fine-grained locking instead of a global lock, this synthetic testing utility can be used to stress the respective device models in an isolated way and run targeted development cycles with it.

Races in the emulation paths of the vCPU and the interrupt controller logic can cause problems when no external synchronization mechanism is applied. Using atomic instructions and relocation of certain code sections, it will now be possible to concurrently access vCPU, Lapic and PIC without the need for a lock around them.

The following devices are now configured to bypass an I/O thread: * vCPU memory and CpuMessage: Safe as of previous commit. * PM Timer: No need for synchronization. * VGA Framebuffer memory: No need for synchronization. * PCI Pass-through memory and IRQ: No need for synchronization, IRQ already safe.

blitz · 2014-01-15T11:53:25Z

Thanks!

For further information please look at 'src/app/audio_player/README'. Fixes TUD-OS#41.

Jacek Galowicz and others added 30 commits November 15, 2013 12:28

VCPU TSC offset manipulation.

c50d4bc

Added a CpuMessage to add arbitrary offsets to the VCPU's timestamp counter. This is needed for live migration.

Added a restore bus to the mainboard.

163db90

Devices will be attached to this. The migration code uses this to to communicate with classes of devices. Devices can write their state into restore messages and also read it back to restore.

Added restore code to TimeoutList.

f6ea4b6

Added restore code to LAPIC model.

656c247

Added restore code to PIC model.

a5a8b19

Added restore code to PIT model.

94df54b

Added restore code to VGA model.

391087d

Added a host op stub for the application's configuration string retri…

f22aab3

…eval. The live migration module needs this to tell the target host what kind of VMM has to be started.

Added class StopWatch to time.h.

edfb110

Added class IpHelper containing a skeleton of the socket abstraction …

223ee65

…which was in use in the Vancouver project on NUL to do host app networking.

Added the main live migration code.

8d21f2f

This is only compiled into the project and not in use, yet. The next commit will embedd these mechanisms into main.cc

This commit implements most of the code needed to actually use the ne…

b9a236d

…w live migration code.

Add an ACPI controller model.

154a76d

Added a new message type: MessageAcpiEvent.

57432ce

ACPI events can be rised with this, fixed and GP events.

Made PCI passthrough devices migratable by adding hot plug event hand…

8332c23

…ling code.

Made the NIC model migratable.

c8ecf06

The restore procedure does automatically propagate its new position within the LAN.

Move guest physical memory information retrieval into the listen/send…

5792f3d

… methods. This has been done in the Migration class constructor, but this was too early after reordering VMM parameters for live migration retrieval.

Added a new field "actual_physmem" to MessageMemRegion messages.

2d74868

Users of the memory bus can now determine if they are working with actual guest-physmem.

Rewrote the MessageHostOp OP_NEXT_DIRTY_REGION

2005dc2

From now on, only actual guest-physmem will be tracked.

Rewrote the checksumming routine.

bf5f6b3

It will now only check memory ranges which are actual guest-physmem.

Made checksumming optional with a preprocessor #define.

fcdea6f

In general the transfer has demonstrated to be errorfree. However, checksumming is useful to find out if changes on the tracking mechanism etc. provoke data corruption.

Fix live migration for larger and memory-aggressive VMs

81d7fd7

The last resend round did tend to be uncomplete in the scenario of both slow ethernet and large Writeable Working Sets.

Use __typeof__ for c++0x (NRE).

2fbc5a2

Ported I/O thread implementation.

68184e2

This commit ports the reference implementation of the I/O thread found in the unix frontend to NRE. It places the I/O thread worker on the CPU assigned to Vancouver, leaving the vCPUs on the following CPUs.

Markus Partheymueller added 5 commits January 15, 2014 11:12

Remove global lock.

2457cc0

Because the synchronization is now provided by the I/O thread, it is safe to remove the global lock.

alex-ab pushed a commit to alex-ab/seoul that referenced this pull request Oct 3, 2023

Add rudimentary audio_player based on libav

da743cc

For further information please look at 'src/app/audio_player/README'. Fixes TUD-OS#41.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Virtualsmp #41

Virtualsmp #41

udosteinberg commented Jan 15, 2014

blitz commented Jan 15, 2014

Virtualsmp #41

Are you sure you want to change the base?

Virtualsmp #41

Conversation

udosteinberg commented Jan 15, 2014

blitz commented Jan 15, 2014