Re: [RFC] Unify KVM kernel-space and user-space code into a singleproject

From: Avi Kivity
Date: Wed Mar 24 2010 - 00:58:37 EST


On 03/23/2010 08:21 PM, Joerg Roedel wrote:
On Tue, Mar 23, 2010 at 06:39:58PM +0200, Avi Kivity wrote:
On 03/23/2010 04:06 PM, Joerg Roedel wrote:
And this system wide entity is the kvm module. It creates instances of
'struct kvm' and destroys them. I see no problem if we just attach a
name to every instance with a good default value like kvm0, kvm1 ... or
guest0, guest1 ... User-space can override the name if it wants. The kvm
module takes care about the names being unique.

So, two users can't have a guest named MyGuest each? What about
namespace support? There's a lot of work in virtualizing all kernel
namespaces, you're adding to that.
This enumeration is a very small and non-intrusive feature. Making it
aware of namespaces is easy too.

It's easier (and safer and all the other boring bits) not to do it at all in the kernel.

What about notifications when guests are added or removed?
Who would be the consumer of such notifications? A 'perf kvm list' can
live without I guess. If we need them later we can still add them.

System-wide monitoring needs to work equally well for guests started before or after the monitor. Even disregarding that, if you introduce an API, people will start using it and complaining if it's incomplete.

The equivalent functionality for network interfaces is in netlink.

This is very much the same as network card numbering is implemented in
the kernel.
Forcing perf to talk to qemu or even libvirt produces to much overhead
imho. Instrumentation only produces useful results with low overhead.

It's a setup cost only.
My statement was not limited to enumeration, I should have been more
clear about that. The guest filesystem access-channel is another
affected part. The 'perf kvm top' command will access the guest
filesystem regularly and going over qemu would be more overhead here.

Why? Also, the real cost would be accessing the filesystem, not copying data over qemu.

Providing this in the KVM module directly also has the benefit that it
would work out-of-the-box with different userspaces too. Or do we want
to limit 'perf kvm' to the libvirt-qemu-kvm software stack?

Other userspaces can also provide this functionality, like they have to provide disk, network, and display emulation. The kernel is not a huge library.

Sidenote: I really think we should come to a conclusion about the
concept. KVM integration into perf is very useful feature to
analyze virtualization workloads.


Agreed.

--
Do not meddle in the internals of kernels, for they are subtle and quick to panic.

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/