Commits · 34f80cfad59ee587e374cbaf5f2a31d9f5015404 · linux / linux-davinci

10 Sep, 2009 40 commits

KVM: SVM: get rid of nested_svm_vmexit_real · 34f80cfa

Joerg Roedel authored Aug 07, 2009

This patch is the starting point of removing nested_svm_do from the
nested svm code. The nested_svm_do function basically maps two guest
physical pages to host virtual addresses and calls a passed function
on it. This function pointer code flow is hard to read and not the
best technical solution here.
As a side effect this patch indroduces the nested_svm_[un]map helper
functions.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

34f80cfa

KVM: SVM: simplify nested_svm_check_exception · 0295ad7d

Joerg Roedel authored Aug 07, 2009

Makes the code of this function more readable by removing on
indentation level for the core logic.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

0295ad7d

KVM: SVM: do nested vmexit in nested_svm_exit_handled · 9c4e40b9

Joerg Roedel authored Aug 07, 2009

If this function returns true a nested vmexit is required. Move that
vmexit into the nested_svm_exit_handled function. This also simplifies
the handling of nested #pf intercepts in this function.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

9c4e40b9

KVM: SVM: consolidate nested_svm_exit_handled · 4c2161ae

Joerg Roedel authored Aug 07, 2009

When caching guest intercepts there is no need anymore for the
nested_svm_exit_handled_real function. So move its code into
nested_svm_exit_handled.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

4c2161ae

KVM: SVM: cache nested intercepts · aad42c64

Joerg Roedel authored Aug 07, 2009

When the nested intercepts are cached we don't need to call
get_user_pages and/or map the nested vmcb on every nested #vmexit to
check who will handle the intercept.
Further this patch aligns the emulated svm behavior better to real
hardware.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

aad42c64

KVM: SVM: move nested svm state into seperate struct · e6aa9abd

Joerg Roedel authored Aug 07, 2009

This makes it more clear for which purpose these members in the vcpu_svm
exist.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

e6aa9abd

KVM: SVM: complete interrupts after handling nested exits · a5c3832d

Joerg Roedel authored Aug 07, 2009

The interrupt completion code must run after nested exits are handled
because not injected interrupts or exceptions may be handled by the l1
guest first.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

a5c3832d

KVM: SVM: copy only necessary parts of the control area on vmrun/vmexit · 0460a979

Joerg Roedel authored Aug 07, 2009

The vmcb control area contains more then 800 bytes of reserved fields
which are unnecessarily copied. Fix this by introducing a copy
function which only copies the relevant part and saves time.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

0460a979

KVM: SVM: optimize nested vmrun · defbba56

Joerg Roedel authored Aug 07, 2009

Only copy the necessary parts of the vmcb save area on vmrun and save
precious time.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

defbba56

KVM: SVM: optimize nested #vmexit · 33740e40

Joerg Roedel authored Aug 07, 2009

It is more efficient to copy only the relevant parts of the vmcb back to
the nested vmcb when we emulate an vmexit.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Acked-by: Alexander Graf <agraf@suse.de>
Signed-off-by: Avi Kivity <avi@redhat.com>

33740e40

KVM: SVM: add helper functions for global interrupt flag · 2af9194d

Joerg Roedel authored Aug 07, 2009

This patch makes the code easier to read when it comes to setting,
clearing and checking the status of the virtualized global
interrupt flag for the VCPU.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

2af9194d

x86: Export kmap_atomic_to_page() · 256cd2ef
Avi Kivity authored Aug 10, 2009
```
Needed by KVM.
Signed-off-by: Avi Kivity <avi@redhat.com>
```
256cd2ef

KVM: Replace pic_lock()/pic_unlock() with direct call to spinlock functions · 88ba63c2

Gleb Natapov authored Aug 04, 2009

They are not doing anything else now.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

88ba63c2

KVM: Call ack notifiers from PIC when guest OS acks an IRQ. · 938396a2

Gleb Natapov authored Aug 04, 2009

Currently they are called when irq vector is been delivered.  Calling ack
notifiers at this point is wrong.  Device assignment ack notifier enables
host interrupts, but guest not yet had a chance to clear interrupt
condition in a device.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

938396a2

KVM: Call kvm_vcpu_kick() inside pic spinlock · 956f97cf

Gleb Natapov authored Aug 04, 2009

d5ecfdd25 moved it out because back than it was impossible to
call it inside spinlock. This restriction no longer exists.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

956f97cf

KVM: fix EFER read buffer overflow · 3a34a881

Roel Kluin authored Aug 04, 2009

Check whether index is within bounds before grabbing the element.
Signed-off-by: Roel Kluin <roel.kluin@gmail.com>
Cc: Avi Kivity <avi@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Avi Kivity <avi@redhat.com>

3a34a881

KVM: ignore reads to perfctr msrs · 1f3ee616

Amit Shah authored Jun 30, 2009

We ignore writes to the perfctr msrs. Ignore reads as well.

Kaspersky antivirus crashes Windows guests if it can't read
these MSRs.
Signed-off-by: Amit Shah <amit.shah@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

1f3ee616

KVM: VMX: Optimize vmx_get_cpl() · eab4b8aa

Avi Kivity authored Aug 04, 2009

Instead of calling vmx_get_segment() (which reads a whole bunch of
vmcs fields), read only the cs selector which contains the cpl.
Signed-off-by: Avi Kivity <avi@redhat.com>

eab4b8aa

KVM: x86: Disallow hypercalls for guest callers in rings > 0 · 07708c4a

Jan Kiszka authored Aug 03, 2009

So far unprivileged guest callers running in ring 3 can issue, e.g., MMU
hypercalls. Normally, such callers cannot provide any hand-crafted MMU
command structure as it has to be passed by its physical address, but
they can still crash the guest kernel by passing random addresses.

To close the hole, this patch considers hypercalls valid only if issued
from guest ring 0. This may still be relaxed on a per-hypercall base in
the future once required.

Cc: stable@kernel.org
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

07708c4a

KVM: MMU: fix bogus alloc_mmu_pages assignment · b90c062c

Marcelo Tosatti authored Jul 28, 2009

Remove the bogus n_free_mmu_pages assignment from alloc_mmu_pages.

It breaks accounting of mmu pages, since n_free_mmu_pages is modified
but the real number of pages remains the same.

Cc: stable@kernel.org
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

b90c062c

KVM: MMU: make __kvm_mmu_free_some_pages handle empty list · 3b80fffe

Izik Eidus authored Jul 28, 2009

First check if the list is empty before attempting to look at list
entries.

Cc: stable@kernel.org
Signed-off-by: Izik Eidus <ieidus@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

3b80fffe

KVM: remove superfluous NULL pointer check in kvm_inject_pit_timer_irqs() · 95fb4eb6

Bartlomiej Zolnierkiewicz authored Jul 29, 2009

This takes care of the following entries from Dan's list:

arch/x86/kvm/i8254.c +714 kvm_inject_pit_timer_irqs(6) warning: variable derefenced in initializer 'vcpu'
arch/x86/kvm/i8254.c +714 kvm_inject_pit_timer_irqs(6) warning: variable derefenced before check 'vcpu'
Reported-by: Dan Carpenter <error27@gmail.com>
Cc: corbet@lwn.net
Cc: eteo@redhat.com
Cc: Julia Lawall <julia@diku.dk>
Signed-off-by: Bartlomiej Zolnierkiewicz <bzolnier@gmail.com>
Acked-by: Sheng Yang <sheng@linux.intel.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

95fb4eb6

KVM: report 1GB page support to userspace · 344f414f

Joerg Roedel authored Jul 27, 2009

If userspace knows that the kernel part supports 1GB pages it can enable
the corresponding cpuid bit so that guests actually use GB pages.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

344f414f

KVM: MMU: enable gbpages by increasing nr of pagesizes · 04326caa
Joerg Roedel authored Jul 27, 2009
```
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
```
04326caa

KVM: MMU: shadow support for 1gb pages · 7e4e4056

Joerg Roedel authored Jul 27, 2009

This patch adds support for shadow paging to the 1gb page table code in KVM.
With this code the guest can use 1gb pages even if the host does not support
them.

[ Marcelo: fix shadow page collision on pmd level if a guest 1gb page is mapped
           with 4kb ptes on host level ]
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

7e4e4056

KVM: MMU: make page walker aware of mapping levels · e04da980

Joerg Roedel authored Jul 27, 2009

The page walker may be used with nested paging too when accessing mmio
areas.  Make it support the additional page-level too.

[ Marcelo: fix reserved bit check for 1gb pte ]
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

e04da980

KVM: MMU: make direct mapping paths aware of mapping levels · 852e3c19
Joerg Roedel authored Jul 27, 2009
```
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>
```
852e3c19

KVM: MMU: rename is_largepage_backed to mapping_level · d25797b2

Joerg Roedel authored Jul 27, 2009

With the new name and the corresponding backend changes this function
can now support multiple hugepage sizes.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

d25797b2

KVM: MMU: make rmap code aware of mapping levels · 44ad9944

Joerg Roedel authored Jul 27, 2009

This patch removes the largepage parameter from the rmap_add function.
Together with rmap_remove this function now uses the role.level field to
find determine if the page is a huge page.
Signed-off-by: Joerg Roedel <joerg.roedel@amd.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

44ad9944

KVM: fix kvm_init() error handling · aed665f7

Xiao Guangrong authored Aug 03, 2009

Remove debugfs file if kvm_arch_init() return error
Signed-off-by: Xiao Guangrong <xiaoguangrong@cn.fujitsu.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

aed665f7

KVM: limit lapic periodic timer frequency · 1444885a

Marcelo Tosatti authored Jul 27, 2009

Otherwise its possible to starve the host by programming lapic timer
with a very high frequency.

Cc: stable@kernel.org
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
Signed-off-by: Avi Kivity <avi@redhat.com>

1444885a

KVM: Align cr8 threshold when userspace changes cr8 · 5f0269f5

Mikhail Ershov authored Aug 03, 2009

Commit f0a3602c20 ("KVM: Move interrupt injection logic to x86.c") does not
update the cr8 intercept if the lapic is disabled, so when userspace updates
cr8, the cr8 threshold control is not updated and we are left with illegal
control fields.

Fix by explicitly resetting the cr8 threshold.
Signed-off-by: Avi Kivity <avi@redhat.com>

5f0269f5

KVM: VMX: Avoid to return ENOTSUPP to userland · 7f582ab6

Jan Kiszka authored Jul 22, 2009

Choose some allowed error values for the cases VMX returned ENOTSUPP so
far as these values could be returned by the KVM_RUN IOCTL.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

7f582ab6

KVM: Drop obsolete cpu_get/put in make_all_cpus_request · e601e3be

Jan Kiszka authored Jul 20, 2009

spin_lock disables preemption, so we can simply read the current cpu.
Signed-off-by: Jan Kiszka <jan.kiszka@siemens.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

e601e3be

KVM: PIT: Unregister ack notifier callback when freeing · 84fde248
Gleb Natapov authored Jul 16, 2009
```
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>
```
84fde248

KVM: VMX: Introduce KVM_SET_IDENTITY_MAP_ADDR ioctl · b927a3ce

Sheng Yang authored Jul 21, 2009

Now KVM allow guest to modify guest's physical address of EPT's identity mapping page.

(change from v1, discard unnecessary check, change ioctl to accept parameter
address rather than value)
Signed-off-by: Sheng Yang <sheng@linux.intel.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

b927a3ce

KVM: x86: use kvm_get_gdt() and kvm_read_ldt() · b792c344

Akinobu Mita authored Jul 19, 2009

Use kvm_get_gdt() and kvm_read_ldt() to reduce inline assembly code.

Cc: Avi Kivity <avi@redhat.com>
Cc: kvm@vger.kernel.org
Signed-off-by: Akinobu Mita <akinobu.mita@gmail.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

b792c344

KVM: x86: use get_desc_base() and get_desc_limit() · 46a359e7

Akinobu Mita authored Jul 18, 2009

Use get_desc_base() and get_desc_limit() to get the base address and
limit in desc_struct.

Cc: Avi Kivity <avi@redhat.com>
Cc: kvm@vger.kernel.org
Signed-off-by: Akinobu Mita <akinobu.mita@gmail.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

46a359e7

KVM: s390: remove unused structs · decde80b

Gleb Natapov authored Jul 12, 2009

They are not used by common code without defines which s390 does not
have.
Signed-off-by: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

decde80b

KVM: MMU: fix missing locking in alloc_mmu_pages · 6a1ac771

Marcelo Tosatti authored Jul 15, 2009

n_requested_mmu_pages/n_free_mmu_pages are used by
kvm_mmu_change_mmu_pages to calculate the number of pages to zap.

alloc_mmu_pages, called from the vcpu initialization path, modifies this
variables without proper locking, which can result in a negative value
in kvm_mmu_change_mmu_pages (say, with cpu hotplug).
Signed-off-by: Marcelo Tosatti <mtosatti@redhat.com>

6a1ac771