Commits · 756a6c68556600aec9460346332884d891d5beb4 · linux / linux-davinci

17 Apr, 2008 40 commits

x86: ioremap of 64-bit resource on 32-bit kernel fix · 756a6c68
Ingo Molnar authored Mar 25, 2008
```
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
756a6c68

x86: move ipi definitions to mach_ipi.h · 5af5573e

Glauber Costa authored Mar 25, 2008

take them out of the x86_64-only asm/mach_apic.h
Signed-off-by: Glauber Costa <gcosta@redhat.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

5af5573e

x86: split large page mapping for AMD TSEG · 8346ea17

Andi Kleen authored Mar 12, 2008

On AMD, SMM-protected memory is part of the address map but is handled
internally like an MTRR. That leads to large pages getting split
internally, which has some performance implications. Check for the
AMD TSEG MSR and split the large page mapping on that area
explicitly if it is part of the direct mapping.

There is also SMM ASEG, but it is in the first 1MB and already covered by
the earlier split first page patch.

Idea for this came from an earlier patch by Andreas Herrmann

On a RevF dual Socket Opteron system kernbench shows a clear
improvement from this:
(together with the earlier patches in this series, especially the
split first 2MB patch)

[lower is better]
              no split stddev         split  stddev    delta
Elapsed Time   87.146 (0.727516)     84.296 (1.09098)  -3.2%
User Time     274.537 (4.05226)     273.692 (3.34344)  -0.3%
System Time    34.907 (0.42492)      34.508 (0.26832)  -1.1%
Percent CPU   322.5   (38.3007)     326.5   (44.5128)  +1.2%

=> About 3.2% improvement in elapsed time for kernbench.

With GB pages on AMD Fam10h the impact of splitting is much higher of course,
since it would split two full GB pages (together with the first
1MB split patch) instead of two 2MB pages.  I could not benchmark
a clear difference in kernbench on gbpages, so I kept it disabled
for that case.

That was only limited benchmarking of course, so if someone
is interested in running more tests for the gbpages case,
that could be revisited (contributions welcome).

I didn't bother implementing this for 32bit because it is very
unlikely the 32bit lowmem mapping overlaps into the TSEG near 4GB
and the 2MB low split is already handled for both.

[ mingo@elte.hu: do it on gbpages kernels too, there's no clear reason
                 why it shouldn't help there. ]
Signed-off-by: Andi Kleen <ak@suse.de>
Acked-by: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

8346ea17
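
The TSEG check described in this commit can be illustrated with a minimal user-space sketch. It only models the address arithmetic: finding the 2MB large page that contains the TSEG base and deciding whether that address falls inside the direct mapping. The helper names `large_page_base()` and `tseg_in_direct_mapping()` are hypothetical, the shift constants assume 4KB base pages and 2MB large pages, and the real patch reads the TSEG base from an MSR and calls set_memory_4k(), neither of which is modelled here.

```c
#include <assert.h>
#include <stdint.h>

#define PAGE_SHIFT 12u   /* assumption: 4KB base pages */
#define PMD_SHIFT  21u   /* assumption: 2MB large pages */

/* Hypothetical helper: base address of the 2MB page containing addr. */
static uint64_t large_page_base(uint64_t addr)
{
    return addr & ~((1ull << PMD_SHIFT) - 1);
}

/*
 * Hypothetical helper: returns 1 if the physical address tseg_addr lies
 * below the end of the direct mapping, given as a page frame number.
 */
static int tseg_in_direct_mapping(uint64_t tseg_addr, uint64_t max_pfn_mapped)
{
    uint64_t tseg_pfn = tseg_addr >> PAGE_SHIFT;

    return tseg_pfn < max_pfn_mapped;
}
```

Only when the second helper returns 1 would there be a large page mapping over the TSEG area to split.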

x86: re-add rdmsrl_safe · 1de87bd4

Andi Kleen authored Mar 22, 2008

RDMSR for 64bit values with exception handling.

Makes it easier to deal with 64bit valued MSRs. The old 64bit code
base had that too as checking_rdmsrl(), but it got dropped somehow.
Signed-off-by: Andi Kleen <andi@firstfloor.org>
Cc: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

1de87bd4
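
As a rough illustration of what a 64-bit "safe" MSR read does, the sketch below combines the two 32-bit halves of a value and propagates an error instead of faulting. It runs in user space: `fake_rdmsr_safe()` is a made-up stand-in for the exception-handling read, the MSR number and values are invented, and `rdmsrl_safe_sketch()` is not the kernel's actual macro. Leaving the output untouched on failure is a choice made for this sketch only.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical stand-in for a fault-tolerant 32-bit-pair MSR read.
 * In this toy model only MSR 0x1234 exists; any other number "faults".
 */
static int fake_rdmsr_safe(uint32_t msr, uint32_t *lo, uint32_t *hi)
{
    if (msr != 0x1234u)
        return -1;
    *lo = 0xdeadbeefu;
    *hi = 0x00000001u;
    return 0;
}

/* Sketch: read a 64-bit MSR value, returning nonzero on failure. */
static int rdmsrl_safe_sketch(uint32_t msr, uint64_t *val)
{
    uint32_t lo = 0, hi = 0;
    int err;

    assert(val != NULL);
    err = fake_rdmsr_safe(msr, &lo, &hi);
    if (err)
        return err;               /* *val is left untouched on failure */
    *val = ((uint64_t)hi << 32) | lo;
    return 0;
}
```

A caller checks the return value before trusting the result, which is what makes probing for optional MSRs convenient.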

x86: don't use large pages to map the first 2/4MB of memory · f5c24a7f

Andi Kleen authored Mar 12, 2008

Intel recommends not using large pages for the first 1MB
of physical memory because there are fixed-size MTRRs there
which cause splitups in the TLBs.

On AMD doing so is also a good idea.

The implementation is a little different between 32bit and 64bit.
On 32bit I just taught the initial page table setup about this
because it was very simple to do. This also minimizes the risk
of a prefetch ever seeing the large page, even if it only
exists for a short time.

On 64bit that is not quite possible, so use set_memory_4k() a little
later (in check_bugs) instead.
Signed-off-by: Andi Kleen <ak@suse.de>
Acked-by: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

f5c24a7f

x86: add set_memory_4k to pageattr.c · c9caa02c

Andi Kleen authored Mar 12, 2008

Add a new function to force split large pages into 4k pages.
This is needed for some followup optimizations.

I had to add a new field to cpa_data to pass down the information
that try_preserve_large_page should not run.

There is no set_page_4k() for now because I didn't need it and all the
specialized users I have in mind would be more comfortable with
pure addresses. I also didn't export it because it's unlikely
that external code needs it.
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

c9caa02c

x86: account overlapped mappings in max_pfn_mapped · cc615032

Andi Kleen authored Mar 12, 2008

When end_pfn is not aligned to 2MB (or 1GB) then the kernel might
map more memory than end_pfn. Account this in max_pfn_mapped.
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

cc615032
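
The rounding effect this commit accounts for can be shown with a small arithmetic sketch. It assumes 4KB base pages and 2MB large pages and that the range starting at pfn 0 is covered entirely with large pages. The helper name `mapped_end_pfn()` is hypothetical and is not the kernel's code.

```c
#include <assert.h>
#include <stdint.h>

#define PAGE_SHIFT 12u   /* assumption: 4KB base pages */
#define PMD_SHIFT  21u   /* assumption: 2MB large pages */
#define PFNS_PER_LARGE_PAGE (1ull << (PMD_SHIFT - PAGE_SHIFT))

/*
 * Hypothetical helper: first pfn past the area that is actually mapped
 * when [0, end_pfn) is covered with whole 2MB pages.
 */
static uint64_t mapped_end_pfn(uint64_t end_pfn)
{
    uint64_t mask = PFNS_PER_LARGE_PAGE - 1;
    uint64_t rounded = (end_pfn + mask) & ~mask;

    assert(rounded >= end_pfn);   /* mapping never shrinks */
    return rounded;
}
```

With an unaligned end_pfn the result is larger than end_pfn, and that difference is the extra memory the message says must be recorded in max_pfn_mapped.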

x86: replace the now useless max_pfn_mapped define · 67794292
Thomas Gleixner authored Mar 21, 2008
```
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
67794292

x86: implement true end_pfn_mapped for 32bit · 7d1116a9

Andi Kleen authored Mar 12, 2008

Even on 32bit 2MB pages can map more memory than is in the true
max_low_pfn if end_pfn is not highmem and not aligned to 2MB.
Add an end_pfn_map similar to x86-64 that accounts for this
fact. This is important for code that really needs to know about
all mapping aliases.
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: andreas.herrmann3@amd.com
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

7d1116a9

x86: move early exception handlers into init.text · 41bd4eac

Andi Kleen authored Mar 11, 2008

Currently they are in .text.head because the rest of head_64.S is.
.text.head is not removed as init data, but the early exception handlers
should be because they are not needed after early boot of the boot processor.
So move them over.
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

41bd4eac

x86: replace early exception setup macro recursion with loop · 749c970a

Andi Kleen authored Mar 11, 2008

The early exception handlers are currently set up using a macro
recursion. There is only one user left. Replace the macro with a
standard loop in place.

Noop patch, just a cleanup.

[ tglx@linutronix.de: simplified ]
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

749c970a

x86: don't set up early exception handlers for external interrupts · 5524ea32

Andi Kleen authored Mar 11, 2008

All of early setup runs with interrupts disabled, so there is no
need to set up early exception handlers for vectors >= 32.

This saves some minor text size.
Signed-off-by: Andi Kleen <ak@suse.de>
Cc: mingo@elte.hu
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

5524ea32
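
A plain loop over only the exception vectors, as this commit and the preceding loop cleanup describe, might look like the following user-space sketch. The table, the stub handler and all names here are hypothetical. The real code is assembly in head_64.S and writes IDT gate descriptors rather than function pointers.

```c
#include <assert.h>
#include <stddef.h>

#define NUM_EXCEPTION_VECTORS 32    /* vectors 0..31 are CPU exceptions */
#define IDT_ENTRIES           256

typedef void (*handler_fn)(void);

/* Hypothetical stub standing in for an early exception handler. */
static void early_exception_stub(void)
{
}

/* Toy model of the IDT: one handler pointer per vector. */
static handler_fn early_idt[IDT_ENTRIES];

/* Install handlers for exception vectors only; returns how many were set. */
static int setup_early_handlers(void)
{
    int i;

    for (i = 0; i < NUM_EXCEPTION_VECTORS; i++)
        early_idt[i] = early_exception_stub;
    return i;
}
```

Entries 32 and above stay empty, which is acceptable while interrupts remain disabled.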

x86: relocate_kernel - use predefined macroses for page attributes · 366932de
gorcunov@gmail.com authored Mar 23, 2008
```
Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
366932de
x86: relocate_kernel - use predefined macroses for processor state · fd3af531
gorcunov@gmail.com authored Mar 23, 2008
```
Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
fd3af531
x86: relocate_kernel - use PAGE_SIZE instead of numeric constant · a7bba17b
gorcunov@gmail.com authored Mar 23, 2008
```
Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
a7bba17b
x86: relocate_kernel_32.S - clear register in more elegant way · 4039ae53
gorcunov@gmail.com authored Mar 23, 2008
```
Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
4039ae53

x86: fix test_poke for vmalloced pages · 15a601eb

Mathieu Desnoyers authored Mar 12, 2008

* Ingo Molnar (mingo@elte.hu) wrote:
>
> * Mathieu Desnoyers <mathieu.desnoyers@polymtl.ca> wrote:
>
> > The shadow vmap for DEBUG_RODATA kernel text modification uses
> > virt_to_page to get the pages from the pointer address.
> >
> > However, I think vmalloc_to_page would be required in case the page is
> > used for modules.
> >
> > Since only the core kernel text is marked read-only, use
> > kernel_text_address() to make sure we only shadow map the core kernel
> > text, not modules.
>
> actually, i think we should mark module text readonly too.
>

Yes, but in the meantime, the x86 tree would need this patch to make
kprobes work correctly on modules.

I suspect that without this fix, with the enhanced hotplug and kprobes
patch, kprobes will use text_poke to insert breakpoints in modules
(vmalloced pages used), which will map the wrong pages and corrupt
random kernel locations instead of updating the correct page.

Work that would write-protect the module pages should clearly be done,
but it can come at a later time. We have to make sure we interact
correctly with the page allocation debugging, for example.

Here is the patch against x86.git 2.6.25-rc5 :

The shadow vmap for DEBUG_RODATA kernel text modification uses virt_to_page to
get the pages from the pointer address.

However, I think vmalloc_to_page would be required in case the page is used for
modules.

Since only the core kernel text is marked read-only, use kernel_text_address()
to make sure we only shadow map the core kernel text, not modules.
Signed-off-by: Mathieu Desnoyers <mathieu.desnoyers@polymtl.ca>
CC: akpm@linux-foundation.org
Signed-off-by: Ingo Molnar <mingo@elte.hu>

15a601eb

x86: clean up vSMP detection · e5699a82

Ravikiran G Thirumalai authored Mar 24, 2008

vSMP detection: access PCI config space early in boot to detect if the
system is a vSMPowered box, and cache the result in a flag, so that
is_vsmp_box() always returns the value of the flag.
Signed-off-by: Ravikiran Thirumalai <kiran@scalex86.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

e5699a82
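
The detect-once, query-many pattern described here can be sketched in user space as follows. `probe_platform()` is a made-up placeholder for the early PCI config space access, and the `_sketch` functions are illustrative names, not the kernel's implementation.

```c
#include <assert.h>

static int probe_calls;    /* counts how often the expensive probe ran */
static int is_vsmp = -1;   /* -1 means detection has not run yet */

/* Hypothetical placeholder for the early PCI config space probe. */
static int probe_platform(void)
{
    probe_calls++;
    return 1;              /* pretend this is a vSMP box */
}

/* Run once early in boot and cache the result in the flag. */
static void detect_vsmp_box_sketch(void)
{
    is_vsmp = probe_platform();
}

/* Later queries only read the cached flag. */
static int is_vsmp_box_sketch(void)
{
    assert(is_vsmp != -1); /* detection must have run first */
    return is_vsmp;
}
```

Repeated queries then cost a single load instead of a config space access.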

x86: pgtable, document pde bits · 43cdf5d6

Jiri Slaby authored Mar 22, 2008

Some of the pde bits weren't documented; add a short description to them.
Signed-off-by: Jiri Slaby <jirislaby@gmail.com>
Cc: H. Peter Anvin <hpa@zytor.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

43cdf5d6

x86: spinlock ops are always-inlined · 7fda20f1
Ingo Molnar authored Feb 29, 2008
```
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
7fda20f1

x86: only enable interrupts when kernel state has been set up · d93c870b

Jeremy Fitzhardinge authored Mar 24, 2008

The sysenter path tries to enable interrupts immediately.  Unfortunately
this doesn't work in a paravirt environment, because not enough kernel
state has been set up at that point (namely, pointing %fs to the kernel
percpu data segment).  To fix this, defer ENABLE_INTERRUPTS until after
the kernel state has been set up.

Unfortunately this means that we're running with interrupts disabled
for a while without calling the IRQ tracing code, but that can't be
called without setting up %fs either.
Signed-off-by: Jeremy Fitzhardinge <jeremy.fitzhardinge@citrix.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>

d93c870b

include/asm-x86/xor_64.h: checkpatch cleanups - formatting only · 687c8054
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
687c8054
include/asm-x86/xor_32.h: checkpatch cleanups - formatting only · 8fdf7655
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
8fdf7655
include/asm-x86/voyager.h: checkpatch cleanups - formatting only · d6ae390a
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
d6ae390a
include/asm-x86/vmi.h: checkpatch cleanups - formatting only · 8948584e
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
8948584e
include/asm-x86/vm86.h: checkpatch cleanups - formatting only · 9e8a935b
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
9e8a935b
include/asm-x86/vga.h: checkpatch cleanups - formatting only · 364fe5ef
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
364fe5ef
include/asm-x86/vdso.h: checkpatch cleanups - formatting only · ac1a7b0e
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
ac1a7b0e
include/asm-x86/user_64.h: checkpatch cleanups - formatting only · a206ea11
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
a206ea11
include/asm-x86/user32.h: checkpatch cleanups - formatting only · a3121619
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
a3121619
include/asm-x86/user_32.h: checkpatch cleanups - formatting only · 826700dc
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
826700dc
include/asm-x86/unistd_64.h: checkpatch cleanups - formatting only · c489f445
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
c489f445
include/asm-x86/unistd_32.h: checkpatch cleanups - formatting only · 687fc16b
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
687fc16b
include/asm-x86/unaligned.h: checkpatch cleanups - formatting only · 6e714b37
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
6e714b37
include/asm-x86/uaccess_64.h: checkpatch cleanups - formatting only · b896313e
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
b896313e
include/asm-x86/uaccess_32.h: checkpatch cleanups - formatting only · b1fcec7f
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
b1fcec7f
include/asm-x86/tsc.h: checkpatch cleanups - formatting only · 2d86e637
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
2d86e637
include/asm-x86/topology.h: checkpatch cleanups - formatting only · 5d7d03b8
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
5d7d03b8
include/asm-x86/tlbflush.h: checkpatch cleanups - formatting only · 94cf8de0
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
94cf8de0
include/asm-x86/thread_info_64.h: checkpatch cleanups - formatting only · b98fff30
Joe Perches authored Mar 23, 2008
```
Signed-off-by: Joe Perches <joe@perches.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
```
b98fff30