- 29 Jul, 2009 36 commits
-
-
Gregory Haskins authored
From: Peter W. Morreale <pmorreale@novell.com> In wakeup_next_waiter(), we take the pi_lock, and then find out whether we have another waiter to add to the pending owner. We can reduce contention on the pi_lock for the pending owner if we first obtain the pointer to the next waiter outside of the pi_lock. Signed-off-by:
Peter W. Morreale <pmorreale@novell.com> Signed-off-by:
Gregory Haskins <ghaskins@novell.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
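A minimal sketch of the reordering described above, assuming simplified helper and field names from the rtmutex code of that era (not taken verbatim from the patch):

    static void enqueue_next_waiter_sketch(struct rt_mutex *lock,
                                           struct task_struct *pendowner)
    {
        struct rt_mutex_waiter *next = NULL;

        /* Do the lookup before taking the contended pi_lock ... */
        if (rt_mutex_has_waiters(lock))
            next = rt_mutex_top_waiter(lock);

        /* ... so the pending owner's lock is held only for the
         * actual enqueue, not for the lookup as well. */
        spin_lock(&pendowner->pi_lock);
        if (next)
            plist_add(&next->pi_list_entry, &pendowner->pi_waiters);
        spin_unlock(&pendowner->pi_lock);
    }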
-
Gregory Haskins authored
It is redundant to wake the grantee task if it is already running, and the call to wake_up_process is relatively expensive. If we can safely skip it we can measurably improve the performance of the adaptive-locks. Credit goes to Peter Morreale for the general idea. Signed-off-by:
Gregory Haskins <ghaskins@novell.com> Signed-off-by:
Peter Morreale <pmorreale@novell.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
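A sketch of the idea, with a hypothetical task_is_running_on_cpu() helper standing in for whatever "is the grantee already on a CPU?" test the kernel provides:

    /* Only pay for wake_up_process() when the new pending owner is
     * not already executing; waking a running task achieves nothing
     * but still costs a runqueue lock round trip. */
    if (!task_is_running_on_cpu(pendowner))     /* hypothetical helper */
        wake_up_process(pendowner);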
-
Steven Rostedt authored
After talking with Gregory Haskins about how they implemented his version of adaptive spinlocks and before I actually looked at their code, I was thinking about it while lying in bed. I always thought that adaptive spinlocks were to spin for a short period of time based on some heuristic and then sleep. This idea is totally bogus. No heuristic can account for a bunch of different activities. But Gregory mentioned something to me that made a hell of a lot of sense. And that is to only spin while the owner is running. If the owner is running, then it would seem that it would be quicker to spin than to take the scheduling hit. While lying awake in bed, it dawned on me that we could simply spin in the fast lock and never touch the "has waiters" flag, which would keep the owner from going into the slow path. Also, the task itself is preemptible while spinning so this would not affect latencies. The only trick was to not have the owner get freed between the time you saw the owner and the time you check its run queue. This was easily solved by simply grabbing the RCU read lock, because freeing of a task must happen after a grace period. I first tried to stay only in the fast path. This works fine until you want to guarantee that the highest prio task gets the lock next. I tried all sorts of hackery and found that there were too many cases where we could miss. I finally concurred with Gregory, and decided that going into the slow path was the way to go. I then started looking into what the guys over at Novell did. They had the basic idea correct, but went way overboard in the implementation, making it far more complex than it needed to be. I rewrote their work using the ideas from my original patch, and simplified it quite a bit. This is the patch that they wanted to do ;-) Special thanks goes out to Gregory Haskins, Sven Dietrich and Peter Morreale, for proving that adaptive spin locks certainly *can* make a difference. Signed-off-by:
Steven Rostedt <srostedt@redhat.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
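The shape of the resulting wait loop, written out as an illustrative sketch; task_is_running_on_cpu() is a stand-in for the actual owner-on-CPU test, the rest uses the regular rtmutex and RCU primitives:

    static int adaptive_wait_sketch(struct rt_mutex *lock,
                                    struct task_struct *owner)
    {
        int sleep = 0;

        /* The RCU read lock keeps the owner's task_struct from being
         * freed between looking at it and checking whether it runs. */
        rcu_read_lock();
        for (;;) {
            /* Owner changed: go back and retry taking the lock. */
            if (owner != rt_mutex_owner(lock))
                break;
            /* Owner got preempted: spinning no longer beats sleeping. */
            if (!task_is_running_on_cpu(owner)) {   /* hypothetical */
                sleep = 1;
                break;
            }
            cpu_relax();
        }
        rcu_read_unlock();

        return sleep;   /* caller only schedules out when this is set */
    }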
-
Steven Rostedt authored
No reason to update current if we are running. We'll do that when we exit the loop. Signed-off-by:
Steven Rostedt <srostedt@redhat.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Gregory Haskins authored
The current logic makes rather coarse adjustments to current->state since it is planning on sleeping anyway. We want to eventually move to an adaptive (e.g. optional sleep) algorithm, so we tighten the scope of the adjustments to bracket the schedule(). This should yield correct behavior with or without the adaptive features that are added later in the series. We add it here as a separate patch for greater review clarity on smaller changes. Signed-off-by:
Gregory Haskins <ghaskins@novell.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
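Roughly what "bracketing the schedule()" means, as a sketch with hypothetical helper names (try_to_take_sketch(), need_to_sleep()); only the placement of the state changes is the point:

    for (;;) {
        if (try_to_take_sketch(lock))           /* hypothetical */
            break;

        /* The state is flipped only around the sleep itself ... */
        set_current_state(TASK_UNINTERRUPTIBLE);
        if (need_to_sleep(lock))                /* hypothetical re-check */
            schedule();
        /* ... and restored immediately afterwards. */
        __set_current_state(TASK_RUNNING);
    }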
-
Gregory Haskins authored
The current logic only allows lock stealing to occur if the current task is of higher priority than the pending owner. We can gain significant throughput improvements (200%+) by allowing the lock-stealing code to include tasks of equal priority. The theory is that the system will make faster progress by allowing the task already on the CPU to take the lock rather than waiting for the system to wake up a different task. This does add a degree of unfairness, yes. But also note that the users of these locks under non-rt environments have already been using unfair raw spinlocks anyway, so the tradeoff is probably worth it. The way I like to think of this is that higher priority tasks should clearly preempt, and lower priority tasks should clearly block. However, if tasks have an identical priority value, then we can think of the scheduler decisions as the tie-breaking parameter. (e.g. tasks that the scheduler picked to run first have a logically higher priority among tasks of the same prio). This helps to keep the system "primed" with tasks doing useful work, and the end result is higher throughput. Thanks to Steven Rostedt for pointing out that RT tasks should be excluded to prevent the introduction of an unnatural unbounded latency. [ Steven Rostedt - removed config option to disable ] Signed-off-by:
Gregory Haskins <ghaskins@novell.com> Signed-off-by:
Steven Rostedt <srostedt@redhat.com> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
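A compact sketch of the relaxed stealing rule; the function name and the lateral flag are illustrative, not the exact -rt symbols, and remember that a lower ->prio value means higher priority:

    static int lock_is_stealable_sketch(struct task_struct *task,
                                        struct task_struct *pendowner,
                                        int lateral)
    {
        if (task->prio < pendowner->prio)
            return 1;   /* clearly higher priority: steal */

        /* Equal priority: let the task already on the CPU win the
         * tie-break, but never for RT tasks, where it could add an
         * unbounded latency for the pending owner. */
        if (task->prio == pendowner->prio && lateral && !rt_task(task))
            return 1;

        return 0;       /* lower priority: block behind the owner */
    }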
-
Thomas Gleixner authored
Implement the base infrastructure to replace spinlocks for -rt. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Debugging interface. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Necessary for rt_mutex wakeups to preserve the state. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Simplifies the separation of anon_rw_semaphores and rw_semaphores for -rt. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Triggers the missed preemption check. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
add new, -rt specific IRQ API variants. Maps to the same as before on non-PREEMPT_RT. include/linux/bottom_half.h | 8 ++++++++ 1 file changed, 8 insertions(+) Signed-off-by:
Ingo Molnar <mingo@elte.hu>
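Given the one-file diffstat above, the addition presumably looks something like the following; the exact *_nort names and the no-op behaviour on PREEMPT_RT are assumptions here, the only thing the message confirms is that the variants collapse to the existing calls on non-PREEMPT_RT:

    /* include/linux/bottom_half.h, sketched */
    #ifdef CONFIG_PREEMPT_RT
    # define local_bh_disable_nort()    do { } while (0)
    # define local_bh_enable_nort()     do { } while (0)
    #else
    # define local_bh_disable_nort()    local_bh_disable()
    # define local_bh_enable_nort()     local_bh_enable()
    #endif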
-
Ingo Molnar authored
Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
Make cancellation of a running callback in softirq context safe against preemption. Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
When softirqs can be preempted we need to make sure that cancelling the timer from the active thread can not deadlock vs. a running timer callback. Add a waitqueue to resolve that. Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
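The shape of such a wait, sketched with the existing hrtimer helpers; the 'wait' waitqueue on the per-CPU base is the field this patch would add, and its name here is an assumption:

    static void wait_for_running_timer_sketch(struct hrtimer *timer)
    {
        struct hrtimer_cpu_base *cpu_base = timer->base->cpu_base;

        /* Sleep instead of spinning until the softirq thread has
         * finished the callback and woken this queue. */
        wait_event(cpu_base->wait, !hrtimer_callback_running(timer));
    }

The counterpart would be a wake_up() on that queue from the softirq thread once the callback has returned.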
-
Thomas Gleixner authored
The random call introduces high latencies and is almost unused. Disable it for -rt. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Based on the mainline infrastructure, we force-thread all interrupts with per-device threads. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
Split them into separate threads, one for each softirq. Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Ingo Molnar authored
Migration masks/unmasks interrupts unconditionally. With forced irq threading that's going to result in an interrupt storm when the threaded handler has not finished yet. Signed-off-by:
Ingo Molnar <mingo@elte.hu>
-
Thomas Gleixner authored
RT wants that distinction for now. We need to revisit this. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
-
Thomas Gleixner authored
Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
-
Thomas Gleixner authored
-
Thomas Gleixner authored
Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Replaced the RW lock for now with atomic_spinlock. We have no atomic_rwlocks in -rt right now and I doubt that the OF path is performance critical. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Otherwise the HV magic gets confused if we are preempted. Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
-
Remy Bohmer authored
On ARM there is a problem where interrupt handling stalls when interrupts arrive faster than the kernel can handle them. The problem seems to occur on RT primarily, but it is also valid for non-RT kernels. The problem is twofold: * the handle_simple_irq() mechanism is used for GPIO, but because the GPIO interrupt source is actually an edge triggered interrupt source, the handle_edge_irq() mechanism must be used. While using the simple_irq() mechanism, edges can be missed on both mainline and RT kernels. The simple_irq mechanism is *never* meant to be used for these types of interrupts. See the thread at: http://lkml.org/lkml/2007/11/26/73 * The RT kernel has a problem that the interrupt gets masked forever when a new interrupt arrives while the interrupt thread is running. In the interrupt threads there is masking done in the handle_simple_irq() path, while a simple_irq typically cannot be masked. This patch only solves the first bullet, which is enough for AT91, by moving the GPIO interrupt handler over to handle_edge_irq(). To solve the problem in the simple_irq() path a separate fix has to be done, but as it is no longer used by AT91, that fix will not affect AT91. Tested on: * AT91rm9200-ek and a proprietary board * AT91SAM9261-ek (this patch also solves the problem that the DM9000 does not work on this board while using PREEMPT-RT) Signed-off-by:
Remy Bohmer <linux@bohmer.net> Signed-off-by:
Ingo Molnar <mingo@elte.hu> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
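The first bullet translates, roughly, into the following kind of change in the AT91 GPIO setup code; set_irq_chip()/set_irq_handler()/set_irq_flags() were the genirq setup calls of that era, while the chip structure and pin range here are placeholders:

    unsigned int pin;

    for (pin = PIN_BASE; pin < PIN_BASE + 32; pin++) {
        set_irq_chip(pin, &gpio_irqchip);
        /* was: set_irq_handler(pin, handle_simple_irq); */
        set_irq_handler(pin, handle_edge_irq);
        set_irq_flags(pin, IRQF_VALID);
    }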
-
Kevin Hilman authored
The locking in the get_rate() hook is unnecessary, and causes problems when used with the -rt patch, since it may be called recursively. Signed-off-by:
Kevin Hilman <khilman@mvista.com> Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Reduce latencies on -rt Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Thomas Gleixner authored
Annotate the locks which cannot be converted to sleeping locks in -rt Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
- 28 Jul, 2009 4 commits
-
-
Thomas Gleixner authored
Conflicts: include/linux/percpu.h Signed-off-by:
Thomas Gleixner <tglx@linutronix.de>
-
Linus Torvalds authored
Merge branch 'hwmon-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jdelvare/staging
* 'hwmon-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jdelvare/staging: hwmon: (asus_atk0110) Fix upper limit readings hwmon: (smsc47m1) Differentiate between LPC47M233 and LPC47M292
-
Linus Torvalds authored
Merge branch 'i2c-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jdelvare/staging
* 'i2c-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/jdelvare/staging: i2c/tsl2550: Fix lux value in dark environment
-
Linus Torvalds authored
Merge git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable
* git://git.kernel.org/pub/scm/linux/kernel/git/mason/btrfs-unstable: (22 commits) Btrfs: Fix async caching interaction with unmount Btrfs: change how we unpin extents Btrfs: Correct redundant test in add_inode_ref Btrfs: find smallest available device extent during chunk allocation Btrfs: clear all space_info->full after removing a block group Btrfs: make flushoncommit mount option correctly wait on ordered_extents Btrfs: Avoid delayed reference update looping Btrfs: Fix ordering of key field checks in btrfs_previous_item Btrfs: find_free_dev_extent doesn't handle holes at the start of the device Btrfs: Remove code duplication in comp_keys Btrfs: async block group caching Btrfs: use hybrid extents+bitmap rb tree for free space Btrfs: Fix crash on read failures at mount Btrfs: remove of redundant btrfs_header_level Btrfs: adjust NULL test Btrfs: Remove broken sanity check from btrfs_rmap_block() Btrfs: convert nested spin_lock_irqsave to spin_lock Btrfs: make sure all dirty blocks are written at commit time Btrfs: fix locking issue in btrfs_find_next_key Btrfs: fix double increment of path->slots[0] in btrfs_next_leaf ...
-