Re: [PATCH v2 -tip] perf: x86, add SandyBridge support

From: Stephane Eranian
Date: Tue Mar 01 2011 - 10:09:41 EST


On Tue, Mar 1, 2011 at 4:07 PM, Lin Ming <ming.m.lin@xxxxxxxxx> wrote:
> On Tue, 2011-03-01 at 17:39 +0800, Stephane Eranian wrote:
>> On Tue, Mar 1, 2011 at 9:57 AM, Stephane Eranian <eranian@xxxxxxxxxx> wrote:
>> > On Tue, Mar 1, 2011 at 9:45 AM, Lin Ming <ming.m.lin@xxxxxxxxx> wrote:
>> >> On Tue, 2011-03-01 at 15:43 +0800, Stephane Eranian wrote:
>> >>> On Mon, Feb 28, 2011 at 10:15 AM, Peter Zijlstra <a.p.zijlstra@xxxxxxxxx> wrote:
>> >>> > On Mon, 2011-02-28 at 15:22 +0800, Lin Ming wrote:
>> >>> >> This patch adds basic SandyBridge support, including hardware cache
>> >>> >> events and PEBS events support.
>> >>> >>
>> >>> >> LLC-* hareware cache events don't work for now, it depends on the
>> >>> >> offcore patches.
>> >>> >
>> >>> > What's the status of those, Stephane reported some problems last I
>> >>> > remember?
>> >>> >
>> >>> I tried the trick I mentioned and it seems to work.
>> >>>
>> >>> Something like below with hwc->extra_alloc.
>> >>> Could probably find a better name for that field.
>> >>
>> >> Stephane,
>> >>
>> >> I'll integrate below changes to the offcore patches, OK?
>> >>
>> > Let me try one more test on this.
>> > I want to show the case the caused the problem in the first place.
>> >
>>
>> There you go:
>>
>> $ task -e offcore_response_0:DMND_DATA_RD:local_dram -e
>> offcore_response_0:DMND_DATA_RD:local_dram noploop 1
>>
>> Here the two instances of offcore_response are in two different event groups.
>> I instrumented get/put percore constraint routines. get1 is where you do the
>> first allocation, get2 is where you do ref++.
>>
>> The scheduling algorithm will do:
>> Â - submit 1st group
>> Â - schedule 1st group
>> Â - submit 2nd group
>> Â - schedule 1st + 2nd group
>>
>> Which results in the following trace:
>>
>> [ Â109.855713] CPU0 get1 cfg=1301b7 ref=1
>> [ Â109.855717] CPU0 get2 cfg=1301b7 ref=2
>> [ Â109.855718] CPU0 get2 cfg=1301b7 ref=3 <-- this one is bogus
>> [ Â109.856606] CPU0 put cfg=1301b7 ref=2
>> [ Â109.856609] CPU0 put cfg=1301b7 ref=1 Â<- don't free the resource
>
> Do you mean the issue is still there even with your extra_alloc patch
> applied?
>
No, I am showing you what happens without it.
If you try with it, it should work.

>>
>> [ Â109.856616] CPU0 get2 cfg=1301b7 ref=2
>> [ Â109.856619] CPU0 get2 cfg=1301b7 ref=3
>> [ Â109.856622] CPU0 get2 cfg=1301b7 ref=4
>> [ Â110.742151] CPU0 put cfg=1301b7 ref=3
>> [ Â110.742154] CPU0 put cfg=1301b7 ref=2
>>
>> [ Â110.742160] CPU0 get2 cfg=1301b7 ref=3
>> [ Â110.742161] CPU0 get2 cfg=1301b7 ref=4
>> [ Â110.742163] CPU0 get2 cfg=1301b7 ref=5
>> [ Â110.854448] CPU0 put cfg=1301b7 ref=4
>> [ Â110.854450] CPU0 put cfg=1301b7 ref=3
>
>
>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/