Re: [platform-releng-dev] Dimensions with unusable statistical properties

From: Tom Eicher <eclipse@xxxxxxxxxxxxxxx>
Date: Tue, 07 Mar 2006 09:19:32 +0100
Organization: IBM OTI Labs

I am to blame.

I noticed that some of our performance tests were not robust enough to get any meaningful result from them. That is, some green and red bars on the performance pages really didn't contain any useful information about a test other than that it is highly unstable; any result might have occurred by chance, due to sampling fluctuation.

So I added code to the evaluation of performance data that may flag a test as unstable. This is what you are seeing.


A test is flagged:

- When the test was run only once, as then there is no measure of the stability of the test (standard deviation) and one cannot infer anything about the real-life average of the sampled dimension.

- When the test, given its standard error, would require a sample size > 1000 to be able to measure a 5% deviation from the mean. The standard error is high when there are large differences between the different samples taken. This usually means that a test is very unstable.
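
The two criteria above can be sketched roughly as follows. This is only an illustration, not the code in org.eclipse.test.performance: the function names, the 95% confidence level (z = 1.96), and the sample-size formula n = (z * s / (0.05 * mean))^2 are assumptions chosen to match the description.

```python
import math
import statistics

Z_95 = 1.96             # assumed confidence level (95%); not stated in the message
REL_DEVIATION = 0.05    # the 5% deviation from the mean mentioned above
MAX_SAMPLE_SIZE = 1000  # the sample-size threshold mentioned above


def required_sample_size(samples, rel_deviation=REL_DEVIATION, z=Z_95):
    """Estimate how many samples would be needed so that a deviation of
    rel_deviation * mean exceeds z standard errors (hypothetical helper)."""
    mean = statistics.mean(samples)
    stdev = statistics.stdev(samples)  # sample standard deviation, needs >= 2 values
    if mean == 0:
        return math.inf  # a relative deviation is undefined for a zero mean
    return math.ceil((z * stdev / (rel_deviation * abs(mean))) ** 2)


def is_unusable(samples):
    """Flag a dimension as described: a single run, or too noisy to resolve 5%."""
    if len(samples) < 2:
        return True  # one run gives no measure of stability
    return required_sample_size(samples) > MAX_SAMPLE_SIZE


# Example: elapsed times in ms (made-up numbers)
print(is_unusable([120.0]))                       # single run
print(is_unusable([100, 101, 99, 100, 102, 98]))  # tightly clustered samples
print(is_unusable([10, 200, 35, 400, 5, 320]))    # widely scattered samples
```

Under these assumptions, a test run only once is always flagged, and a test with several runs is flagged only when its samples are scattered widely relative to their mean.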

Note that the performance output on the download pages was also modified: tests that measure a statistically insignificant deviation from the baseline measurement are grayed out.
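
One way such a significance check could look is sketched below. It is an assumption for illustration only: it compares the difference of the two means against the combined standard error using a normal approximation (z = 1.96), which is rough for small samples, and the function name is hypothetical. The actual test used for the download pages may differ.

```python
import math
import statistics


def deviation_is_significant(baseline, current, z=1.96):
    """Return True if the difference between the two sample means exceeds
    z combined standard errors (normal approximation, assumed here)."""
    diff = statistics.mean(current) - statistics.mean(baseline)
    # Standard error of the difference of two independent sample means
    se = math.sqrt(
        statistics.variance(baseline) / len(baseline)
        + statistics.variance(current) / len(current)
    )
    if se == 0:
        return diff != 0  # no noise at all: any difference counts
    return abs(diff) > z * se


# Made-up measurements: a clear regression vs. a difference lost in the noise
print(deviation_is_significant([100, 101, 99, 100], [120, 121, 119, 120]))
print(deviation_is_significant([100, 110, 90, 105], [102, 108, 92, 104]))
```

A result for which this returns False would be a candidate for graying out.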

Do you think your tests are wrongly flagged? Or is just the output message unclear? Or is the above a bad idea? Please comment (or correct my limited understanding of statistics).


-tom

See also:

https://bugs.eclipse.org/bugs/show_bug.cgi?id=127264 - performance testing: add significance information to performance graphs
https://bugs.eclipse.org/bugs/show_bug.cgi?id=126358 - performance: small update to the performance plug-in


Wassim Melhem wrote:

About two weeks ago, I got the latest org.eclipse.test.performance from
HEAD and since then I have not been able to get any numbers when I run the
PDE/UI performance tests in my workspace.

I keep getting messages like this one spit out to the console:

"Dimensions with unusable statistical properties: Used Java Heap, Working
Set, Committed, Working Set Peak, Elapsed Process, Kernel time, Page
Faults, CPU Time, GDI Objects"

What makes the statistical properties of a dimension unusable?

The same tests were fine before, so why now?


Thanks...

Wassim.


