Percentile: Difference between revisions
imported>Anh Nguyen m (→Examples) |
mNo edit summary |
||
(28 intermediate revisions by 8 users not shown) | |||
Line 1: | Line 1: | ||
{{subpages}} | |||
'''Percentiles''' are statistical parameters which describe the distribution | |||
of a (real) value in a population (or a sample). | |||
Roughly speaking, the ''k''-th percentile separates the smallest ''k'' percent | |||
of values from the largest (100-''k'') percent. | |||
Special percentiles are the [[median]] (50th percentile), | |||
the quartiles (25th and 75th percentile), | |||
the quintiles (20th, 40th, 60th and 80th percentile), | |||
and the deciles (the ''k''-th decile is the (10''k'')-th percentile). | |||
Percentiles are special cases of [[quantile]]s: | |||
The ''k''-th percentile is the same as the (''k''/100)-quantile. | |||
== Definition == | |||
The value ''x'' is ''k''-th percentile (for a given ''k'' = 1,2,...,99) if | |||
: <math> P(\omega\le x) \ge {k\over100} \textrm{\ \ and \ \ } | |||
P(\omega\ge x) \ge 1-{k\over100} \, \textrm{.} </math> | |||
In this definition, ''P'' is a probability distribution on the real numbers. | |||
It may be obtained either | |||
* from a (theoretical) probability measure (such as the [[normal distribution|normal]] or [[Poisson distribution]]), or | |||
* from a finite population where it expresses the probability of a random element to have the property,<br>i.e., it is the relative frequency of elements with this property (number of elements with the property divided by the size of the population),or | |||
* from a sample of size ''N'' where it also is the relative frequency which is used to estimate the corresponding percentile for the population from which it was taken. | |||
== Special cases == | |||
For most standard continuous distributions (like the [[normal distribution]]) the | |||
''k''-th percentile ''x'' is uniquely determined by | |||
: <math> P(\omega\le x) = {k\over100} \textrm{\ \ and \ \ } | |||
P(\omega\ge x) = 1-{k\over100} </math> | |||
In the general case (e.g., for discrete distributions, or for finite samples) | |||
it may happen that the separating value has positive probability: | |||
: <math> P(\omega = x) > 0 \Rightarrow | |||
P(\omega\le x) > {k\over100} \textrm{\ \ or \ \ } | |||
P(\omega\ge x) > 1-{k\over100} </math> | |||
or that there is a gap in the range of the variable such that, for two distinct | |||
<math> x_1 < x_2 </math>, equality holds: | |||
: <math> P(\omega\le x_1) = {k\over100} \textrm{\ \ and \ \ } | |||
P(\omega\ge x_2) = 1-{k\over100} </math> | |||
Then every value in the ([[closed interval|closed]]) interval between the smallest and the largest such value | |||
: <math> \left [ \min \left\{ x \Bigl\vert P(\omega\le x) = {k\over100} \right\}, | |||
\max \left\{ x \Bigl\vert P(\omega\ge x) = 1-{k\over100} \right\} \right]</math> | |||
is a ''k''-th percentile. | |||
== | == Examples == | ||
The following examples illustrate this: | |||
* Take a sample of 101 values, ordered according to their size: | |||
:: <math> x_1 \le x_2 \le \dots \le x_{100} \le x_{101} </math>. | |||
: Then the unique ''k''-th percentile is <math>x_{k+1}</math>. | |||
* If there are only 100 values | |||
< | :: <math> x_1 \le x_2 \le \dots \le x_{99} \le x_{100} </math>. | ||
: Then any value between <math>x_k</math> and <math>x_{k+1}</math> is a ''k''-th percentile. | |||
'''Example from the praxis:''' | |||
<br> | |||
Educational institutions (i.e. universities, schools...) frequently report admission test scores in terms of percentiles. | |||
For instance, assume that a candidate obtained 85 on her verbal test. | |||
The question is: How did this student compared to all other students? | |||
If she is told that her score correspond to the 80th percentile, | |||
we know that approximately 80% of the other candidates scored lower than she | |||
[[ | and that approximately 20% of the students had a higher score than she had.[[Category:Suggestion Bot Tag]] | ||
Latest revision as of 16:00, 2 October 2024
Percentiles are statistical parameters which describe the distribution of a (real) value in a population (or a sample). Roughly speaking, the k-th percentile separates the smallest k percent of values from the largest (100-k) percent.
Special percentiles are the median (50th percentile), the quartiles (25th and 75th percentile), the quintiles (20th, 40th, 60th and 80th percentile), and the deciles (the k-th decile is the (10k)-th percentile). Percentiles are special cases of quantiles: The k-th percentile is the same as the (k/100)-quantile.
Definition
The value x is k-th percentile (for a given k = 1,2,...,99) if
In this definition, P is a probability distribution on the real numbers. It may be obtained either
- from a (theoretical) probability measure (such as the normal or Poisson distribution), or
- from a finite population where it expresses the probability of a random element to have the property,
i.e., it is the relative frequency of elements with this property (number of elements with the property divided by the size of the population),or - from a sample of size N where it also is the relative frequency which is used to estimate the corresponding percentile for the population from which it was taken.
Special cases
For most standard continuous distributions (like the normal distribution) the k-th percentile x is uniquely determined by
In the general case (e.g., for discrete distributions, or for finite samples) it may happen that the separating value has positive probability:
or that there is a gap in the range of the variable such that, for two distinct , equality holds:
Then every value in the (closed) interval between the smallest and the largest such value
is a k-th percentile.
Examples
The following examples illustrate this:
- Take a sample of 101 values, ordered according to their size:
- .
- Then the unique k-th percentile is .
- If there are only 100 values
- .
- Then any value between and is a k-th percentile.
Example from the praxis:
Educational institutions (i.e. universities, schools...) frequently report admission test scores in terms of percentiles.
For instance, assume that a candidate obtained 85 on her verbal test.
The question is: How did this student compared to all other students?
If she is told that her score correspond to the 80th percentile,
we know that approximately 80% of the other candidates scored lower than she
and that approximately 20% of the students had a higher score than she had.