Winsorized Mean

What Is the Winsorized Mean?

Winsorized mean is a method of averaging that initially replaces the littlest and biggest values with the perceptions closest to them. This is finished to limit the effect of exceptions or abnormal extreme values, or anomalies, on the calculation.

In the wake of supplanting the values, the arithmetic mean formula is then used to compute the winsorized mean.

Formula for the Winsorized Mean

$\begin &\text\ =\ \frac{x_\dots x_{n+1}\ +\ x_{n+2}\dots x_}\ &\textbf\ &\begin n\ =\ &\text\ &\text\ &\text\end\ &N\ =\ \text \end$
Winsorized means are communicated in two ways. A "kⁿ" winsorized mean alludes to the replacement of the "k" littlest and biggest perceptions, where "k" is an integer. An "X%" winsorized mean includes supplanting a given percentage of values from the two closures of the data.

The winsorized mean is accomplished by supplanting the littlest and biggest data points, then, at that point, summing every one of the data points and partitioning the sum by the total number of data points.

What Does the Winsorized Mean Tell You?

The winsorized mean is less sensitive to anomalies since it can supplant them with less extreme values. That is, it is less powerless to anomalies versus the arithmetic average. Be that as it may, assuming a distribution has fat tails, the effect of eliminating the highest and least values in the distribution will have little influence as a result of the high degree of variability in the distribution figures.

One major downside for winsorized means is that they normally bring some bias into the data set. By decreasing the influence of exceptions, the analysis is modified for better analysis, yet in addition eliminates data about the underlying data.

Illustration of How to Use Winsorized Mean

How about we ascertain the winsorized mean for the accompanying data set: 1, 5, 7, 8, 9, 10, 34. In this model, we assume the winsorized mean is in the principal order, in which we supplant the littlest and biggest values with their nearest perceptions.

The data set presently shows up as follows: 5, 5, 7, 8, 9, 10, 10. Taking an arithmetic average of the new set delivers a winsorized mean of 7.7, or (5 + 5 + 7 + 8 + 9 + 10 + 10) isolated by 7. Note that the arithmetic mean would have been higher — 10.6. The winsorized mean effectively decreases the influence of the 34 value as an exception.

Or on the other hand consider a 20% winsorized mean that takes the top 10% and base 10% and replaces them with their next closest value. We will winsorize the accompanying data set: 2, 4, 7, 8, 11, 14, 18, 23, 23, 27, 35, 40, 49, 50, 55, 60, 61, 61, 62, 75. The two littlest and two biggest data points — 20% of the 20 data points — will be supplanted with their next closest value. Subsequently, the new data set is as per the following: 7, 7, 7, 8, 11, 14, 18, 23, 23, 27, 35, 40, 49, 50, 55, 60, 61, 61, 61, 61. The winsorized mean is 33.9, or the total of the data (678) partitioned by the total number of data points (20).

Winsorized Mean versus Trimmed Mean

The winsorized mean incorporates changing data points, while the trimmed mean includes eliminating data points. It is common for the winsorized mean and trimmed mean to be close or some of the time equivalent in value to one another.

Highlights

The winsorized mean isn't equivalent to the trimmed mean, which includes eliminating data points rather than supplanting them — albeit the consequences of the two will generally be close.
It mitigates the effects of anomalies by supplanting them with less extreme values.
The winsorized mean is an averaging method that includes supplanting the littlest and biggest values of a data set with the perceptions closest to them.