Types of Outlier

Outliers can be categorized as extreme and mild based on their deviation from the dataset’s central tendency.

Extreme Outlier

Data points that lie far from the mean or median, typically beyond 3 times the interquartile range (IQR).

Formula: 

Outlier = Q3 + 3 × IQR

Example:

In a dataset with Q3 = 20, Q1 = 10, and IQR = 10, an extreme outlier would be any value above 50.

Mild Outlier

Data points that are moderately different from the rest of the data, falling between 1.5 to 3 times the IQR from the quartiles.

Formula: 

Outlier = Q3 + 1.5 × IQR

Example:

In the same dataset, a mild outlier would fall between 20 and 35.

Outlier

Outliers stand for data points that are indicative of a much higher variability than other observations in a given dataset. This can result in skewing statistical studies and wrong conclusions after all the variables are not adequately identified and handled. Identifications of outliers are very relevant for the financial sector, healthcare industry and decision-making processes that depend on data analysis.

In this article, we will learn in detail about outlier, its definition, examples, types, how to find outlier, their uses and how they are different of inliers.

Similar Reads

What is Outlier?

An outlier is also a data point that is drastically different from the other records in the dataset, with the differences being either too high or too low when compared to the rest of the observations. These extreme values are one of the reasons why giving out correct results based on the prepared analysis may be out of order if the statistical values aren’t precisely identified and addressed. Outliers may occur as a result of different reasons, e.g., measurement error, experimental variability, or genuine anomalies in the data....

Outlier Examples

Example 1: Dataset: 10, 12, 14, 16, 18, 500...

Types of Outlier

Outliers can be categorized as extreme and mild based on their deviation from the dataset’s central tendency....

How to Find Outliers?

To identify outliers in a dataset, you can use the following two methods:...

Causes of Outliers

There are four main causes of outliers in a dataset:...

Uses of Outliers

Anomaly Detection: Identifying unusual patterns in data. Quality Control: Monitoring for defects or irregularities. Financial Analysis: Detecting fraudulent activities or unusual transactions. Predictive Modeling: Improving model accuracy by handling outliers appropriately....

Difference between Outliers and Inliers

The difference between Outliers and Inliers are tabulated below:...

FAQs on Outliers

What are outliers?...

Contact Us