Scatter Diagram Correlation | Meaning, Interpretation, Example

What is a Scatter Diagram?

A simple and attractive method of measuring correlation by diagrammatically representing bivariate distribution for determination of the nature of the correlation between the variables is known as the Scatter Diagram Method. This method gives the investigator/analyst a visual idea of the nature of the association between the two variables. It is the simplest method of studying the relationship between two variables as there is no need to calculate any numerical value. 

Table of Content

  • What is a Scatter Diagram?
  • How to draw a Scatter Diagram?
  • Interpretation of Scatter Diagram
    • 1. Perfect Positive Correlation
    • 2. Perfect Negative Correlation
    • 3. Positive Correlation
    • 4. Negative Correlation
    • 5. No Correlation
  • Example of Scatter Diagram Correlation
  • Merits of Scatter Diagram
  • Demerits of Scatter Diagram
  • Scatter Diagram Correlation – FAQs

How to draw a Scatter Diagram?

The two steps required to draw a Scatter Diagram or Dot Diagram are as follows:

  1. Plot the values of the given variables (say X and Y) along the X-axis and Y-axis, respectively.
  2. Show these plotted values on the graph by dots. Each of these dots represents a pair of values.

Interpretation of Scatter Diagram

After observing the pattern of dots, one can know the presence or absence of correlation and its type. Besides, it also gives an idea of the nature and intensity of the relationship between the two variables. 

The scatter diagram can be interpreted in the following ways:

1. Perfect Positive Correlation

If the points of the scatter diagram fall on a straight line and have a positive(upward) slope, then the correlation is said to be perfectly positive; i.e., r = +1.

 

2. Perfect Negative Correlation

If the points of the scatter diagram fall on a straight line and have a negative(downward) slope, then the correlation is said to be perfectly negative; i.e., r = -1.

 

3. Positive Correlation

When the points of the scatter diagram cluster around a straight line (upward slope from left to right), then the correlation is said to be positive.

 

4. Negative Correlation

When the points of the scatter diagram cluster around a straight line (downward/negative slope), then the correlation is said to be negative.

 

5. No Correlation

When the points of the scatter diagram are scattered in a haphazard manner, then there is zero or no correlation. 

 

How to interpret a Scatter Diagram?

While interpreting a scatter diagram, the given below points should be taken into consideration:

  • Dense or Scattered Points: If the plotted points are close to each other, then the analyst can expect a high degree of correlation between the two variables. However, if the plotted points are widely scattered, then the analyst can expect a poor correlation between the variables.
  • Trend or No Trend: If the points plotted on the scatter diagram shows any trend either upward or downward, then it can be said that the variables are correlated. However, if the plotted points do not show any trend, then it can be said that the variables are uncorrelated.
  • Upward or Downward Trend: If the plotted points show an upward trend rising from the lower left-hand corner of the graph and goes upward to the upper right-hand corner, then the correlation is positive. It means that the two variables move in the same direction. However, if the plotted points show a downward trend from the upper left-hand corner of the graph to the lower right-hand corner, then the correlation is negative. It means that the two variables move in the opposite direction.
  • Perfect Correlation: If the points plotted on the scatter diagram lie on a straight line and have a positive slope, then it can be said that the correlation is perfect and positive. However, if the points plotted lie on a straight line and have a negative slope, then it can be said that the correlation is perfect and negative.

Example of Scatter Diagram Correlation

Draw a Scatter Diagram for the following data and state the type of correlation between the given two variables X and Y.

 

Solution:

We will draw the scatter diagram by plotting the values of Series X on the X-axis and values of Series y on the Y-axis (10, 80), (20, 160),………(60, 480).

We can see that all the points of the given two variables X and Y are plotted on a positively sloping straight line, which means that there is a Positive Correlation between the values of Series X and Y.

Merits of Scatter Diagram

1. Simplicity: Scatter Diagram is a simple and non-mathematical method to study correlation between two variables.

2. First Step: It is the first step of investigating the relationship between two variables.

3. Easily Understandable: One can easily understand and interpret scatter diagrams. Besides, only at a single glance at the diagram, one can easily tell the presence or absence of correlation.

4. Not Affected by Extreme Items: The size of extreme values does not affect the scatter diagram. It is a quality which is not present in most mathematical methods.

Demerits of Scatter Diagram

1. Rough Measure: Scatter diagram only gives a rough idea of the degree and nature of correlation between the given two variables. Therefore, it is only a qualitative expression rather than a quantitative expression.

2. Non-mathematical Method: Like other methods of correlation, Scatter Diagram Method does not indicate the exact numerical value of correlation. 

3. Unsuitable for Large Observations: If there are more than two variables, it becomes difficult to draw a scatter diagram.

Scatter Diagram Correlation – FAQs

What is a scatter diagram?

A scatter diagram, also known as a scatter plot, is a graphical representation of the relationship between two quantitative variables. Each point on the scatter diagram represents an observation in the dataset, with one variable plotted on the x-axis and the other on the y-axis.

How can you determine the direction of the relationship using a scatter diagram?

Positive Relationship: If the points tend to rise from left to right, the relationship is positive, indicating that as one variable increases, the other variable also increases.

Negative Relationship: If the points tend to fall from left to right, the relationship is negative, indicating that as one variable increases, the other variable decreases.

No Relationship: If the points are randomly scattered with no discernible pattern, there is no clear relationship between the variables.

What are outliers in a scatter diagram?

Outliers are points that lie far away from the overall pattern of the data. They may indicate unusual observations or errors in the data.

Can scatter diagrams show non-linear relationships?

Yes, scatter diagrams can show non-linear relationships. If the points form a pattern that is curved or follows a non-linear trend, it indicates a non-linear relationship between the variables.

How can you enhance a scatter diagram to better interpret correlation?

You can enhance a scatter diagram by:

  • Adding a Trend Line: A line of best fit (regression line) helps in visualizing the overall trend.
  • Color-Coding Points: Use different colors to represent different groups or categories within the data.
  • Annotating Outliers: Mark or label outliers to highlight unusual observations.
  • Using Jitter: Add slight random noise to the points to better visualize dense clusters (helpful for large datasets).


Contact Us