A Guide To Understand Negative Correlation | Outlier (2024)

In This Article

  1. What is a Negative Correlation?

  2. Negative Correlations and Causation

  3. Negative Correlations and Scatter Plots

  4. Negative Correlation, Positive Correlation and Zero Correlation

  5. Why is Correlation Important?

  6. How to Interpret Negative Correlation

  7. How to Determine Negative Correlation

  8. Computing Correlation Using Software

  9. Factors That Influence Correlation

  10. 5 Negative Correlation Examples

Don't Overpay For College Statistics

Take Intro to Statistics Online with Outlier.org

From the co-founder of MasterClass, earn transferable college credits from the University of Pittsburgh (a top 50 global school). The world's best online college courses for 50% less than a traditional college.

Get started

A Guide To Understand Negative Correlation | Outlier (1)

What is a Negative Correlation?

A negative correlation — also known as an inverse correlation — describes a relationship between two variables that tend to move in opposite directions. When two variables are negatively correlated, a higher value of one is associated with a lower value of the other and vice versa.

Examples of Negatively Correlated Variables

  • Covid cases and air travel

  • Ice cream sales and hot chocolate sales

  • Hours spent NOT studying for an exam and performance on an exam

  • Exercise and body fat

Negative Correlations and Causation

A negative correlation does not imply that one variable causes a change in the other. While this could be the case, it could also be the case that some third variable explains the correlation. It could even be that the correlation occurs by chance and not because of any causal factors.

Consider two of the examples from above. In the negative correlation between Covid cases and air travel, the number of COVID cases has a causal effect on air travel. The prevalence of COVID makes people less inclined to book a flight. In the case of ice cream and hot chocolate sales, however, nothing about selling ice cream causes the sale of hot chocolate to fall. Instead, a third factor, temperature, is responsible for the negative correlation. The temperature has a causal effect on the sale of both items. People crave more hot chocolate and less ice cream when temperatures are low, and people crave more ice cream and less hot chocolate when temperatures are high.

Remember, just because two things are correlated does not mean that one causes the other. Correlation does not imply causation! A correlation simply establishes an observable association between two variables.

Negative Correlations and Scatter Plots

A great way to visualize correlations is with a scatter plot. A scatter plot is a graph that plots the value of one variable measured along the y-axis in relation to values of another variable measured along the x-axis.

When two variables are negatively correlated, the plotted points will be clustered around a downward sloping line. Why? Because higher values of the good on the y-axis are associated with lower values of the good on the x-axis, and lower values of the good on the y-axis are associated with higher values of the good on the x-axis.

A Guide To Understand Negative Correlation | Outlier (2)

Negative Correlation, Positive Correlation and Zero Correlation

Negative correlations are just one type of correlation. Correlations can be positive, equal to zero, or negative.

While a negative correlation indicates that two variables tend to move in opposite directions, a positive correlation means that two variables tend to move in the same direction (think ice cream sales and popsicle sales). If two variables are positively correlated, higher values of one will be associated with higher values of the other and vice versa. A zero correlation (or no correlation) indicates no observable relationship between the two variables.

When two variables are positively correlated, their values will be clustered around an upward-sloping line on a scatter plot. If the two variables are not correlated, there will be no line of best fit that you can draw to show a linear trend in the plotted points.

Why is Correlation Important?

Negative correlations, and correlations more generally, are important because they improve our ability to estimate and predict things. If two variables are correlated, knowing something about one gives you valuable information about the other.

As an illustration, suppose you are a doctor who knows there is a well-established negative correlation between a person’s daily minutes of cardiovascular exercise and the risk of suffering a particular heart disease. Knowing this correlation allows you to diagnose your patients better. By learning about your patients’ exercise habits, you are better able to assess which patients are at a higher risk of having the disease.

If two variables are not correlated, knowing the value of one of the variables tells you nothing about the value of the other.

How to Interpret Negative Correlation

Correlation is measured on a scale from -1 to 1. A correlation that is less than 0 and greater than or equal to -1 is a negative correlation. A correlation equal to 0 is a zero correlation, and a correlation greater than zero or less than or equal to 1 is a positive correlation.

A Guide To Understand Negative Correlation | Outlier (4)

In addition to being positive, negative, or zero, correlations can be strong or weak. The closer a correlation is to -1 or 1, the stronger the correlation is. The closer a correlation is to 0, the weaker it is. A strong correlation means that the association between the two variables is strong and that your ability to estimate the value of one variable based on the value of the other is better than if the correlation was weaker.

A correlation of -1 is called a perfect negative correlation, and a correlation of 1 is called a perfect positive correlation. A perfect correlation, though rare, is the strongest type of correlation you can observe. When a correlation is perfect, knowing one variable allows you to predict the value of the other perfectly. As an example, think of a test that has 100 questions. The number of correct answers a student gets on the test and the number of incorrect answers the student has on the test is an example of a perfect negative correlation. If you know a student got 70 out of 100 questions correct, you know for sure that the student got 30 questions wrong!

On a scatter plot, the points of two perfectly correlated variables will not be clustered around a line of best fit. Instead, every point falls exactly on the line.

A Guide To Understand Negative Correlation | Outlier (5)

How to Determine Negative Correlation?

The most common measure of correlation is the Pearson correlation coefficient. The Pearson correlation coefficient (often just called the correlation coefficient) is denoted by the Greek letter rho (⍴) when calculated for a population and by the lower-case letter r when calculated for a sample.

There are many ways to calculate the Pearson correlation coefficient. Here we will cover two methods for calculating the Pearson correlation coefficient for a sample. Both methods will give you the same result.

Method 1

The first way to calculate the Pearson correlation coefficient is to divide the covariance of x and y by the product of the standard deviation of x and the standard deviation of y.

rxy=cov(xy)sxsyr_{xy}=\frac{cov(xy)}{s_xs_y}rxy=sxsycov(xy)

  • Where rxyr_{xy}rxy is the correlation coefficient for two variables xxx and yyy

  • cov(X,Y) is the sample covariance between xxx and yyy

  • sxs_{x}sx is the sample standard deviation of xxx

  • sys_{y}sy is the sample standard deviation of yyy

Stock Correlation Example

The sample covariance between the price of two stocks is -8. The sample standard deviation of the price of the first stock is 5, and the sample standard deviation of the price of the second stock is 16.

Use the correlation coefficient equation from above to calculate the correlation coefficient for these two stocks. Is the correlation positive or negative? Is it a strong or weak correlation?

Solution

rxy=8(5)(16)r_{xy}=\frac{-8}{(5)(16)} rxy=(5)(16)8

The correlation between the two stocks is -0.1. This is a weak negative correlation.

Method 2

The second way to calculate the Pearson correlation coefficient is to take the average of the sum of the cross-product of the Z-scores of xxx and yyy. When calculating a sample correlation coefficient, we must divide the sum of the cross product by nnn-1 (the sample size minus one) rather than dividing by nnn.

rxy=ZxZyn1r_{xy}=\frac{\sum Z_xZ_y}{n-1}rxy=n1ZxZy

  • Where rxyr_{xy}rxy is the correlation coefficient for two variables xxx and yyy

  • ∑ is the sum of the product of ZxZ_{x}Zx and ZyZ_{y}Zy for every data point (x,y)

  • ZxZ_{x}Zx is the Z-score of a value of xxx

  • ZyZ_{y}Zy is the Z-score of a value of yyy

  • nnn is the sample size

Example

Calculate the sample correlation coefficient for xxx and yyy.

The table shows the values of xxx and yyy drawn from a sample of 5 (n=5).

Use the information in the table and the equation from Method 2 to find the Pearson correlation coefficient for xxx and yyy. Is the correlation positive or negative? Is it a strong or weak correlation?

Solution

The table provides us with the values of xxx and yyy and the Z-scores for each of the values of xxx and yyy.

The first step to finding the correlation coefficient is finding the Z-scores’ product (ZxZyZ_xZ_yZxZy). To do this, you can fill out the last column of the table.

You should find that the five values from top to bottom are:

  • -1.547

  • -0.425

  • -0.243

  • -1.577

Next, you need to sum the values you found in Step 1 to find the sum ZxZy\sum Z_xZ_yZxZy. If you do this, you should get the value in the bottom-right cell of the table ZxZy\sum Z_xZ_yZxZy = -3.790.

Now that you have the sum of the cross product of the Z-scores, all you need to do is divide by nnn-1. Since the sample size in this problem is 5, nnn-1 is equal to 4.

rxy=ZxZyn1=3.7904=0.948r_{xy}=\frac{\sum Z_xZ_y}{n-1} = \frac{-3.790}{4} = -0.948rxy=n1ZxZy=43.790=0.948

Answer: The correlation coefficient for xxx and yyy, rxyr_{xy}rxy, is equal to -0.948. This is a strong negative correlation.

Computing Correlation Using Software

While it’s important to understand how to calculate correlation, more often than not, you will be computing correlations using software.

Calculating in Excel or Google Sheets

To calculate correlations, use the formula =CORREL(Array1, Array 2)

As an example, say you have five values of x in cells A2 through A6 and five values of y in cells B2 through B6. To find the correlation between x and y, you would enter the following into an empty cell =CORREL(A2:A6, B2:B6).

Calculating in Desmos

To calculate correlations, first, create a table with your xxx and yyy values, labeling the xxx variable x1x_{1}x1 and the yyy variable y2y_{2}y2. Once you have your table, use the command corr(x1x_{1}x1, y1y_{1}y1) to find the correlation between your variables.

Calculating in R

To calculate correlations in R, first, create two variables with your xxx and yyy values. Give each variable a name such as xxx and yyy, or chocolatesales and icecreamsales. Once you've stored your variables, use the function cor() to compute the correlation. If you named your variables xxx and yyy, you would type cor(x, y). If you named your variables chocolatesales and icecreamsales, you would type cor(chocolatesales, icecreamsales).

Factors That Influence Correlation

There are a number of factors that can affect your correlation coefficient and throw off your results such as:

  • Outliers

  • Restriction of range

  • Non-linear relationships

AnnMaria De Mars explains:

5 Negative Correlation Examples

Here are some examples of how negative correlations are used in real life.

1. Economics

In economics, there is an established negative correlation between inflation (the rate at which prices rise) and the unemployment rate (the percentage of the workforce out of a job). In an ideal world, we want to keep BOTH inflation and the unemployment rate low, but due to the inverse correlation between these two measures, economists and politicians often face a trade-off and must try to strike a good balance between pursuing low inflation and low unemployment.

2. Finance

In finance, investors might use negative correlations to balance and diversify their investments. If the growth in two investments is negatively correlated, an investor can hedge her bets by investing in both investments. If the value of one of her investments goes down, she is likely to see a positive return on the other.

3. Education

In education, researchers often look for variables that are either positively or negatively correlated with student achievement and educational performance. These correlations help researchers pinpoint characteristics and interventions that are associated with better or worse academic performance. Once these correlations are established, researchers can try to determine if the correlations are causal and whether any further interventions can help improve educational outcomes.

4. Medicine

In medicine, doctors and medical researchers use correlations to improve their understanding of disease and medical conditions. Certain medical conditions can be correlated with specific risk factors, symptoms, behaviors, or other diseases. Establishing correlations can help doctors diagnose, advise, and treat patients.

5. Sports

In sports, correlations can be used to assess the performance of individual athletes or teams. People who bet on sports might also look for correlations to place smarter bets.

Explore Outlier's Award-Winning For-Credit Courses

Outlier (from the co-founder of MasterClass) has brought together some of the world's best instructors, game designers, and filmmakers to create the future of online college.

Check out these related courses:

A Guide To Understand Negative Correlation | Outlier (7)

Intro to Statistics

How data describes our world.

Explore course

A Guide To Understand Negative Correlation | Outlier (8)

Intro to Statistics

How data describes our world.

Explore course

A Guide To Understand Negative Correlation | Outlier (9)

Precalculus

Master the building blocks of Calculus.

Explore course

A Guide To Understand Negative Correlation | Outlier (10)

Precalculus

Master the building blocks of Calculus.

Explore course

A Guide To Understand Negative Correlation | Outlier (11)

Calculus I

The mathematics of change.

Explore course

A Guide To Understand Negative Correlation | Outlier (12)

Calculus I

The mathematics of change.

Explore course

A Guide To Understand Negative Correlation | Outlier (13)

Intro to Microeconomics

Why small choices have big impact.

Explore course

A Guide To Understand Negative Correlation | Outlier (14)

Intro to Microeconomics

Why small choices have big impact.

Explore course
A Guide To Understand Negative Correlation | Outlier (2024)

FAQs

How should I interpret a negative correlation? ›

A negative, or inverse correlation, between two variables, indicates that one variable increases while the other decreases, and vice-versa. This relationship may or may not represent causation between the two variables, but it does describe an observable pattern.

Which relationship has a negative correlation explain your answer? ›

A negative correlation is a relationship between two variables that move in opposite directions. In other words, when variable A increases, variable B decreases.

What is a negative correlation for dummies? ›

Negative correlation is a relationship between two variables in which one variable increases as the other decreases, and vice versa. In statistics, a perfect negative correlation is represented by the value -1.0, while a 0 indicates no correlation, and +1.0 indicates a perfect positive correlation.

How do you interpret correlation answers? ›

Correlation coefficients can range from -1.0 to 1.0. They can be interpreted by both their magnitude and sign. For example, a correlation of 0.9 indicates a very strong positive correlation; a change in a first variable is a strong indicator of a similar change in a second variable.

Which best describes a negative correlation? ›

A negative correlation describes the extent to which two variables move in opposite directions. For example, for two variables, X and Y, an increase in X is associated with a decrease in Y.

What is an example of negative correlation? ›

The value of a strong negative correlation is to show as one variable increases, another variable reliably decreases. For example, the number of mistakes made on simple tasks increases as average hours of sleep per night decreases.

What is the range of a negative correlation? ›

Correlations can range from −1 to +1, where −1 means a perfect negative correlation (as one variable goes up the other goes down), 0 means no correlation (the variables are independent with no pattern of relationship), and +1 means a perfect (error-free) positive correlation between two variables (both go up and down ...

What does a negative R value mean in statistics? ›

Positive r values indicate a positive correlation, where the values of both variables tend to increase together. Negative r values indicate a negative correlation, where the values of one variable tend to increase when the values of the other variable decrease.

What happens in a negative correlation quizlet? ›

Negative correlation means that as one variable goes up or down, the other goes the opposite way.

What is the best negative correlation? ›

If a correlation is exactly -1, it is generally referred to as a "perfect negative correlation." Aside from those categorizations, correlations can also be characterized as weak, moderate, semi-strong, or strong—depending on the degree to which two (or more) underlyings move together.

Which relationship could have a negative correlation? ›

The relationships that could have a negative correlation are: number of hours worked and free time. speed of a car and minimum stopping distance. a tadpole's age and the length of its tail.

How to interpret negative correlation coefficient? ›

For negative correlation coefficients, high values of one variable are associated with low values of another variable. For example, there is a negative correlation coefficient for school absences and grades. As the number of absences increases, the grades decrease.

What is a simple way to explain correlation? ›

Correlation describes the relationship between variables. It can be described as either strong or weak, and as either positive or negative.

What does moderate negative correlation mean? ›

It simply means that there is some relationship between the two variables in question, but that there's also a lot of randomness affecting one or both variables, or perhaps other variables affect the two variables in question, so the direct relationship isn't strong, but it's certainly noticeable.

What does it mean when the correlation is negative? ›

Negative correlation, or inverse correlation, describes a situation where, with two variables, one variable increases in value while the other decreases. You might see negative correlation represented with a -1. This shows that while x, or the first variable, gains value, y, or the second variable, decreases in value.

How do you describe a negative correlation graph? ›

A scatterplot with a negative correlation is a graph that shows that all of the data points are in a pattern trending down from left to right. The scatterplot shows that, in general, as x increases, y decreases which means the data points have a negative association or relationship.

What does a negative correlation of 0.5 mean? ›

Negative correlation is measured from -0.1 to -1.0. Weak negative correlation being -0.1 to -0.3, moderate -0.3 to -0.5, and strong negative correlation from -0.5 to -1.0. The stronger the negative correlation, the more the stocks tend to be on the opposite side of their mean. Learn more about Negative Correlation.

Top Articles
Latest Posts
Article information

Author: Jamar Nader

Last Updated:

Views: 6476

Rating: 4.4 / 5 (55 voted)

Reviews: 86% of readers found this page helpful

Author information

Name: Jamar Nader

Birthday: 1995-02-28

Address: Apt. 536 6162 Reichel Greens, Port Zackaryside, CT 22682-9804

Phone: +9958384818317

Job: IT Representative

Hobby: Scrapbooking, Hiking, Hunting, Kite flying, Blacksmithing, Video gaming, Foraging

Introduction: My name is Jamar Nader, I am a fine, shiny, colorful, bright, nice, perfect, curious person who loves writing and wants to share my knowledge and understanding with you.