Solving Variables Puzzle: Correlation Coefficients & SPSS for N=850

In summary, Leigh13 suggests that you use Spearman's correlation coefficient to calculate the degree of correlation between two ordinal variables. Spearman's correlation coefficient is calculated by subtracting each observation's rank in variable (a) from their rank in variable (b) in column d. The data set is then sorted according to the next variable (Intensity(a)). Ties with dissimilar ranks are resolved using the same method as step 3. Once the data is sorted, the d values are calculated and finally the d^2 values are summed to produce Spearman's r.
  • #1
Leigh13
1
0
Hi guys!

I have never used statistics in my life, so now I'm completely lost.

I have only 3 variables -

the year of the company founding (up to 1993, 1994-2004, 2005-now), the technological intensity of their products (that's either a, b, c or d) and the number of foreign markets (that's either 1, 2, 3 or more)

I would like to check the correlation between the year and the tech. intensity, the year and the number of markets, and the intensity and the number of markets

I understand I'm looking for some sort of correlation, but which coefficient to check - Spearman or Pearson?

And how to get the variables into SPSS? n=850 Do I just put 3 columns one next to each other?

Thanks so much for help!
 
Mathematics news on Phys.org
  • #2
Leigh13 said:
Hi guys!

I have never used statistics in my life, so now I'm completely lost.

I have only 3 variables -

the year of the company founding (up to 1993, 1994-2004, 2005-now), the technological intensity of their products (that's either a, b, c or d) and the number of foreign markets (that's either 1, 2, 3 or more)

I would like to check the correlation between the year and the tech. intensity, the year and the number of markets, and the intensity and the number of markets

I understand I'm looking for some sort of correlation, but which coefficient to check - Spearman or Pearson?

And how to get the variables into SPSS? n=850 Do I just put 3 columns one next to each other?

Thanks so much for help!

Welcome to MHB, Leigh13! :)

All of your variables are ordinal.
That is, they have an ordering, but for instance an average is meaningless.
In particular that means that you cannot use Pearson.
So yes, Spearman is the way to go.

In SPSS you would indeed simply put 3 columns next to each other and specify you want to use Spearman.
 
  • #3
Also, try doing it in Excel! That way, you'll really get an intuitive feel for how Spearman's correlation coefficient works.

Calculating Spearman's r From Scratch
Step 1: Create Your Table
Create columns for the IDs, variables, ranks, d, and d^2 (see table 1 below).

Step 2: Sort According to the First Variable
Sort the (entire! i.e. all columns of your data set!) data according to your first variable (Intensity(a)). Make a note of whether you sorted them in ascending, or descending order. In the example below, I used ascending order and gave the value "a" for the variable Intensity(a) rank 1, value "b" rank 2 and so forth.

Step 3: Assign First Variable Ranks, Deal with Ties
Assign ranks to each observation's value for the variable we sorted in step 2 (i.e. Intensity(a)). Now, you may notice that some of the observations have the same variable value, but different ranks. These are called ties, and need to receive new and equal ranks for the calculations to work.

e.g. observations with IDs 2, 3, and 7 all share a value of "a" in the Intensity(a) variable, but hold the dissimilar ranks 4, 5, and 6. Their new equal rank will be: (6 + 5 + 4)/3 = 5.

Step 4: Sort According to the Second Variable
Sort your according to the next variable. Make sure that the ranking system follows the same pattern as your first variable. In our case, "a" was considered high and received the rank of one. Therefore, the observation with 8 markets will receive the rank of 1, the observation with 7 markets will receive rank 2 and so forth.

Step 5: Assign Second Variable Ranks, Deal with Ties
Assign ties with dissimilar ranks new equal ranks using the same method as step 3.

Step 6: Sort According to Observation IDs
Re-sort all of your data according to your ID numbers (not a requirement, but it makes the data easier to read).

Step 7: Calculate d
Subtract each observation's rank in variable (a) from their rank in variable (b) in column d. Note that if these are summed up, you will always get zero.

Step 8: Calculate d^2
Raise each value in d by two in order to make sure that their sum is'nt zero.

Step 9:
Sum the values of d^2 and use it in the formula below.

Formula:
\(\displaystyle Spearman's r = 1 - \frac{6\sum_{x = 1}^n {d^2}}{n(n^2-1)} = 1 - \frac{6*11,5}{7(7^2-1)} = 0,79\)
Table 1. Calculating Spearman's r in Excel/Open Office
IDIntensity(a)RANK(a)#Markets(b)RANK(b)dd^2
1a1.572-0.50.25
2c554.50.50.25
3c546.5-1.52.25
4d754.52.56.25
5b36300
6a1.5810.50.25
7c546.5-1.52.25
SUM:011.5

Fun Experiments! Plug in and see what happens to the results.
1. What happens if all "d" values are zero?
2. What happens if all "d" values are negative?
3. What happens when the ranks are perfectly dissimilar? And similar?
 
Last edited:

FAQ: Solving Variables Puzzle: Correlation Coefficients & SPSS for N=850

What is the purpose of solving variables puzzle in correlation coefficients and SPSS for N=850?

The purpose of solving variables puzzle in correlation coefficients and SPSS for N=850 is to understand the relationship between two or more variables and determine how strong or weak the relationship is. This can help in making predictions and understanding patterns in data.

What is the difference between correlation coefficient and SPSS?

Correlation coefficient is a statistical measure that shows the strength and direction of the relationship between two variables. SPSS (Statistical Package for the Social Sciences) is a software used for statistical analysis and data management. It can be used to calculate correlation coefficients and other statistical measures.

How is correlation coefficient calculated?

Correlation coefficient is calculated by dividing the covariance of two variables by the product of their standard deviations. This gives a value between -1 and 1, where -1 indicates a perfect negative correlation, 0 indicates no correlation, and 1 indicates a perfect positive correlation.

What does a correlation coefficient of 0.5 mean?

A correlation coefficient of 0.5 indicates a moderately positive relationship between two variables. This means that as one variable increases, the other variable also tends to increase, but not necessarily at the same rate. A value of 0.5 is considered a moderate correlation, with 1 being a strong correlation and 0 being no correlation.

How can I interpret the results of a correlation coefficient?

The results of a correlation coefficient should be interpreted based on the strength and direction of the relationship. A positive coefficient indicates a positive relationship, where both variables tend to increase or decrease together. A negative coefficient indicates an inverse relationship, where one variable increases while the other decreases. The closer the value is to 1 or -1, the stronger the relationship. It is also important to consider the context and other factors when interpreting correlation coefficients.

Back
Top