How to calculate deviation
Understanding deviation is essential in the realm of statistics because it provides insight into the spread or dispersion of data within a dataset. Calculating deviation can be done with various methods, including Standard Deviation, Mean Absolute Deviation, and Median Absolute Deviation. This article will walk you through the basic concepts and steps for calculating each type of deviation.
1. Standard Deviation
Standard Deviation is a widely used measure to quantify the dispersion within a dataset, and it is expressed using the same unit as the data points. Calculating Standard Deviation involves the following steps:
a) Calculate the Mean
– Find the sum of all data points.
– Divide the sum by the number of data points in the dataset.
b) Determine Differences Each Data Point and Mean Value
– Subtract the mean from each data point in the dataset.
c) Square These Differences
– Square each difference obtained in step b).
d) Determine Variance
– Add all squared differences from step c).
– Divide this sum by the total number of data points in the dataset.
e) Calculate Standard Devation
– Take square root of variance calculated in step d).
2. Mean Absolute Deviation (MAD)
Mean Absolute Deviation is an alternative method to measure deviation using absolute values. The steps to find MAD are as follows:
a) Calculate The Mean
– Find the sum of all data points.
– Divide this sum by how many data points there are.
b) Determine Differences Each Data Point and Mean Value
– Subtract mean from each individual data point.
c) Calculate Absolute Value of These Differences
– Convert any negative results from step b) into positive values.
d) Obtain MAD
– Sum all absolute differences from step c).
– Divide this sum by total count of data points.
3. Median Absolute Deviation (MedAD)
Median Absolute Deviation is a robust statistic to calculate deviation, ideal for datasets with outliers. The steps to find MedAD are:
a) Determine Dataset Median
– Arrange data points in ascending order.
– Identify the middle value (median).
b) Determine Differences Between Each Data Point and Median Value
– Subtract the median from each data point.
c) Calculate Absolute Value of These Differences
– Convert any negative results from step b) into positive values.
d) Obtain MedAD
– Arrange the absolute differences in ascending order.
– Identify the middle value as the MedAD.
Conclusion
Calculating deviation helps in identifying the nature and spread of data points within a dataset. It is crucial in statistical analysis, enabling one to grasp an initial understanding of dataset tendencies or gather more complex knowledge from it. By following these steps, you will be able to effectively measure deviation using Standard Deviation, Mean Absolute Deviation, and Median Absolute Deviation methods.