dev-resources.site

for different kinds of informations.

Statistics with R - Measures of Central Tendency and Measures of Dispersion

Published at

8/17/2024

mtcars

data(mtcars)
head(mtcars)

Loads and displays the first few rows of the mtcars dataset.

str(mtcars)

Displays the structure of the mtcars dataset, showing the type of each column.

summary(mtcars)

Measures of Central Tendency

Mean

\mu = \frac{1}{N} \sum_{i=1}^{N} x_i

Calculates the mean of a sequence of numbers.

n = c(1,2,4,5,6)

print(n)

mean_ = sum(n) / length(n)

print(mean_)

mean_cyl = sum(mtcars$cyl) / length(mtcars$cyl) 

print(mean_cyl)

Median

If ( N ) is odd:

\text{Med} = x_{\left(\frac{N+1}{2}\right)}

If ( N ) is even:

\text{Med} = \frac{x_{\left(\frac{N}{2}\right)} + x_{\left(\frac{N}{2} + 1\right)}}{2}

Calculates the median of a sequence of numbers with an odd size.

data_even <- c(7, 13, 19, 33, 67)

median_ <- median(data_even)
print(median_)

data_even <- c(7, 13, 19, 33, 67)
n = length(data_even)
median_ <- data_even[(n + 1) / 2]
print(median_)

Calculates the median of a sequence of numbers with an even size.

data_odd <- c(2, 34, 76, 92, 112)

median_ <- median(data_odd)
print(median_)

data_odd <- c(2, 34, 76, 92, 112)
n = length(data_odd)

median_ <- (data_odd[n / 2] + data_odd[n / 2 + 1]) / 2

print(median_)

median(mtcars$cyl)

median(mtcars$qsec)

Mode

\text{Mode} = \underset{x_i}{\operatorname{argmax}} \ f(x_i)

Creates a frequency table for a sequence of numbers.

numbers <- c(1, 233, 233, 010101, 342, 1, 2, 1111, 1, 55)

tnumbers <- table(numbers)
print(numbers)
print(tnumbers)

mode_ <- as.numeric(names(tnumbers)[tnumbers == max(tnumbers)])
print(mode_)

Identifies the most frequent value(s) in the sequence of numbers.

library(DescTools)

mode_ <- Mode(tnumbers)
print(mode_)

Measures of Dispersion

Defines a sequence of numbers.

n_arr = c(1,2,4,5,6)
print(n_arr)

Variance

\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2

Calculates the variance of a sequence of numbers.

mean_ <- mean(n_arr)

print('Mean')
print(mean_)

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)

print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)

Standard Deviation

\sigma = \sqrt{\sigma^2}

Calculates the standard deviation, which is the square root of the variance.

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)
print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)

print('Standard Deviation')
std_ <- sqrt(var_)
print(std_)

Calculates the standard deviation using the sd function in R.

std_ <- sd(n_arr)
print(std_)

Range

\text{Range} = x_{\text{max}} - x_{\text{min}}

Calculates the range, which is the difference between the maximum and minimum values.

range_ <- max(n_arr) - min(n_arr)
print('Range')
print(max(n_arr))
print(min(n_arr))
print(range_)

Calculates the range using the diff function.

range_ <- diff(range(n_arr))
print(range_)

Coefficient of Variation

\text{CV} = \frac{\sigma}{\mu}

Calculates the coefficient of variation, which is the ratio of the standard deviation to the mean.

mean_ <- mean(n_arr)
print('Mean')
print(mean_)

print('Variance')
var_ <- sum((n_arr - mean_)^2) / length(n_arr)
print((n_arr - mean_))
print((n_arr - mean_)^2)
print(sum((n_arr - mean_)^2))
print(length(n_arr))
print(var_)

print('Standard Deviation')
std_ <- sqrt(var_)
print(std_)

print('Coefficient of Variation')
cv <- std_ / mean_
print(cv)

My Latest Posts

Did you say Free GPU?

sc0v0ne ・ Feb 29 '24

#python #gpu #ai #machinelearning

My Super Powers as a Software Developer - 2024

sc0v0ne ・ Jan 6 '24

#tools #softwaredevelopment #workstations #productivity

Becoming Efficient with Pomodoro

sc0v0ne ・ Feb 12 '24

#tools #productivity #softwaredevelopment

Favorites Projects Open Source

About the author:

sc0v0ne

Machine learning, deep learning, and raw code. Presented clearly and with examples.

A little more about me...

Graduated in Bachelor of Information Systems, in college I had contact with different technologies. Along the way, I took the Artificial Intelligence course, where I had my first contact with machine learning and Python. From this it became my passion to learn about this area. Today I work with machine learning and deep learning developing communication software. Along the way, I created a blog where I create some posts about subjects that I am studying and share them to help other users.

I'm currently learning TensorFlow and Computer Vision

Curiosity: I love coffee

statistics Article's

30 articles in total

Different kinds of machine learning methods - supervised, unsupervised, parametric, and non-parametric