R Tutorial
Fundamentals of R
Variables
Input and Output
Decision Making
Control Flow
Functions
Strings
Vectors
Lists
Arrays
Matrices
Factors
DataFrames
Object Oriented Programming
Error Handling
File Handling
Packages in R
Data Interfaces
Data Visualization
Statistics
Machine Learning with R
In statistics, measures like the average (or mean), variance, and standard deviation help describe the central tendency and spread of data. In R, there are built-in functions to calculate each of these measures.
The mean()
function calculates the average of a numeric set of data.
data <- c(5, 7, 8, 9, 10) avg <- mean(data) print(avg) # Output: 7.8
The var()
function calculates the variance of a set of data. Variance measures the spread or dispersion of the data points from the mean.
data <- c(5, 7, 8, 9, 10) variance <- var(data) print(variance) # Output: 6.7
The sd()
function calculates the standard deviation of a set of data. The standard deviation is the square root of variance and provides a measure of the average distance between each data point and the mean.
data <- c(5, 7, 8, 9, 10) std_dev <- sd(data) print(std_dev) # Output: 2.588
data <- c(5, 7, 8, 9, 10) # Calculate mean avg <- mean(data) # Calculate variance variance <- var(data) # Calculate standard deviation std_dev <- sd(data) cat("Average:", avg, "\n") cat("Variance:", variance, "\n") cat("Standard Deviation:", std_dev, "\n")
Output:
Average: 7.8 Variance: 6.7 Standard Deviation: 2.588
All these functions work with vectors. If applied to matrices or data frames, they operate column-wise.
They handle NA
values by returning NA
by default. To exclude NA
values in calculations, use the argument na.rm=TRUE
.
R offers a simple and efficient way to calculate statistical measures like average, variance, and standard deviation through its built-in functions. Understanding these measures is foundational for many statistical and data analysis tasks.
R Calculate Mean, Variance, and Standard Deviation Example:
# Generate sample data set.seed(123) data <- rnorm(100) # Calculate mean, variance, and standard deviation mean_value <- mean(data) variance_value <- var(data) sd_value <- sd(data)
How to Use mean() Function in R:
# Calculate mean using mean() function data <- c(1, 2, 3, 4, 5) mean_value <- mean(data)
Calculating Variance with var() Function in R:
# Calculate variance using var() function data <- c(1, 2, 3, 4, 5) variance_value <- var(data)
Standard Deviation Calculation in R Using sd():
# Calculate standard deviation using sd() function data <- c(1, 2, 3, 4, 5) sd_value <- sd(data)
Weighted Average in R with mean() Function:
# Calculate weighted average using mean() function values <- c(1, 2, 3, 4, 5) weights <- c(0.1, 0.2, 0.3, 0.2, 0.2) weighted_average <- sum(values * weights) / sum(weights)
Group-Wise Mean, Variance, and Standard Deviation in R:
# Group-wise mean, variance, and standard deviation data <- data.frame(group = rep(1:2, each = 50), value = rnorm(100)) group_means <- tapply(data$value, data$group, mean) group_variances <- tapply(data$value, data$group, var) group_sds <- tapply(data$value, data$group, sd)
Summary Statistics with describe() in R:
# Summary statistics with describe() function library(Hmisc) data <- rnorm(100) summary_stats <- describe(data)
Calculating Rolling Average, Variance, and Standard Deviation in Time Series Data in R:
# Calculate rolling average, variance, and standard deviation library(zoo) set.seed(123) time_series <- zoo(rnorm(10), order.by = seq(from = Sys.Date(), by = "days", length.out = 10)) rolling_mean <- rollmean(time_series, k = 3, fill = NA) rolling_variance <- rollapply(time_series, width = 3, FUN = var, fill = NA) rolling_sd <- rollapply(time_series, width = 3, FUN = sd, fill = NA)
Confidence Intervals for Mean in R:
# Confidence intervals for mean set.seed(123) data <- rnorm(100) mean_ci <- t.test(data)$conf.int
Comparing Mean and Standard Deviation Between Groups in R:
# Compare mean and standard deviation between groups group1 <- rnorm(50, mean = 5, sd = 2) group2 <- rnorm(50, mean = 8, sd = 3) mean_difference <- mean(group1) - mean(group2) sd_ratio <- sd(group1) / sd(group2)