Section author: Ravi Selker, Jonathon Love, Damian Dropmann
Descriptives (descriptives)
Description
Descriptives are an assortment of summarising statistics, and visualizations which allow exploring the shape and distribution of data. It is good practice to explore your data with descriptives before proceeding to more formal tests.
Usage
descriptives(
data,
vars,
splitBy = NULL,
freq = FALSE,
hist = FALSE,
dens = FALSE,
bar = FALSE,
barCounts = FALSE,
box = FALSE,
violin = FALSE,
dot = FALSE,
dotType = "jitter",
qq = FALSE,
n = TRUE,
missing = TRUE,
mean = TRUE,
median = TRUE,
mode = FALSE,
sum = FALSE,
sd = TRUE,
variance = FALSE,
range = FALSE,
min = TRUE,
max = TRUE,
se = FALSE,
skew = FALSE,
kurt = FALSE,
sw = FALSE,
quart = FALSE,
pcEqGr = FALSE,
pcNEqGr = 4,
formula
)
Arguments
|
the data as a data frame |
|
a vector of strings naming the variables of interest in |
|
a vector of strings naming the variables used to split |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
. |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
an integer (default: 4) specifying the number of equal groups |
|
(optional) the formula to use, see the examples |
Output
A results object containing:
|
a table of the descriptive statistics |
|
an array of frequency tables |
Tables can be converted to data frames with asDF or
as.data.frame(). For example:
results$descriptives$asDF
as.data.frame(results$descriptives)
Examples
data('mtcars')
dat <- mtcars
# frequency tables can be provided for factors
dat$gear <- as.factor(dat$gear)
descriptives(dat, vars = vars(mpg, cyl, disp, gear), freq = TRUE)
#
# DESCRIPTIVES
#
# Descriptives
# -------------------------------------------
# mpg cyl disp gear
# -------------------------------------------
# N 32 32 32 32
# Missing 0 0 0 0
# Mean 20.1 6.19 231 3.69
# Median 19.2 6.00 196 4.00
# Minimum 10.4 4.00 71.1 3
# Maximum 33.9 8.00 472 5
# -------------------------------------------
#
#
# FREQUENCIES
#
# Frequencies of gear
# --------------------
# Levels Counts
# --------------------
# 3 15
# 4 12
# 5 5
# --------------------
#
# spliting by a variable
descriptives(formula = disp + mpg ~ cyl, dat,
median=F, min=F, max=F, n=F, missing=F)
# providing histograms
descriptives(formula = mpg ~ cyl, dat, hist=T,
median=F, min=F, max=F, n=F, missing=F)
# splitting by multiple variables
descriptives(formula = mpg ~ cyl:gear, dat,
median=F, min=F, max=F, missing=F)