Section author: Jonathon Love
The Spreadsheet
In jamovi, data is represented in a spreadsheet, with each column representing a “variable”.
Data Variables
The most commonly used variables in jamovi are “Data Variables”, these variables simply contain data either loaded from a data file, or “typed in” by the user. Data variables can be one three data types:
Integer
Decimal
Text
and one of four measure types:
The measure types are designated by the symbol in the header of the variable’s column. Note that some combinations of data-type and measure-type don’t make sense, and jamovi won’t let you choose these.
Nominal
and Ordinal
are, predictably, for nominal and ordinal variables. Continuous
is for variables with numeric values which are considered to be Interval or Ratio scales (Equivalent to Scale
in SPSS). The ID
measure type is, unlike the others, unique to jamovi. It’s intended for variables that contain identifiers that you would almost never want to analyse. For example, a persons name, or a participant ID. The advantage of IDs, is that jamovi does not need to maintain a list of levels internally, which can improve performance when interacting with very large data sets.
When starting with a blank spreadsheet and typing values in, the data and measure types will change automatically depending on the data you enter. This is a good way to get a feel for which variable types go with which sorts of data. Similarly, when opening a data file, jamovi will infer the variable type from the data in each column. In both cases, this automatic approach may not be correct, and it may be necessary to manually specify the data and measure type with the variable editor.
The variable editor can be invoked by selecting Setup
from the Data
tab, double-clicking on the column header, or by pressing F3
. The variable editor allows you to change the name of the variable, and (for data variables) the data type, the measure type, the order of the levels, and the label displayed for each level. The variable editor can be dismissed by clicking the close arrow, or by pressing F3
again.
New variables can be inserted or appended to the data set using the Add
button from the data ribbon. The Add
button also allows the addition of Computed variables.
Computed Variables
Computed Variables are those which take their value by performing a computation on other Variables. Computed Variables can be used for a range of purposes, including log transforms, z-scores, sum-scores, negative scoring and means.
Computed variables can be added to the data set, with the Add
button available on the data tab. This will produce a formula box where you can specify the formula. The usual arithmetic operators are available. Some examples of formulas are:
A + B
LOG10(len)
MEAN(A, B)
(dose - VMEAN(dose)) / VSTDEV(dose)
Z(dose)
In order, these are the sum of A and B, a log (base 10) transform of len
, the mean of A
and B
, and the z-score of dose
(twice).
There are many more functions available.
V
-functions
A number of functions appear in pairs, one prefixed with a V
and the other not. V
functions perform their calculation on a variable as a whole, where as non-V
functions perform their calculation row by row. For example, MEAN(A, B)
will produce the mean of A
and B
for each row. Where as VMEAN(A)
gives the mean of all the values in A
.
Additionally V
functions support a group_by
argument. When a group_by
variable is specified, a separate value is calculated for each level of the group_by
variable. In the following example:
VMEAN(len, group_by=dose)
A separate mean is calculated for each level of dose
, and each value in the computed variable will be the mean corresponding to it’s row’s value of dose
.
Recoded Variables
Transformed and recoded variables are also available. See this blog post for details.
Filters
Filters in jamovi allow you to filter out rows that you don’t want included in your analysis. For example, you might want to only include people’s survey responses if they explicitly consented to having their data used, or you might want to exclude all left-handed people, or perhaps people who score “below chance” in an experimental task. In some cases you just want to exclude extreme scores, for example those that score more than 3 standard deviations from the mean.
The filters in jamovi are build on top of jamovi’s computed variable formula system, which allows the building of arbitrarily complex formulas.
More details about filters are covered in this blog post