This vignette contains answers to some frequently asked questions about the package bpnreg
for analyzing Bayesian projected normal circular regression and mixed-effects models. Answers are given and illustrated with the Motor
and Maps
datasets.
To obtain more information about the Motor
and Maps
datasets the following code can be used:
library(bpnreg)
?Maps ?Motor
A circular regression model for the Motor
data can be fit using the bpnr
function as follows:
bpnr(Phaserad ~ Cond + AvAmp, data = Motor)
Note that categorical variables should be of class factor
in order to be handled correctly by bpnreg.
A circular mixed-effects model for the Maps
data can be fit using the bpnme
function as follows:
bpnme(Error.rad ~ Maze + Trial.type + (1|Subject),data = Maps, its = 100)
Note that categorical variables should be of class factor
in order to be handled correctly by bpnreg.
An interaction effect between the Cond
and AvAmp
variables can be included in the regression model in the following two ways:
bpnr(Phaserad ~ Cond + AvAmp + Cond:AvAmp, data = Motor)
bpnr(Phaserad ~ Cond*AvAmp, data = Motor)
The input data should be formatted as a standard R
data.frame
or dplyr
tbl()
.
In case of missing values in the input data an error will be returned. In case of mixed effects models subgroups do not need to be of the same size, e.g. in case of repeated measures data not all individuals need to have been observed at each measurement occasion.
The dependent variable should be a circular variable measured on a scale from 0 to 2\(\pi\) radians or -\(\pi\) to \(\pi\) radians. An warning message if returned in case the dependent variable contains values outside these ranges.
The package does not currently contain an option to include a user-specified priors. Priors for regression coefficients and fixed-effect coefficients used in the current version of the package are uninformative normal distributions, \(N(0, 10000)\).
No. Unlike packages for mixed-effects models such as lme4
and nlme
the mixed-effects model in bpnreg
may only contain one nesting variable/grouping factor.
A seed can be specified in the bpnr
and bpnme
functions using the seed
option as follows:
bpnr(Phaserad ~ Cond + AvAmp, data = Motor, seed = 101)
The coef_circ()
function can be used to extract model summaries for both categorical and continuous predictors. The units in which the results are displayed can be chosen to be either radians or degrees.
<- bpnr(Phaserad ~ Cond + AvAmp, data = Motor, seed = 101) fit
E.g. circular coefficients for the circular regression model above can be obtained as follows:
coef_circ(fit, type = "continuous", units = "degrees")
coef_circ(fit, type = "categorical", units = "degrees")
coef_circ(fit, type = "continuous", units = "radians")
coef_circ(fit, type = "categorical", units = "radians")
coef_circ()
function:<- bpnr(Phaserad ~ Cond + AvAmp, data = Motor, seed = 101) fit
To obtain circular coefficients in degrees for the continuous variable AvAmp
in the circular regression model above are obtained as:
coef_circ(fit, type = "continuous", units = "degrees")
#> mean mode sd LB HPD UB HPD
#> AvAmp ax 81.60680954 74.107655480 115.262007 -1.541343e+02 276.10492950
#> AvAmp ac 37.29736965 -25.096914448 72.470701 -4.541639e+01 145.44390534
#> AvAmp bc -1.67909812 -0.612950343 152.053102 -2.614163e+01 17.27149184
#> AvAmp AS 0.08110837 0.001181389 1.532724 -2.631771e-01 0.31289926
#> AvAmp SAM 0.05039833 0.005540038 1.478331 3.711198e-05 0.06059786
#> AvAmp SSDO 0.25235872 1.756779390 1.963975 -2.636442e+00 2.85796861
The output returns summary statistics for the posterior distributions for several parameters of the circular regression line of the AvAmp
variable.
These parameters are interpreted as follows:
ax
= the location of the inflection point of the regression curve on the axis of the predictor.ac
= the location of the inflection point of the regression curve on the axis of the circular outcome.bc
= the slope of the tangent line at the inflection point. An increase of 1 unit of the predictor at the inflection point leads to a bc
change in the circular outcome.AS
= the average slopes of the circular regression. An increase of 1 unit of the predictor leads to a AS
change in the circular outcome on average.SAM
= the circular regression slopes at the mean.An increase of 1 unit of the predictor leads to a SAM
change in the circular outcome at the average predictor value.SSDO
= the signed shortest distance to the origin.A more detailed explanation of the above parameters is given in Cremers, Mulder & Klugkist (2018).
To obtain circular coefficients in degrees for the categorical variable Cond
we use:
coef_circ(fit, type = "categorical", units = "degrees")
#> $Means
#> mean mode sd LB UB
#> (Intercept) 47.60691 47.46081 12.47107 23.71734 70.70371
#> Condsemi.imp 15.93874 15.29835 22.01006 -28.42677 57.42232
#> Condimp 28.31828 37.56589 25.50625 -26.30714 77.40392
#> Condsemi.impCondimp -62.81396 -71.20733 59.58785 -170.56951 85.77068
#>
#> $Differences
#> mean mode sd LB UB
#> Condsemi.imp 31.74145 38.90969 26.89524 -19.95152 86.87231
#> Condimp 19.46819 22.77203 30.56586 -42.88731 80.42033
#> Condsemi.impCondimp 115.96819 141.80603 57.61487 -56.90659 203.30989
The output returns summary statistics for the posterior distributions of the circular means for all categories and combination of categories of the categorical variables in the model, as well as differences between these means.
By using the fit()
function on a bpnr
or bpnme
object 5 different fit statistics together with the (effective) number of parameters they are based on can be obtained.
fit(fit)
#> Statistic Parameters
#> lppd -57.1423 8.000000
#> DIC 130.1116 8.024107
#> DIC.alt 132.1696 9.053110
#> WAIC1 129.8904 7.802881
#> WAIC2 131.6491 8.682262
All five fit statistics are computed as in Gelman et.al. (2014). The lppd
is an estimate of the expected log predictive density, the DIC
is the Deviance Information Criterion, the DIC_alt
is a version of the DIC that uses a slightly different definition of the effective number of parameters, the WAIC1
and WAIC2
are the two versions of the Watanabe-Akaike or Widely Available Information Criterion presented in Gelman et.al. (2014).
Raw posterior estimates are stored in the following objects:
a.x
= posterior samples for the the locations of the inflection point of the regression curve on the axis of the predictor.a.c
= posterior samples for the the locations of the inflection point of the regression curve on the axis of the circular outcome.b.c
= posterior samples for the slopes of the tangent line at the inflection point.AS
= posterior samples for the average slopes of the circular regression.SAM
= posterior samples for the circular regression slopes at the mean.SSDO
= posterior samples for the signed shortest distance to the origin.circ.diff
= posterior samples for the circular differences between intercept and other categories of categorical variables.beta1
= posterior samples for the fixed effects coefficients for the first component.beta2
= posterior samples for the fixed effects coefficients for the second component.In circular mixed-effects models the following additional parameters can be obtained:
b1
= posterior samples for the random effects coefficients for the first component.b2
= posterior samples for the random effects coefficients for the second component.circular.ri
= posterior samples for the circular random intercepts for each individual.omega1
= posterior samples for the random effect variances of the first component.omega2
= posterior samples for the random effect variances of the first component.cRS
= posterior samples for the circular random slope variance.cRI
= posterior samples of the mean resultant length of the circular random intercept, a measure of concentration.E.g. to obtain the first six posterior samples for beta1
and a.x
we use the following code:
head(fit$beta1)
#> (Intercept) Condsemi.imp Condimp AvAmp
#> [1,] 0.3454734 0.5982656 0.4816842 0.0140926098
#> [2,] 0.9452224 0.5676390 -0.7671751 -0.0067083928
#> [3,] 0.7600679 0.2982861 -0.1715743 -0.0060381475
#> [4,] 1.3186083 -0.3477482 -0.5248282 0.0003941233
#> [5,] 1.3624594 -0.4803601 -0.8509008 -0.0059952280
#> [6,] 0.9594075 0.3900853 -0.3079756 -0.0073404834
head(fit$a.x)
#> AvAmp
#> [1,] 41.70181
#> [2,] -27.74747
#> [3,] 76.66307
#> [4,] 67.55377
#> [5,] 108.64874
#> [6,] 175.43192
<- bpnme(Error.rad ~ Maze + Trial.type + (1|Subject), Maps) fitme
In circular mixed-effects models the following parameters contain the individual random effects and random effect variances:
b1
= posterior samples for the random effects coefficients for the first component.b2
= posterior samples for the random effects coefficients for the second component.circular.ri
= posterior samples for the circular random intercepts for each individual.omega1
= posterior samples for the random effect variances of the first component.omega2
= posterior samples for the random effect variances of the first component.cRS
= posterior samples for the circular random slope variance (for bc
).cRI
= posterior samples of the mean resultant length of the circular random intercept, a measure of concentration.E.g. if we want to obtain the posterior mean for the mean resultant length of the circular random intercept we use the following code:
mean(fitme$cRI)
#> [1] 0.9998607
This estimate is very close to 1, meaning there is almost no variation in the individual circular random intercepts. We can check this by plotting the posterior means of the individual circular random intercepts (in degrees):
apply(fitme$circular.ri, 1, mean_circ)*180/pi
#> [1] -13.18457 -13.20623 -13.39053 -13.28647 -13.28007 -13.29426 -13.19401
#> [8] -13.27823 -13.11418 -13.29843 -13.34283 -13.38288 -13.28338 -13.29185
#> [15] -13.18482 -13.24974 -13.20127 -13.16453 -13.47421 -13.15180
Indeed the circular random intercepts of the 20 individuals in the Maps
data lie very close together.
An explanation of the computation of random intercept and slope variances can be found in the supplementary material of Cremers, Pennings, Mainhard & Klugkist (2021).
Because projected normal models are heterogeneous models, i.e. they simultaneously model mean and variance, we can in addition to investigating effects on the circular mean also investigate effects on the circular variance.
Kendall (1974) gives the following formula for computation of the circular variance in a projected normal model:
\[1 - \hat{\rho} = 1 - \sqrt{\pi\xi/2}\exp{-\xi}[I_0(\xi) + I_1(\xi)]\]
where \(\xi = ||\boldsymbol{\mu}||^2\) and \(I_\nu()\) is the modified Bessel function of the first kind and order \(\nu\). For the effect of a variable \(x\) on the variance, \(\boldsymbol{\mu} = (\beta_0^I + \beta_1^Ix,\beta_0^{II} + \beta_1^{II}x)\), where \(\beta_0^I\), \(\beta_1^I\), \(\beta_0^{II}\) and \(\beta_1^{II}\) are the intercepts and slopes of the first and second linear component respectively.
Automated summaries for effects on the circular variance are however not yet implemented in bpnreg
and will need to be computed explicitly using the raw posterior samples. E.g. the effect of the trial type on the circular variance can be computed as follows:
<- fitme$beta1[,"(Intercept)"]
a1 <- fitme$beta2[,"(Intercept)"]
a2 <- fitme$beta1[,"Trial.type1"]
b1 <- fitme$beta2[,"Trial.type1"]
b2
<- sqrt((a1)^2 + (a2 + b2)^2)^2/4
zeta_standard <- 1 - sqrt((pi * zeta_standard)/2) * exp(-zeta_standard) *
var_standard besselI(zeta_standard, 0) + besselI(zeta_standard, 1))
(
<- sqrt((a1 + b1)^2 + (a2 + b2)^2)^2/4
zeta_probe <- 1 - sqrt((pi * zeta_probe)/2) * exp(-zeta_probe) *
var_probe besselI(zeta_probe, 0) + besselI(zeta_probe, 1))
(
<- c(mode_est(var_standard),
standard mean(var_standard),
sd(var_standard),
hpd_est(var_standard))
<- c(mode_est(var_probe),
probe mean(var_probe),
sd(var_probe),
hpd_est(var_probe))
<- rbind(standard, probe)
results
colnames(results) <- c("mode", "mean", "sd", "HPD LB", "HPD UB")
rownames(results) <- c("standard", "probe")
The posterior estimates of the circular variance for the standard and probe trials are:
results#> mode mean sd HPD LB HPD UB
#> standard 0.08664707 0.10494372 0.02334300 0.05901261 0.1471963
#> probe 0.06332233 0.07409204 0.01829064 0.04351343 0.1090560
From these results we conclude that the circular variance for the standard and probe trials is not significantly different, the 95% Highest Posterior Density (HPD) intervals overlap.
Cremers, J. , Mulder, K.T. & Klugkist, I. (2018). Circular Interpretation of Regression Coefficients. British Journal of Mathematical and Statistical Psychology, 71(1), 75-95.
Cremers, J., Pennings, H.J.M., Mainhard, T. & Klugkist, I. (2021). Circular Modelling of Circumplex Measurements for Interpersonal Behavior. Assessment, 28(2), 585-600.
Gelman, A., Carlin, J.B., Stern, H.S., Dunson, D.B., Vehtari, A. & Rubin, D. (2014). Bayesian Data Analysis, 3rd ed.
Kendall, D.G. (1974). Pole-seeking Brownian motion and bird navigation. Journal of the Royal Statistical Society. Series B, 37, 97–133.