Customizing RBesT plots

Baldur Magnusson

2022-08-08

Introduction

This vignette demonstrates how to work with the forest plot provided by the RBesT package. We show how to modify the default plot and, for advanced users, how to extract data from a ggplot object to create a new plot from scratch. Finally we recreate plots from a case study presented in the training materials.

For more general information on plotting in R, we recommend the following resources:

# Load required libraries
library(RBesT)
library(ggplot2)
library(dplyr)
library(tidyr)
library(bayesplot)

# Default settings for bayesplot
color_scheme_set("blue")
theme_set(theme_default(base_size=12))

# Load example gMAP object
set.seed(546346)
map_crohn <- gMAP(cbind(y, y.se) ~ 1 | study,
                  family=gaussian,
                  data=transform(crohn, y.se=88/sqrt(n)),
                  weights=n,
                  tau.dist="HalfNormal", tau.prior=44,
                  beta.prior=cbind(0,88))

print(map_crohn)
## Generalized Meta Analytic Predictive Prior Analysis
## 
## Call:  gMAP(formula = cbind(y, y.se) ~ 1 | study, family = gaussian, 
##     data = transform(crohn, y.se = 88/sqrt(n)), weights = n, 
##     tau.dist = "HalfNormal", tau.prior = 44, beta.prior = cbind(0, 
##         88))
## 
## Exchangeability tau strata: 1 
## Prediction tau stratum    : 1 
## Maximal Rhat              : 1 
## Estimated reference scale : 88 
## 
## Between-trial heterogeneity of tau prediction stratum
##  mean    sd  2.5%   50% 97.5% 
## 14.50 10.10  1.09 12.20 41.10 
## 
## MAP Prior MCMC sample
##  mean    sd  2.5%   50% 97.5% 
## -49.8  19.3 -93.2 -48.3 -11.5

Forest plot

The default forest plot is a “standard” forest plot with the Meta-Analytic-Predictive (MAP) prior additionally summarized in the bottom row:

forest_plot(map_crohn)

We can also include the model-based estimates for each study, and add a legend to explain the different linetypes.

forest_plot(map_crohn, model="both") + legend_move("right")

We can modify the color scheme as follows (refer to help(color_scheme_set) for a full list of themes):

# preview a color scheme
color_scheme_view("mix-blue-red")

# and now let's use it
color_scheme_set("mix-blue-red") 
forest_plot(map_crohn)

color_scheme_set("gray")
forest_plot(map_crohn)

The point size can be modified and the vertical line removed:

color_scheme_set("blue")
forest_plot(map_crohn, size=0.5, alpha=0)

Presentation-ready plots

If a plot is to be used for a presentation or in a document such as the study protocol, it is recommended to use sufficiently large font sizes (e.g. about as large as the fonts on the same slide or in the same document) and that elements of the plot are clearly visible. Here we show a few simple statements that can be used for this purpose.

We also recommend saving plots explicitly with the ggsave function, which allows control (and hence consistency) of image size. Note that the font size will be enforced in the requested size; a small image with large font size may result in too little space for the plot itself. The image is sized according to the golden cut (\(\phi=\frac{1+\sqrt5}{2} \approx 1.62\)) which is perceived as a pleasing axis ratio. Png is the recommended image file type for presentations and study documents.

Advanced topics

Extract data from a ggplot object

In some situations, desired modifications to a plot provided by RBesT may not be possible given the returned ggplot object. If a truly tailored plot is desired, the user must extract the data from this object and create a graph from scratch using ggplot functions. Recall the original forest plot:

Suppose we wish to use different symbols for the meta/stratified point estimates and a different linestyle for the vertical line. A tailored plot can be created as follows.

##                 mean  sem median  low    up    study      model
## Gastr06          -51 10.2    -51  -71 -30.9  Gastr06 stratified
## AIMed07          -49  6.8    -49  -62 -35.6  AIMed07 stratified
## NEJM07           -36  4.9    -36  -46 -26.5   NEJM07 stratified
## Gastr01a         -47 19.7    -47  -86  -8.4 Gastr01a stratified
## APhTh04          -90 17.6    -90 -124 -55.5  APhTh04 stratified
## Gastr01b         -54 11.6    -54  -77 -31.4 Gastr01b stratified
## theta_resp_pred  -50 19.3    -48  -93 -11.5      MAP       meta
## theta_resp       -50  8.4    -49  -69 -34.1     Mean       meta

Design plots a clinical trial

Here we show how to create outcome and operating characteristic plots for a clinical trial comparing a developmental drug against placebo. The primary endpoint is binary (with event probability \(p\)) and we use an informative prior for the placebo arm event rate. The experimental drug is believed to lower the event rate, and the criteria for study outcome are hence as follows:

\[ \begin{align*} \textrm{Criterion 1:} \Pr(p_{trt} / p_{pbo} < 1) &> 0.9\\ \textrm{Criterion 2:} \Pr(p_{trt} / p_{pbo} < 0.5) &> 0.5 \end{align*} \]

The outcome is success (GO) if both criteria are satisfied, futility (STOP) if neither is satisfied, and indeterminate if only one or the other is satisfied.

We now create a plot that shows the study conclusion, given any combination of outcomes on the two treatment arms.

# Define prior distributions
prior_pbo <- mixbeta(inf1=c(0.60, 19, 29), inf2=c(0.30, 4, 5), rob=c(0.10, 1, 1))
prior_trt <- mixbeta(c(1, 1/3, 1/3))

# Study sample size
n_trt <- 50
n_pbo <- 20

# Create decision rules and designs to represent success and futility
success <- decision2S(pc=c(0.90, 0.50), qc=c(log(1), log(0.50)), lower.tail=TRUE, link="log")
futility <- decision2S(pc=c(0.10, 0.50), qc=c(log(1), log(0.50)), lower.tail=FALSE, link="log")
design_suc <- oc2S(prior_trt, prior_pbo, n_trt, n_pbo, success)
design_fut <- oc2S(prior_trt, prior_pbo, n_trt, n_pbo, futility)
crit_suc <- decision2S_boundary(prior_trt, prior_pbo, n_trt, n_pbo, success)
crit_fut <- decision2S_boundary(prior_trt, prior_pbo, n_trt, n_pbo, futility)

# Create a data frame that holds the outcomes for y1 (treatment) that define success and futility, 
# conditional on the number of events on y2 (placebo)
outcomes <- data.frame(y2=c(0:n_pbo), suc=crit_suc(0:n_pbo), fut=crit_fut(0:n_pbo), max=n_trt)
outcomes$suc <- with(outcomes, ifelse(suc<0, 0, suc)) # don't allow negative number of events

# Finally put it all together in a plot. 
o <- 0.5 # offset
ggplot(outcomes, aes(x=y2, ymin=-o, ymax=suc+o)) + geom_linerange(size=4, colour="green", alpha=0.6) + 
  geom_linerange(aes(ymin=suc+o, ymax=fut+o), colour="orange", size=4, alpha=0.6) + 
  geom_linerange(aes(ymin=fut+o, ymax=max+o), colour="red", size=4, alpha=0.6) + 
  annotate("text", x=c(2,14), y=c(36,8), label=c("STOP", "GO"), size=10) + 
  scale_x_continuous(breaks=seq(0,n_pbo,by=2)) + 
  scale_y_continuous(breaks=seq(0,n_trt,by=4)) + 
  labs(x="Events on placebo", y="Events on treatment", title="Study outcomes") + 
  coord_flip() + theme_bw(base_size=12)

We can also use the design functions that were already derived (design_suc and design_fut) to compute operating characteristics.

## R version 4.1.0 (2021-05-18)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 20.04.4 LTS
## 
## Matrix products: default
## BLAS:   /usr/lib/x86_64-linux-gnu/blas/libblas.so.3.9.0
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.9.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] bayesplot_1.8.1 tidyr_1.1.3     dplyr_1.0.8     ggplot2_3.3.5  
## [5] RBesT_1.6-4     knitr_1.33     
## 
## loaded via a namespace (and not attached):
##  [1] Rcpp_1.0.7           mvtnorm_1.1-2        prettyunits_1.1.1   
##  [4] ps_1.6.0             assertthat_0.2.1     digest_0.6.29       
##  [7] utf8_1.2.2           V8_3.4.2             plyr_1.8.6          
## [10] R6_2.5.1             ggridges_0.5.3       backports_1.2.1     
## [13] stats4_4.1.0         evaluate_0.14        highr_0.9           
## [16] pillar_1.6.2         rlang_1.0.1          curl_4.3.2          
## [19] callr_3.7.0          jquerylib_0.1.4      checkmate_2.0.0     
## [22] rmarkdown_2.11       labeling_0.4.2       stringr_1.4.0       
## [25] loo_2.4.1            munsell_0.5.0        compiler_4.1.0      
## [28] xfun_0.25            rstan_2.21.2         pkgconfig_2.0.3     
## [31] pkgbuild_1.2.0       rstantools_2.1.1     htmltools_0.5.2     
## [34] tidyselect_1.1.1     tibble_3.1.3         gridExtra_2.3       
## [37] codetools_0.2-18     matrixStats_0.60.1   fansi_0.5.0         
## [40] crayon_1.4.2         withr_2.4.3          grid_4.1.0          
## [43] jsonlite_1.7.2       gtable_0.3.0         lifecycle_1.0.1     
## [46] DBI_1.1.2            magrittr_2.0.1       StanHeaders_2.21.0-7
## [49] scales_1.1.1         RcppParallel_5.1.4   cli_3.1.1           
## [52] stringi_1.7.3        farver_2.1.0         bslib_0.3.1         
## [55] ellipsis_0.3.2       generics_0.1.0       vctrs_0.3.8         
## [58] Formula_1.2-4        tools_4.1.0          glue_1.6.1          
## [61] purrr_0.3.4          processx_3.5.2       parallel_4.1.0      
## [64] fastmap_1.1.0        yaml_2.2.1           inline_0.3.19       
## [67] colorspace_2.0-2     sass_0.4.0