Transformers are unary functions that are applied on the covariates. Here is an example of “to the power of 5”.
<- function(x) {x^5} pow5
Which you can pass it into the transformers list as
"pow5"
.
The gen_pseudo_pop
function tries transformers if the
covariate balance test has not been met in the previous attempt. The
covariate with the worst balance value will be chosen to apply a
transformer. The first transformer from the list will be selected for
this purpose. If the transformer has been used for this specific
covariate, the next value will be selected.
You can use set_logger
function and set logger_level to
one of “TRACE”, “DEBUG”, “INFO”, “SUCCESS”, “WARN”, “ERROR”, or”FATAL”.
In this package most of the internal information are logged in INFO and
DEBUG level. If you need to see a new information in the .log file,
please consider opening and issue here.
We are using a spawning mechanism in multicore processing. Each worker processor gets a copy of the required data and libraries. In case of limited available memory and a large dataset, you can reduce the number of CPU cores (nthread) to fit the processing into your system. Following this recommendation, the processing time will increase; however, the memory usage will decrease.
Many internal libraries (e.g., XGBoost
) are dependent on
OpenMP library for parallel computation. Please make sure that you have
installed OpenMP library and configured it correctly. Please see the
following links for more details:
In order to activate OpenMP on HPC, you need to load the required modules. For example, if you are using SLURM on Cannon at Harvard University, you need to load the intel module.
module load intel/19.0.5-fasrc01
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
Please read more here.