Title: | Efficient Approximate Bayesian Computation Sampling Schemes |
---|---|
Description: | Enables launching a series of simulations of a computer code from the R session, and to retrieve the simulation outputs in an appropriate format for post-processing treatments. Five sequential sampling schemes and three coupled-to-MCMC schemes are implemented. |
Authors: | Franck Jabot, Thierry Faure, Nicolas Dumoulin, Carlo Albert. |
Maintainer: | Nicolas Dumoulin <[email protected]> |
License: | GPL-3 |
Version: | 1.5.2 |
Built: | 2024-11-26 04:36:41 UTC |
Source: | https://github.com/r-forge/easyabc |
The package EasyABC enables to launch a series of simulations of a computer code from the R platform, and to retrieve the simulation outputs in an appropriate format for post-processing treatments. Four sequential sampling schemes, three coupled-to-MCMC schemes and a Simulated Annealing scheme are implemented. EasyABC further enables to launch the simulations in parallel on multiple cores of a computer.
Package: EasyABC Type: Package Version: 1.5.1 Date: 2022-09-21 License: GPL-3 LazyLoad: yes |
Franck Jabot, Thierry Faure, Nicolas Dumoulin, Carlo Albert
ABC_rejection
, ABC_sequential
, ABC_mcmc
, SABC
, binary_model
, binary_model_cluster
This function launches a series of nb_design_pts
model simulations with model parameters drawn in the prior distribution specified in prior_matrix
, build an emulator with these computed design points and then launches a series of nb_simul
emulator simulations.
ABC_emulation(model, prior, nb_design_pts, nb_simul, prior_test=NULL, summary_stat_target=NULL, emulator_span = 50, tol=NULL, use_seed=FALSE, seed_count=0, n_cluster=1, verbose=FALSE, progress_bar=FALSE)
ABC_emulation(model, prior, nb_design_pts, nb_simul, prior_test=NULL, summary_stat_target=NULL, emulator_span = 50, tol=NULL, use_seed=FALSE, seed_count=0, n_cluster=1, verbose=FALSE, progress_bar=FALSE)
model |
a |
prior |
a list of prior information. Each element of the list corresponds to a model parameter. The list element must be a vector whose first argument determines the type of prior distribution: possible values are |
nb_design_pts |
a positive integer equal to the desired number of simulations of the model used to build the emulator. |
nb_simul |
a positive integer equal to the desired number of simulations of the emulator. |
prior_test |
a string expressing the constraints between model parameters.
This expression will be evaluated as a logical expression, you can use all the logical operators including |
summary_stat_target |
a vector containing the targeted (observed) summary statistics.
If not provided, |
emulator_span |
a positive number, the number of design points selected for the local regression.
|
tol |
tolerance, a strictly positive number (between 0 and 1) indicating the proportion of simulations retained nearest the targeted summary statistics. |
use_seed |
logical. If |
seed_count |
a positive integer, the initial seed value provided to the function |
n_cluster |
a positive integer. If larger than 1 (the default value), |
verbose |
logical. |
progress_bar |
logical, |
The returned value is a list containing the following components:
param |
The model parameters used in the |
stats |
The summary statistics obtained at the end of the |
weights |
The weights of the different |
stats_normalization |
The standard deviation of the summary statistics across the |
nsim |
The number of |
nrec |
The number of retained simulations (if targeted summary statistics are provided). |
computime |
The computing time to perform the simulations. |
Franck Jabot, Thierry Faure and Nicolas Dumoulin
Jabot, F., Lagarrigues G., Courbaud B., Dumoulin N. (2015). A comparison of emulation methods for Approximate Bayesian Computation. To be published.
binary_model
, binary_model_cluster
, ABC_sequential
, ABC_mcmc
## Not run: ##### EXAMPLE 1 ##### ##################### ## the model is a C++ function packed into a R function -- the option 'use_seed' ## must be turned to TRUE. trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## only launching simulations with parameters drawn in the prior distributions ABC_emul = ABC_emulation(model=trait_model, prior=trait_prior, nb_design_pts=10, nb_simul=300, use_seed=TRUE, progress=TRUE) ABC_emul ## launching simulations with parameters drawn in the prior distributions and performing # the rejection step sum_stat_obs=c(100,2.5,20,30000) ABC_emul = ABC_emulation(model=trait_model, prior=trait_prior, tol=0.2, nb_design_pts=10, nb_simul=100, summary_stat_target=sum_stat_obs, use_seed=TRUE, progress=TRUE) ABC_emul ## End(Not run)
## Not run: ##### EXAMPLE 1 ##### ##################### ## the model is a C++ function packed into a R function -- the option 'use_seed' ## must be turned to TRUE. trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## only launching simulations with parameters drawn in the prior distributions ABC_emul = ABC_emulation(model=trait_model, prior=trait_prior, nb_design_pts=10, nb_simul=300, use_seed=TRUE, progress=TRUE) ABC_emul ## launching simulations with parameters drawn in the prior distributions and performing # the rejection step sum_stat_obs=c(100,2.5,20,30000) ABC_emul = ABC_emulation(model=trait_model, prior=trait_prior, tol=0.2, nb_design_pts=10, nb_simul=100, summary_stat_target=sum_stat_obs, use_seed=TRUE, progress=TRUE) ABC_emul ## End(Not run)
This function implements three different algorithms to perform coupled to MCMC ABC.
ABC_mcmc(method, model, prior, summary_stat_target, prior_test=NULL, n_rec=100, n_between_sampling=10, n_cluster = 1, use_seed = FALSE, verbose = FALSE, dist_weights=NULL, ...)
ABC_mcmc(method, model, prior, summary_stat_target, prior_test=NULL, n_rec=100, n_between_sampling=10, n_cluster = 1, use_seed = FALSE, verbose = FALSE, dist_weights=NULL, ...)
method |
a character string indicating the ABC-MCMC algorithm to be used. Possible values are |
model |
a |
prior |
a list of prior information. Each element of the list corresponds to a model parameter. The list element must be a vector whose first argument determines the type of prior distribution: possible values are |
summary_stat_target |
a vector containing the targeted (observed) summary statistics. |
prior_test |
a string expressing the constraints between model parameters.
This expression will be evaluated as a logical expression, you can use all the logical operators including |
n_rec |
a positive integer equal to the desired number of sampled points along the MCMC. |
n_between_sampling |
a positive integer equal to the desired spacing between sampled points along the MCMC. |
n_cluster |
a positive integer. If larger than 1 (the default value), |
use_seed |
logical. If |
verbose |
logical. |
dist_weights |
a vector containing the weights to apply to the distance between the computed and the targeted statistics. These weights can be used to give more importance to a summary statistisc for example. The weights will be normalized before applying them. If not provided, no weights will be applied. |
... |
Additional arguments can be passed depending on the choosen method (see below) |
See the package's vignette for details on ABC-MCMC.
The returned value is a list containing the following components:
param |
The model parameters used in the |
stats |
The summary statistics obtained at the end of the |
dist |
The distance of the simulations to the data. |
stats_normalization |
The standard deviation of the summary statistics across the |
epsilon |
The final maximal distance between simulations and data in the retained sample of particles. |
nsim |
The number of |
n_between_sampling |
The spacing between two sampled points in the MCMC. |
computime |
The computing time to perform the simulations. |
min_stats |
The minimal values of each summary statistics during the calibration step, given when |
max_stats |
The maximal values of each summary statistics during the calibration step, given when |
lambda |
The lambda values of the Box-Cox transformation, given when |
geometric_mean |
The geometric means, given when |
boxcox_mean |
The means of Box-Cox transforms, given when |
boxcox_sd |
The standard deviations of Box-Cox transforms, given when |
pls_transform |
The matrix of PLS transformation, given when |
numcomp |
The number of used components for the PLS transformation, given when |
Depending on the choosen method, you can specify the following arguments:
a positive number, used when method
is "Marjoram_original"
.
This is the tolerance threshold used during the MCMC.
If not provided by the user, it is automatically computed as half the distance between the first simulation and the target summary statistics and a warning is printed.
a vector of the same length as summary_stat_target
, used when method
is "Marjoram_original"
.
Each element contains a positive number by which each summary statistics must be divided before the computation of the Euclidean distance between simulations and data.
If not provided by the user, the simulated summary statistics are divided by the target summary statistics and a warning is printed.
a vector of the same length as the number of model parameters, used when method
is "Marjoram_original"
.
Each element contains a positive number defining the range of MCMC jumps for each model parameter.
If not provided by the user, a default value is used for each parameter and a warning is printed. The default value is 1/50 of the prior range for uniform distributions, 1/20 of the standard deviation of the prior distribution for normal distributions, 1/20 * exp ( sigma * sigma
for lognormal distributions where sigma is the standard deviation of the prior distribution in the log scale, and 1/20 of the inverse of the rate for exponential distributions.
a positive integer, used when method
is "Marjoram"
or "Wegmann"
.
This is the number of simulations performed during the calibration step.
Default value is 10000.
a positive number between 0 and 1 (strictly), used when method
is "Marjoram"
or "Wegmann"
.
This is the percentage of simulations retained during the calibration step to determine the tolerance threshold to be used during the MCMC.
Default value is 0.01.
a positive number, used when method
is "Marjoram"
or "Wegmann"
.
This is a scaling factor defining the range of MCMC jumps.
Default value is 1.
a positive integer, used when method
is "Wegmann"
.
This is the number of components to be used for PLS transformations.
Default value is 0 which encodes that this number is equal to the number of summary statistics.
a positive integer, the initial seed value provided to the function model
(if use_seed=TRUE
). This value is incremented by 1 at each call of the function model
.
logical, FALSE
by default. If TRUE
, ABC_mcmc
will output a bar of progression with the estimated remaining computing time. Option not available with multiple cores.
a positive number, the max number of fails when moving particle inside the prior. Enabled only if inside_prior is to TRUE
.
10000
by default.
Franck Jabot, Thierry Faure and Nicolas Dumoulin
Marjoram, P., Molitor, J., Plagnol, V. and Tavar\'e, S. (2003) Markov chain Monte Carlo without likelihoods. PNAS, 100, 15324–15328.
Wegmann, D., Leuenberger, C. and Excoffier, L. (2009) Efficient approximate Bayesian computation coupled with Markov chain Monte Carlo without likelihood. Genetics, 182, 1207-1218.
binary_model
, binary_model_cluster
, ABC_rejection
, ABC_emulation
, ABC_sequential
## Not run: ##### EXAMPLE 1 ##### ##################### ## the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## define the targeted summary statistics sum_stat_obs=c(1.5,0.5) ## to perform the Marjoram et al. (2003)'s method: ## ABC_Marjoram_original<-ABC_mcmc(method="Marjoram_original", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Marjoram_original ## artificial example to perform the Marjoram et al. (2003)'s method, with modifications # drawn from Wegmann et al. (2009) without Box-Cox and PLS transformations. ## ABC_Marjoram<-ABC_mcmc(method="Marjoram", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Marjoram ## artificial example to perform the Wegmann et al. (2009)'s method. ## ABC_Wegmann<-ABC_mcmc(method="Wegmann", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Wegmann ##### EXAMPLE 2 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, # the option 'use_seed' must be turned to TRUE. ## define prior information trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## define the targeted summary statistics sum_stat_obs=c(100,2.5,20,30000) ## artificial example to perform the Marjoram et al. (2003)'s method. ## n=10 ABC_Marjoram_original<-ABC_mcmc(method="Marjoram_original", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, use_seed=TRUE) ABC_Marjoram_original ## artificial example to perform the Marjoram et al. (2003)'s method, with modifications # drawn from Wegmann et al. (2009) without Box-Cox and PLS transformations. ## n=10 n_calib=10 tol_quant=0.2 ABC_Marjoram<-ABC_mcmc(method="Marjoram", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, n_calibration=n_calib, tolerance_quantile=tol_quant, use_seed=TRUE) ABC_Marjoram ## artificial example to perform the Wegmann et al. (2009)'s method. ## n=10 n_calib=10 tol_quant=0.2 ABC_Wegmann<-ABC_mcmc(method="Wegmann", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, n_calibration=n_calib, tolerance_quantile=tol_quant, use_seed=TRUE) ABC_Wegmann ## End(Not run)
## Not run: ##### EXAMPLE 1 ##### ##################### ## the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## define the targeted summary statistics sum_stat_obs=c(1.5,0.5) ## to perform the Marjoram et al. (2003)'s method: ## ABC_Marjoram_original<-ABC_mcmc(method="Marjoram_original", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Marjoram_original ## artificial example to perform the Marjoram et al. (2003)'s method, with modifications # drawn from Wegmann et al. (2009) without Box-Cox and PLS transformations. ## ABC_Marjoram<-ABC_mcmc(method="Marjoram", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Marjoram ## artificial example to perform the Wegmann et al. (2009)'s method. ## ABC_Wegmann<-ABC_mcmc(method="Wegmann", model=toy_model, prior=toy_prior, summary_stat_target=sum_stat_obs) ABC_Wegmann ##### EXAMPLE 2 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, # the option 'use_seed' must be turned to TRUE. ## define prior information trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## define the targeted summary statistics sum_stat_obs=c(100,2.5,20,30000) ## artificial example to perform the Marjoram et al. (2003)'s method. ## n=10 ABC_Marjoram_original<-ABC_mcmc(method="Marjoram_original", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, use_seed=TRUE) ABC_Marjoram_original ## artificial example to perform the Marjoram et al. (2003)'s method, with modifications # drawn from Wegmann et al. (2009) without Box-Cox and PLS transformations. ## n=10 n_calib=10 tol_quant=0.2 ABC_Marjoram<-ABC_mcmc(method="Marjoram", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, n_calibration=n_calib, tolerance_quantile=tol_quant, use_seed=TRUE) ABC_Marjoram ## artificial example to perform the Wegmann et al. (2009)'s method. ## n=10 n_calib=10 tol_quant=0.2 ABC_Wegmann<-ABC_mcmc(method="Wegmann", model=trait_model, prior=trait_prior, summary_stat_target=sum_stat_obs, n_rec=n, n_calibration=n_calib, tolerance_quantile=tol_quant, use_seed=TRUE) ABC_Wegmann ## End(Not run)
This function launches a series of nb_simul
model simulations with model parameters drawn in the prior distribution specified in prior_matrix
.
ABC_rejection(model, prior, nb_simul, prior_test=NULL, summary_stat_target=NULL, tol=NULL, use_seed=FALSE, seed_count=0, n_cluster=1, verbose=FALSE, progress_bar=FALSE)
ABC_rejection(model, prior, nb_simul, prior_test=NULL, summary_stat_target=NULL, tol=NULL, use_seed=FALSE, seed_count=0, n_cluster=1, verbose=FALSE, progress_bar=FALSE)
model |
a |
prior |
a list of prior information. Each element of the list corresponds to a model parameter. The list element must be a vector whose first argument determines the type of prior distribution: possible values are |
nb_simul |
a positive integer equal to the desired number of simulations of the model. |
prior_test |
a string expressing the constraints between model parameters.
This expression will be evaluated as a logical expression, you can use all the logical operators including |
summary_stat_target |
a vector containing the targeted (observed) summary statistics.
If not provided, |
tol |
tolerance, a strictly positive number (between 0 and 1) indicating the proportion of simulations retained nearest the targeted summary statistics. |
use_seed |
logical. If |
seed_count |
a positive integer, the initial seed value provided to the function |
n_cluster |
a positive integer. If larger than 1 (the default value), |
verbose |
logical. |
progress_bar |
logical, |
The returned value is a list containing the following components:
param |
The model parameters used in the |
stats |
The summary statistics obtained at the end of the |
weights |
The weights of the different |
stats_normalization |
The standard deviation of the summary statistics across the |
nsim |
The number of |
nrec |
The number of retained simulations (if targeted summary statistics are provided). |
computime |
The computing time to perform the simulations. |
Franck Jabot, Thierry Faure and Nicolas Dumoulin
Pritchard, J.K., and M.T. Seielstad and A. Perez-Lezaun and M.W. Feldman (1999) Population growth of human Y chromosomes: a study of Y chromosome microsatellites. Molecular Biology and Evolution, 16, 1791–1798.
binary_model
, binary_model_cluster
, ABC_sequential
, ABC_mcmc
##### EXAMPLE 1 ##### ##################### set.seed(1) ## artificial example to show how to use the 'ABC_rejection' function. ## defining a simple toy model: toy_model<-function(x){ 2 * x + 5 + rnorm(1,0,0.1) } ## define prior information toy_prior=list(c("unif",0,1)) # a uniform prior distribution between 0 and 1 ## only launching simulations with parameters drawn in the prior distributions set.seed(1) n=10 ABC_sim<-ABC_rejection(model=toy_model, prior=toy_prior, nb_simul=n) ABC_sim ## launching simulations with parameters drawn in the prior distributions # and performing the rejection step sum_stat_obs=6.5 tolerance=0.2 ABC_rej<-ABC_rejection(model=toy_model, prior=toy_prior, nb_simul=n, summary_stat_target=sum_stat_obs, tol=tolerance) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ## Not run: ##### EXAMPLE 2 ##### ##################### ## this time, the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model2<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior2=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## only launching simulations with parameters drawn in the prior distributions set.seed(1) n=10 ABC_sim<-ABC_rejection(model=toy_model2, prior=toy_prior2, nb_simul=n) ABC_sim ## launching simulations with parameters drawn in the prior distributions # and performing the rejection step sum_stat_obs2=c(1.5,0.5) tolerance=0.2 ABC_rej<-ABC_rejection(model=toy_model2, prior=toy_prior2, nb_simul=n, summary_stat_target=sum_stat_obs2, tol=tolerance) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ##### EXAMPLE 3 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, the option # 'use_seed' must be turned to TRUE. n=10 trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## only launching simulations with parameters drawn in the prior distributions ABC_sim<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, use_seed=TRUE) ABC_sim ## launching simulations with parameters drawn in the prior distributions and performing # the rejection step sum_stat_obs=c(100,2.5,20,30000) tolerance=0.2 ABC_rej<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, summary_stat_target=sum_stat_obs, tol=tolerance, use_seed=TRUE) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ##### EXAMPLE 4 - Parallel implementations ##### ################################################ ## NB: the option use_seed must be turned to TRUE. ## For models already running with the option use_seed=TRUE, simply change # the value of n_cluster: sum_stat_obs=c(100,2.5,20,30000) ABC_simb<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, use_seed=TRUE, n_cluster=2) ## For other models, change the value of n_cluster and modify the model so that the first # parameter becomes a seed information value: toy_model_parallel<-function(x){ set.seed(x[1]) 2 * x[2] + 5 + rnorm(1,0,0.1) } sum_stat_obs=6.5 ABC_simb<-ABC_rejection(model=toy_model_parallel, prior=toy_prior, nb_simul=n, use_seed=TRUE, n_cluster=2) ## End(Not run)
##### EXAMPLE 1 ##### ##################### set.seed(1) ## artificial example to show how to use the 'ABC_rejection' function. ## defining a simple toy model: toy_model<-function(x){ 2 * x + 5 + rnorm(1,0,0.1) } ## define prior information toy_prior=list(c("unif",0,1)) # a uniform prior distribution between 0 and 1 ## only launching simulations with parameters drawn in the prior distributions set.seed(1) n=10 ABC_sim<-ABC_rejection(model=toy_model, prior=toy_prior, nb_simul=n) ABC_sim ## launching simulations with parameters drawn in the prior distributions # and performing the rejection step sum_stat_obs=6.5 tolerance=0.2 ABC_rej<-ABC_rejection(model=toy_model, prior=toy_prior, nb_simul=n, summary_stat_target=sum_stat_obs, tol=tolerance) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ## Not run: ##### EXAMPLE 2 ##### ##################### ## this time, the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model2<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior2=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## only launching simulations with parameters drawn in the prior distributions set.seed(1) n=10 ABC_sim<-ABC_rejection(model=toy_model2, prior=toy_prior2, nb_simul=n) ABC_sim ## launching simulations with parameters drawn in the prior distributions # and performing the rejection step sum_stat_obs2=c(1.5,0.5) tolerance=0.2 ABC_rej<-ABC_rejection(model=toy_model2, prior=toy_prior2, nb_simul=n, summary_stat_target=sum_stat_obs2, tol=tolerance) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ##### EXAMPLE 3 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, the option # 'use_seed' must be turned to TRUE. n=10 trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## only launching simulations with parameters drawn in the prior distributions ABC_sim<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, use_seed=TRUE) ABC_sim ## launching simulations with parameters drawn in the prior distributions and performing # the rejection step sum_stat_obs=c(100,2.5,20,30000) tolerance=0.2 ABC_rej<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, summary_stat_target=sum_stat_obs, tol=tolerance, use_seed=TRUE) ## NB: see the package's vignette to see how to pipeline 'ABC_rejection' with the function # 'abc' of the package 'abc' to perform other rejection schemes. ##### EXAMPLE 4 - Parallel implementations ##### ################################################ ## NB: the option use_seed must be turned to TRUE. ## For models already running with the option use_seed=TRUE, simply change # the value of n_cluster: sum_stat_obs=c(100,2.5,20,30000) ABC_simb<-ABC_rejection(model=trait_model, prior=trait_prior, nb_simul=n, use_seed=TRUE, n_cluster=2) ## For other models, change the value of n_cluster and modify the model so that the first # parameter becomes a seed information value: toy_model_parallel<-function(x){ set.seed(x[1]) 2 * x[2] + 5 + rnorm(1,0,0.1) } sum_stat_obs=6.5 ABC_simb<-ABC_rejection(model=toy_model_parallel, prior=toy_prior, nb_simul=n, use_seed=TRUE, n_cluster=2) ## End(Not run)
This function implements four different algorithms to perform sequential sampling schemes for ABC. Sequential sampling schemes consist in sampling initially model parameters in the prior distribution, just like in a standard rejection-based ABC, in order to obtain a rough posterior distribution of parameter values, and in subsequently sampling close to this rough posterior distribution to refine it. Sequential sampling schemes have been shown to be more efficient than standard rejection-based procedures.
ABC_sequential(method, model, prior, nb_simul, summary_stat_target, prior_test=NULL, n_cluster = 1, use_seed = FALSE, verbose = FALSE, dist_weights=NULL, ...)
ABC_sequential(method, model, prior, nb_simul, summary_stat_target, prior_test=NULL, n_cluster = 1, use_seed = FALSE, verbose = FALSE, dist_weights=NULL, ...)
method |
a character string indicating the sequential algorithm to be used. Possible values are |
model |
a |
prior |
a list of prior information. Each element of the list corresponds to a model parameter. The list element must be a vector whose first argument determines the type of prior distribution: possible values are |
nb_simul |
a positive integer equal to the desired number of simulations of the model below the tolerance threshold when |
summary_stat_target |
a vector containing the targeted (observed) summary statistics. |
prior_test |
a string expressing the constraints between model parameters.
This expression will be evaluated as a logical expression, you can use all the logical operators including |
n_cluster |
a positive integer. If larger than 1 (the default value), |
use_seed |
logical. If |
verbose |
logical. |
dist_weights |
a vector containing the weights to apply to the distance between the computed and the targeted statistics. These weights can be used to give more importance to a summary statistisc for example. The weights will be normalized before applying them. If not provided, no weights will be applied. |
... |
Additional arguments can be passed depending on the choosen method (see below) |
See the package's vignette for details on the four algorithms.
The returned value is a list containing the following components:
param |
The model parameters used in the |
stats |
The summary statistics obtained at the end of the |
weights |
The weights of the different |
stats_normalization |
The standard deviation of the summary statistics across the |
epsilon |
The final maximal distance between simulations and data in the retained sample of particles. |
nsim |
The number of |
computime |
The computing time to perform the simulations. |
intermediary |
Intermediary results stored when the option |
Depending on the choosen method, you can specify the following arguments:
a positive integer, the initial seed value provided to the function model
(if use_seed=TRUE
). This value is incremented by 1 at each call of the function model
.
logical used when method
is "Beaumont"
, "Lenormand"
or "Emulation"
. TRUE
by default.
If FALSE
, parameter sampling is not restricted to the initial ranges of the prior distribution during the subsequent algorithm steps.
a vector containing the sequence of tolerance thresholds when method
is "Beaumont"
, or the targeted final tolerance threshold when method
is "Drovandi"
.
a positive number between 0 and 1 (strictly) used when method
is "Drovandi"
, "Delmoral"
, "Lenormand"
or "Emulation"
.
alpha
is the proportion of particles rejected at each step in the algorithm "Drovandi"
.
This is the proportion of particles kept at each step in the algorithms "Delmoral"
, "Lenormand"
and "Emulation"
.
Default values are 0.5 when method
is "Drovandi"
, "Lenormand"
or "Emulation"
and 0.9 for "Delmoral"
.
See the package's vignette for details.
a positive number between 0 and 1 (strictly) used when method
is "Drovandi"
.
This is the expected proportion of particles which are going to be duplicated at each step.
Default value is 0.01.
See the package's vignette and Drovandi and Pettitt (2011) for details.
logical used when method
is "Drovandi"
.
Default value is TRUE
. In this case, the first tolerance threshold is determined by the algorithm, by taking the 1-alpha
quantile of the distances between the simulated and targeted summary statistics.
If FALSE
, the initial tolerance threshold for the first step has to be provided as the first element of the vector tolerance_tab
. In this case, the targeted final tolerance threshold is the second element of tolerance_tab
.
a positive integer used when method
is "Delmoral"
.
This is the number of model
simulations performed for each parameter set.
Default value is 1. See the package's vignette and Del Moral et al. (2012) for details.
a positive integer used when method
is "Delmoral"
. Default value is 0.5*nb_simul
.
This is the minimal effective sample size below which a resampling step is launched. See the package's vignette and Del Moral et al. (2012) for details.
a positive number used when method
is "Delmoral"
.
This is the targeted final tolerance threshold.
a positive number between 0 and 1 (strictly) used when method
is "Lenormand"
or "Emulation"
.
This is the stopping criterion of the algorithm: a small number ensures a better convergence of the algorithm, but at a cost in computing time.
Default value is 0.05. See the package's vignette and Lenormand et al. (2012) for details.
a positive integer, the number of times the emulation is repeated. 9
by default.
a positive number, the number of design points selected for the local regression.
50
by default.
logical, FALSE
by default. If TRUE
, ABC_sequential
will output a bar of progression with the estimated remaining computing time. Option not available with multiple cores.
a positive number, the max number of fails when moving particle inside the prior. Enabled only if inside_prior is to TRUE
.
10000
by default.
Franck Jabot, Thierry Faure and Nicolas Dumoulin
Beaumont, M. A., Cornuet, J., Marin, J., and Robert, C. P. (2009) Adaptive approximate Bayesian computation. Biometrika,96, 983–990.
Del Moral, P., Doucet, A., and Jasra, A. (2012) An adaptive sequential Monte Carlo method for approximate Bayesian computation. Statistics and Computing, 22, 1009–1020.
Drovandi, C. C. and Pettitt, A. N. (2011) Estimation of parameters for macroparasite population evolution using approximate Bayesian computation. Biometrics, 67, 225–233.
Lenormand, M., Jabot, F., Deffuant G. (2012) Adaptive approximate Bayesian computation for complex models. http://arxiv.org/pdf/1111.1308.pdf
Jabot, F., Lagarrigues G., Courbaud B., Dumoulin N. (2015). A comparison of emulation methods for Approximate Bayesian Computation. To be published.
binary_model
, binary_model_cluster
, ABC_rejection
, ABC_emulation
, ABC_mcmc
## Not run: ##### EXAMPLE 1 ##### ##################### set.seed(1) ## artificial example to show how to use the 'ABC_sequential' function. ## defining a simple toy model: toy_model<-function(x){ 2 * x + 5 + rnorm(1,0,0.1) } ## define prior information toy_prior=list(c("unif",0,1)) # a uniform prior distribution between 0 and 1 ## define the targeted summary statistics sum_stat_obs=6.5 ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(1.5,0.5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=0.5 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, c=c_drov) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=0.5 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, alpha=alpha_delmo, tolerance_target=tolerance) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc) ABC_Lenormand ##### EXAMPLE 2 ##### ##################### ## this time, the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model2<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior2=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## define the targeted summary statistics sum_stat_obs2=c(1.5,0.5) ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(1.5,0.5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, tolerance_tab=tolerance) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=0.5 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, tolerance_tab=tolerance, c=c_drov) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=0.5 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, alpha=alpha_delmo, tolerance_target=tolerance) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 # Only uniform priors are supported for the method "Lenormand" (since it performs a Latin # Hypercube sampling at the beginning): toy_prior2=list(c("unif",0,1),c("unif",0.5,1.5)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution of # mean 1 and standard deviation of 1 for parameter 2. ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, p_acc_min=pacc) ABC_Lenormand ##### EXAMPLE 3 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, the option # 'use_seed' must be turned to TRUE. n=10 ## define prior information trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## define the targeted summary statistics sum_stat_obs=c(100,2.5,20,30000) ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(8,5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, use_seed=TRUE) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=3 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, c=c_drov, use_seed=TRUE) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=3 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, alpha=alpha_delmo, tolerance_target=tolerance, use_seed=TRUE) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE) ABC_Lenormand ##### EXAMPLE 4 - Parallel implementations ##### ################################################ ## NB: the option use_seed must be turned to TRUE. ## For models already running with the option use_seed=TRUE, simply change # the value of n_cluster: sum_stat_obs=c(100,2.5,20,30000) ABC_Lenormand<-ABC_sequential(method="Lenormand", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE, n_cluster=2) ABC_Lenormand ## For other models, change the value of n_cluster and modify the model so that the # first parameter becomes a seed information value: toy_model_parallel<-function(x){ set.seed(x[1]) 2 * x[2] + 5 + rnorm(1,0,0.1) } sum_stat_obs=6.5 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model_parallel, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE, n_cluster=2) ABC_Lenormand ## End(Not run)
## Not run: ##### EXAMPLE 1 ##### ##################### set.seed(1) ## artificial example to show how to use the 'ABC_sequential' function. ## defining a simple toy model: toy_model<-function(x){ 2 * x + 5 + rnorm(1,0,0.1) } ## define prior information toy_prior=list(c("unif",0,1)) # a uniform prior distribution between 0 and 1 ## define the targeted summary statistics sum_stat_obs=6.5 ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(1.5,0.5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=0.5 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, c=c_drov) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=0.5 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, alpha=alpha_delmo, tolerance_target=tolerance) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc) ABC_Lenormand ##### EXAMPLE 2 ##### ##################### ## this time, the model has two parameters and outputs two summary statistics. ## defining a simple toy model: toy_model2<-function(x){ c( x[1] + x[2] + rnorm(1,0,0.1) , x[1] * x[2] + rnorm(1,0,0.1) ) } ## define prior information toy_prior2=list(c("unif",0,1),c("normal",1,2)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution # of mean 1 and standard deviation of 2 for parameter 2. ## define the targeted summary statistics sum_stat_obs2=c(1.5,0.5) ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(1.5,0.5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, tolerance_tab=tolerance) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=0.5 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, tolerance_tab=tolerance, c=c_drov) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=0.5 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, alpha=alpha_delmo, tolerance_target=tolerance) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 # Only uniform priors are supported for the method "Lenormand" (since it performs a Latin # Hypercube sampling at the beginning): toy_prior2=list(c("unif",0,1),c("unif",0.5,1.5)) # a uniform prior distribution between 0 and 1 for parameter 1, and a normal distribution of # mean 1 and standard deviation of 1 for parameter 2. ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model2, prior=toy_prior2, nb_simul=20, summary_stat_target=sum_stat_obs2, p_acc_min=pacc) ABC_Lenormand ##### EXAMPLE 3 ##### ##################### ## this time, the model is a C++ function packed into a R function -- this time, the option # 'use_seed' must be turned to TRUE. n=10 ## define prior information trait_prior=list(c("unif",3,5),c("unif",-2.3,1.6),c("unif",-25,125),c("unif",-0.7,3.2)) trait_prior ## define the targeted summary statistics sum_stat_obs=c(100,2.5,20,30000) ## to perform the Beaumont et al. (2009)'s method: ## tolerance=c(8,5) ABC_Beaumont<-ABC_sequential(method="Beaumont", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, use_seed=TRUE) ABC_Beaumont ## to perform the Drovandi and Pettitt (2011)'s method: ## tolerance=3 c_drov=0.7 ABC_Drovandi<-ABC_sequential(method="Drovandi", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, tolerance_tab=tolerance, c=c_drov, use_seed=TRUE) ABC_Drovandi ## to perform the Del Moral et al. (2012)'s method: ## alpha_delmo=0.5 tolerance=3 ABC_Delmoral<-ABC_sequential(method="Delmoral", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, alpha=alpha_delmo, tolerance_target=tolerance, use_seed=TRUE) ABC_Delmoral ## to perform the Lenormand et al. (2012)'s method: ## pacc=0.4 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE) ABC_Lenormand ##### EXAMPLE 4 - Parallel implementations ##### ################################################ ## NB: the option use_seed must be turned to TRUE. ## For models already running with the option use_seed=TRUE, simply change # the value of n_cluster: sum_stat_obs=c(100,2.5,20,30000) ABC_Lenormand<-ABC_sequential(method="Lenormand", model=trait_model, prior=trait_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE, n_cluster=2) ABC_Lenormand ## For other models, change the value of n_cluster and modify the model so that the # first parameter becomes a seed information value: toy_model_parallel<-function(x){ set.seed(x[1]) 2 * x[2] + 5 + rnorm(1,0,0.1) } sum_stat_obs=6.5 ABC_Lenormand<-ABC_sequential(method="Lenormand", model=toy_model_parallel, prior=toy_prior, nb_simul=20, summary_stat_target=sum_stat_obs, p_acc_min=pacc, use_seed=TRUE, n_cluster=2) ABC_Lenormand ## End(Not run)
This function enables to link a binary executable to a R
function.
binary_model(command)
binary_model(command)
command |
a character string indicating the command to launch the executable on your system. |
A binary executable used with binary_model
has to be placed in the current directory of the R
session.
It further has to respect several constraints: it has to read the seed for its pseudo-random number generator and the model parameters in a file "input", and it must write the summary statistic in a file "output".
The file "input" will be generated by the wrapper binary_model
in the current directory of the R
session, and the wrapper will read the file "output" generated by the binary executable.
In the file "input", the first line contains the seed, and each subsequent line contains one model parameter value.
In the file "output", each summary statistic should be separated by a space or a tab separation.
This wrapper should be used for use with a single core of the computer. If the user wishes to use several cores of the computer, the wrapper binary_model_cluster
should be used.
Note that the files "input" and "output" are deleted by the wrapper at the end of the function.
A R
function wrapping the binary executable, to be used with the EasyABC
functions.
Franck Jabot, Thierry Faure and Nicolas Dumoulin
binary_model_cluster
, ABC_rejection
, ABC_sequential
, ABC_mcmc
## Not run: ## artificial example to show how to use the binary_model function with # an executable "My_Executable" ABC_rej<-ABC_rejection(model=binary_model("./My_Executable"), prior=..., n_cluster=1,...) # NB: on windows, "My_Executable" should be of the form "My_Executable.exe" : ABC_rej<-ABC_rejection(model=binary_model("./My_Executable.exe"), prior=..., n_cluster=1,...) ## End(Not run)
## Not run: ## artificial example to show how to use the binary_model function with # an executable "My_Executable" ABC_rej<-ABC_rejection(model=binary_model("./My_Executable"), prior=..., n_cluster=1,...) # NB: on windows, "My_Executable" should be of the form "My_Executable.exe" : ABC_rej<-ABC_rejection(model=binary_model("./My_Executable.exe"), prior=..., n_cluster=1,...) ## End(Not run)
This function enables to link a binary executable to a R
function.
binary_model_cluster(command)
binary_model_cluster(command)
command |
a character string indicating the command to launch the executable on your system. |
A binary executable used with binary_model_cluster
has to be placed in the current directory of the R
session.
It further has to respect several constraints:
1- it has to have a single argument: a number k
used by the binary executable to know in which files read and write.
2- it has to read the seed for its pseudo-random number generator and the model parameters in a file "inputk" (where k is the argument passed to the executable: input1, input2,...).
3- it has to write the summary statistic in a file "outputk" (where k is the argument passed to the executable: output1, output2,...).
The file "inputk" will be generated by the wrapper binary_model_cluster
in the current directory of the R
session, and the wrapper will read the file "outputk" generated by the binary executable.
This construction ensures that each core reads and writes in different files.
In the file "inputk", the first line contains the seed, and each subsequent line contains one model parameter value.
In the file "outputk", each summary statistic should be separated by a space or a tab separation.
This wrapper should be used for use with multiple cores of the computer. If the user wishes to use a single core of the computer, the wrapper binary_model
should be used.
Note that the files "inputk" and "outputk" are deleted by the wrapper at the end of the function.
A R
function wrapping the binary executable, to be used with the EasyABC
functions.
Franck Jabot, Thierry Faure and Nicolas Dumoulin
binary_model
, ABC_rejection
, ABC_sequential
, ABC_mcmc
## Not run: ## artificial example to show how to use the binary_model function with # an executable "My_Executable" ABC_rej<-ABC_rejection(model=binary_model_cluster("./My_Executable"), prior=..., n_cluster=2,...) # NB: on windows, "My_Executable" should be of the form "My_Executable.exe" : ABC_rej<-ABC_rejection(model=binary_model_cluster("./My_Executable.exe"), prior=..., n_cluster=2,...) ## End(Not run)
## Not run: ## artificial example to show how to use the binary_model function with # an executable "My_Executable" ABC_rej<-ABC_rejection(model=binary_model_cluster("./My_Executable"), prior=..., n_cluster=2,...) # NB: on windows, "My_Executable" should be of the form "My_Executable.exe" : ABC_rej<-ABC_rejection(model=binary_model_cluster("./My_Executable.exe"), prior=..., n_cluster=2,...) ## End(Not run)
Algorithms for the Simulated Annealing approach to Approximate Bayesian Computation (SABC).
SABC(r.model, r.prior, d.prior, n.sample, eps.init, iter.max, v=ifelse(method=="informative",0.4,1.2), beta=0.8, delta=0.1, resample=5*n.sample, verbose=n.sample, method="noninformative", adaptjump=TRUE, summarystats=FALSE, y=NULL, f.summarystats=NULL, ...)
SABC(r.model, r.prior, d.prior, n.sample, eps.init, iter.max, v=ifelse(method=="informative",0.4,1.2), beta=0.8, delta=0.1, resample=5*n.sample, verbose=n.sample, method="noninformative", adaptjump=TRUE, summarystats=FALSE, y=NULL, f.summarystats=NULL, ...)
r.model |
Function that returns either a random sample from the likelihood or a scalar distance between such a sample and the data. The first argument must be the parameter vector. |
r.prior |
Function that returns a random sample from the prior. |
d.prior |
Function that returns the density of the prior distribution. |
n.sample |
Size of the ensemble. |
eps.init |
Initial tolerance or temperature. |
iter.max |
Total number of simulations from the likelihood. |
v |
Tuning parameter that governs the annealing speed. Defaults to 1.2, for the |
beta |
Tuning parameter that governs the mixing in parameter space. Defaults to 0.8. |
delta |
Tuning parameter for the resampling steps. Defaults to 0.1. |
resample |
Number of accepted particle updates after which a resampling step is performed. Defaults to 5* |
verbose |
Shows the iteration progress each |
adaptjump |
Whether to adapt covariance of jump distribution. Default is TRUE. |
method |
Argument to select algorithm. Accepts |
summarystats |
Whether summary statistics shall be calculated (semi-) automatically. Defaults to FALSE. |
y |
Data vector. Needs to be provided if either |
f.summarystats |
If |
... |
further arguments passed to |
SABC defines a class of algorithms for particle ABC that are inspired by Simulated Annealing. Unlike other algorithms, this class is not based on importance sampling, and hence does not suffer from a loss of effective sample size due to re-sampling. The approach is presented in detail in Albert, Kuensch, and Scheidegger (2014; see references).
This package implements two versions of SABC algorithms, for the cases of a non-informative or an informative prior. These are described in detail in the paper. The algorithms can be selected using the method
argument: method=noninformative
or method=informative
.
In the informative case, the algorithm corrects for the bias caused by an over- or under-representation of the prior.
The argument adaptjump
allows a choice of whether to adapt the covariance of the jump distribution. Default is TRUE.
Furthermore, the package allows for three different ways of using the data.
If y
is not provided, the algorithm expects r.model
to return a scalar measuring the distance between a random sample from the likelihood and the data.
If y
is provided and summarystats = FALSE
, the algorithm expects r.model
to return a random sample from the likelihood and uses the relative sum of squares to measure the distances between y
and random likelihood samples.
If summarystats = TRUE
the algorithm calculates summary statistics semi-automatically, as described in detail in the paper by Fearnhead et al. (2012; see references).
The summary statistics are calculated by means of a linear regression applied to a sample from the prior and the image of f.summarystats
of an associated sample from the likelihood.
Returns a list with the following components:
E |
Matrix with ensemble of samples. |
P |
Matrix with prior ensemble of samples. |
eps |
Value of tolerance (temperature) at final iteration. |
ESS |
Effective sample size, due to final bias correction ( |
Carlo Albert <[email protected]>, Andreas Scheidegger, Tobia Fasciati. Package initially compiled by Lukas M. Weber.
C. Albert, H. R. Kuensch and A. Scheidegger, Statistics and Computing 0960-3174 (2014), arXiv:1208.2157, A Simulated Annealing Approach to Approximate Bayes Computations.
P. Fearnhead and D. Prangle, J. R. Statist. Soc. B 74 (2012), Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation.
## Not run: ## Example for "noninformative" case # Prior is uniform on [-10,10] d.prior <- function(par) dunif(par,-10,10) r.prior <- function() runif(1,-10,10) # Model is the sum of two normal distributions. Return distance to observation 0: f.dist <- function(par) return( abs(rnorm( 1 , par , ifelse(runif(1)<0.5,1,0.1 ) ))) # Run algorithm ("noninformative" case) res <- SABC(f.dist,r.prior,d.prior,n.sample=500,eps.init=2,iter.max=50000) ## End(Not run) ## Not run: # Histogram of results hist(res$E[,1],breaks=200) ## End(Not run)
## Not run: ## Example for "noninformative" case # Prior is uniform on [-10,10] d.prior <- function(par) dunif(par,-10,10) r.prior <- function() runif(1,-10,10) # Model is the sum of two normal distributions. Return distance to observation 0: f.dist <- function(par) return( abs(rnorm( 1 , par , ifelse(runif(1)<0.5,1,0.1 ) ))) # Run algorithm ("noninformative" case) res <- SABC(f.dist,r.prior,d.prior,n.sample=500,eps.init=2,iter.max=50000) ## End(Not run) ## Not run: # Histogram of results hist(res$E[,1],breaks=200) ## End(Not run)