Help for package CGGP

Type:

Package

Title:

Composite Grid Gaussian Processes

Version:

1.0.4

Description:

Run computer experiments using the adaptive composite grid algorithm with a Gaussian process model. The algorithm works best when running an experiment that can evaluate thousands of points from a deterministic computer simulation. This package is an implementation of a forthcoming paper by Plumlee, Erickson, Ankenman, et al. For a preprint of the paper, contact the maintainer of this package.

License:

GPL-3

Imports:

Rcpp (≥ 0.12.18)

LinkingTo:

Rcpp, RcppArmadillo

Suggests:

testthat, covr, ggplot2, reshape2, plyr, MASS, rmarkdown, knitr

URL:

https://github.com/CollinErickson/CGGP

BugReports:

https://github.com/CollinErickson/CGGP/issues

Encoding:

UTF-8

RoxygenNote:

7.2.3

VignetteBuilder:

knitr

NeedsCompilation:

yes

Packaged:

2024-01-23 02:12:28 UTC; colli

Author:

Collin Erickson [aut, cre], Matthew Plumlee [aut]

Maintainer:

Collin Erickson <collinberickson@gmail.com>

Repository:

CRAN

Date/Publication:

2024-01-23 03:22:57 UTC

CGGP: A package for running sparse grid computer experiments

Description

The CGGP package implements the method presented in Plumlee et al. (2019).

CGGP functions

The CGGP functions: CGGPcreate, CGGPfit, CGGPappend, and CGGPpred

Cauchy correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchy(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchy(c(0,.2,.4),c(.1,.3,.5), theta=c(-1,.9,.1))

CauchySQ correlation function

Description

Calculate correlation matrix for two sets of points in one dimension Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchySQ(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchySQ(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,-.5))

CauchySQT correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatCauchySQT(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatCauchySQT(c(0,.2,.4),c(.1,.3,.5), theta=c(-.1,.3,-.7))

Gaussian correlation function

Description

Calculate correlation matrix for two sets of points in one dimension Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatGaussian(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Details

WE HIGHLY ADVISE NOT USING THIS CORRELATION FUNCTION. Try Power Exponential, CauchySQT, Cauchy, or Matern 3/2 instead.

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatGaussian(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Matern 3/2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatMatern32(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatMatern32(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Matern 5/2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatMatern52(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatMatern52(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7))

Power exponential correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatPowerExp(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatPowerExp(c(0,.2,.4),c(.1,.3,.5), theta=c(-.7,.2))

Wendland0 (Triangle) correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland0(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland0(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Wendland1 1 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland1(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland1(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Wendland2 2 correlation function

Description

Calculate correlation matrix for two sets of points in one dimension. Note that this is not the correlation between two vectors.

Usage

CGGP_internal_CorrMatWendland2(
  x1,
  x2,
  theta,
  return_dCdtheta = FALSE,
  return_numpara = FALSE,
  returnlogs = FALSE
)

Arguments

x1

Vector of coordinates from same dimension

x2

Vector of coordinates from same dimension

theta

Correlation parameters:

LS Log of parameter that controls lengthscale
FD Logit of 0.5*parameter that controls the fractal dimension
HE Log of parameter that controls the hurst effect

return_dCdtheta

Should dCdtheta be returned?

return_numpara

Should it just return the number of parameters?

returnlogs

Should log of correlation be returned?

Value

Matrix of correlation values between x1 and x2

Examples

CGGP_internal_CorrMatWendland2(c(0,.2,.4),c(.1,.3,.5), theta=-.7)

Calculate MSE prediction along a single dimension

Description

Calculate MSE prediction along a single dimension

Usage

CGGP_internal_MSEpredcalc(xp, xl, theta, CorrMat)

Arguments

xp

Points at which to calculate MSE

xl

Levels along dimension, vector???

theta

Correlation parameters

CorrMat

Function that gives correlation matrix for vectors of 1D points.

Value

MSE predictions

Examples

CGGP_internal_MSEpredcalc(c(.4,.52), c(0,.25,.5,.75,1), theta=c(.1,.2),
             CorrMat=CGGP_internal_CorrMatCauchySQ)

Calculate MSE over single dimension

Description

Calculated using grid of integration points. Can be calculated exactly, but not much reason in 1D.

Usage

CGGP_internal_calcMSE(xl, theta, CorrMat)

Arguments

xl

Vector of points in 1D

theta

Correlation parameters

CorrMat

Function that gives correlation matrix for vectors of 1D points.

Value

MSE value

Examples

CGGP_internal_calcMSE(xl=c(0,.5,.9), theta=c(1,2,3),
         CorrMat=CGGP_internal_CorrMatCauchySQT)

Calculate MSE over blocks

Description

Delta of adding block is product over i=1..d of IMSE(i,j-1) - IMSE(i,j)

Usage

CGGP_internal_calcMSEde(valsinds, MSE_MAP)

Arguments

valsinds

Block levels to calculate MSEs for

MSE_MAP

Matrix of MSE values

Value

All MSE values

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
MSE_MAP <- outer(1:SG$d, 1:8, 
     Vectorize(function(dimlcv, lcv1) {
        CGGP_internal_calcMSE(SG$xb[1:SG$sizest[dimlcv]],
        theta=SG$thetaMAP[(dimlcv-1)*SG$numpara+1:SG$numpara],
        CorrMat=SG$CorrMat)
 }))
CGGP_internal_calcMSEde(SG$po[1:SG$poCOUNT, ], MSE_MAP)

Calculate predictive weights for CGGP

Description

Predictive weights are Sigma^(-1)*y in standard GP. This calculation is much faster since we don't need to solve the full system of equations.

Usage

CGGP_internal_calcpw(CGGP, y, theta, return_lS = FALSE)

Arguments

CGGP

CGGP object

y

Measured values for CGGP$design

theta

Correlation parameters

return_lS

Should lS be returned?

Value

Vector with predictive weights

Examples

cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)

Calculate derivative of pw

Description

Calculate derivative of pw

Usage

CGGP_internal_calcpwanddpw(CGGP, y, theta, return_lS = FALSE)

Arguments

CGGP

CGGP object

y

Measured values for CGGP$design

theta

Correlation parameters

return_lS

Should lS and dlS be returned?

Value

derivative matrix of pw with respect to logtheta

Examples

cggp <- CGGPcreate(d=3, batchsize=100)
y <- apply(cggp$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
CGGP_internal_calcpwanddpw(CGGP=cggp, y=y, theta=cggp$thetaMAP)

Gradient of negative log likelihood posterior

Description

Gradient of negative log likelihood posterior

Usage

CGGP_internal_gneglogpost(
  theta,
  CGGP,
  y,
  ...,
  return_lik = FALSE,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)

Arguments

theta

Log of correlation parameters

CGGP

CGGP object

y

CGGP$design measured values

...

Forces you to name remaining arguments

return_lik

If yes, it returns a list with lik and glik

ys

Supplementary output data

Xs

Supplementary input data

HandlingSuppData

How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data

Value

Vector for gradient of likelihood w.r.t. x (theta)

Examples

cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_gneglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)

Calculate negative log posterior

Description

Calculate negative log posterior

Usage

CGGP_internal_neglogpost(
  theta,
  CGGP,
  y,
  ...,
  ys = NULL,
  Xs = NULL,
  HandlingSuppData = "Correct"
)

Arguments

theta

Correlation parameters

CGGP

CGGP object

y

Measured values of CGGP$design

...

Forces you to name remaining arguments

ys

Supplementary output data

Xs

Supplementary input data

HandlingSuppData

How should supplementary data be handled?

Correct: full likelihood with grid and supplemental data
Only: only use supplemental data
Ignore: ignore supplemental data

Value

Likelihood

Examples

cg <- CGGPcreate(d=3, batchsize=20)
Y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(cg, Y)
CGGP_internal_neglogpost(cg$thetaMAP, CGGP=cg, y=cg$y)

Set correlation function of CGGP object

Description

Set correlation function of CGGP object

Usage

CGGP_internal_set_corr(CGGP, corr)

Arguments

CGGP

CGGP object

corr

Correlation function

Value

CGGP object

Examples

obj <- CGGPcreate(3, 20, corr="matern52")
CGGP_internal_set_corr(obj, "gaussian")

Add points to CGGP

Description

Add 'batchsize' points to 'SG' using 'theta'.

Usage

CGGPappend(CGGP, batchsize, selectionmethod = "MAP")

Arguments

CGGP

Sparse grid object

batchsize

Number of points to add

selectionmethod

How points will be selected: one of 'UCB', 'TS', 'MAP', 'Oldest', 'Random', or 'Lowest'. 'UCB' uses Upper Confidence Bound estimates for the parameters. 'TS' uses Thompson sampling, a random sample from the posterior. 'MAP' uses maximum a posteriori parameter estimates. 'Oldest' adds the block that has been available the longest. 'Random' adds a random block. 'Lowest' adds the block with the lowest sum of index levels. 'UCB' and 'TS' are based on bandit algorithms and account for uncertainty in the parameter estimates, but are the slowest. ‘MAP' is fast but doesn’t account for parameter uncertainty. The other three are naive methods that are not adaptive and won't perform well.

Value

SG with new points added.

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2})
SG <- CGGPfit(SG, Y=y)
SG <- CGGPappend(CGGP=SG, batchsize=20, selectionmethod="MAP")

Create sparse grid GP

Description

Create sparse grid GP

Usage

CGGPcreate(
  d,
  batchsize,
  corr = "PowerExponential",
  grid_sizes = c(1, 2, 4, 4, 8, 12, 20, 28, 32),
  Xs = NULL,
  Ys = NULL,
  HandlingSuppData = "Correct",
  supp_args = list()
)

Arguments

d

Input dimension

batchsize

Number added to design each batch for now only on predictions

corr

Name of correlation function to use. Must be one of "CauchySQT", "CauchySQ", "Cauchy", "Gaussian", "PowerExp", "Matern32", "Matern52".

grid_sizes

Size of grid refinements.

Xs

Supplemental X data

Ys

Supplemental Y data

HandlingSuppData

How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data

supp_args

Arguments used to fit if Xs and Ys are given

Value

CGGP

Examples

CGGPcreate(d=8,200)

Update CGGP model given data

Description

This function will update the GP parameters for a CGGP design.

Usage

CGGPfit(
  CGGP,
  Y,
  Xs = NULL,
  Ys = NULL,
  theta0 = pmax(pmin(CGGP$thetaMAP, 0.8), -0.8),
  HandlingSuppData = CGGP$HandlingSuppData,
  separateoutputparameterdimensions = is.matrix(CGGP$thetaMAP),
  set_thetaMAP_to,
  corr,
  Ynew
)

Arguments

CGGP

Sparse grid objects

Y

Output values calculated at CGGP$design

Xs

Supplemental X matrix

Ys

Supplemental Y values

theta0

Initial theta

HandlingSuppData

How should supplementary data be handled? * Correct: full likelihood with grid and supplemental data * Only: only use supplemental data * Ignore: ignore supplemental data

separateoutputparameterdimensions

If multiple output dimensions, should separate parameters be fit to each dimension?

set_thetaMAP_to

Value for thetaMAP to be set to

corr

Will update correlation function, if left missing it will be same as last time.

Ynew

Values of 'CGGP$design_unevaluated'

Value

Updated CGGP object fit to data given

Examples

cg <- CGGPcreate(d=3, batchsize=100)
y <- apply(cg$design, 1, function(x){x[1]+x[2]^2})
cg <- CGGPfit(CGGP=cg, Y=y)

CGGP block plot

Description

Plot the 2D projections of the blocks of an CGGP object.

Usage

CGGPplotblocks(CGGP, singleplot = TRUE)

Arguments

CGGP

CGGP object

singleplot

If only two dimensions, should a single plot be made?

Value

ggplot2 plot

Examples


# The first and fourth dimensions are most active and will have greater depth
ss <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + x[3]*exp(4*x[4])}
ss <- CGGPfit(ss, Y=apply(ss$design, 1, f))
ss <- CGGPappend(CGGP=ss, batchsize=100)
CGGPplotblocks(ss)

mat <- matrix(c(1,1,1,2,2,1,2,2,1,3), ncol=2, byrow=TRUE)
CGGPplotblocks(mat)

Plot CGGP block selection over time

Description

Shows the order in which blocks were selected for each dimension. Gives an idea of how the selections change over time.

Usage

CGGPplotblockselection(CGGP, indims)

Arguments

CGGP

CGGP object

indims

Which input dimensions should be shown?

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
# All dimensions will look similar
CGGPplotblockselection(gs)

# You need to append with CGGPappend after fitting to see a difference
f <- function(x){x[1]^1.2}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
gs <- CGGPappend(gs, 100)
# Now you will see higher for X1 from 100 to 200 while others remain low.
CGGPplotblockselection(gs)

Plot correlation samples

Description

Plot samples for a given correlation function and parameters. Useful for getting an idea of what the correlation parameters mean in terms of smoothness.

Usage

CGGPplotcorr(
  Corr = CGGP_internal_CorrMatGaussian,
  theta = NULL,
  numlines = 20,
  outdims = NULL,
  zero = TRUE
)

Arguments

Corr

Correlation function or CGGP object. If CGGP object, it will make plots for thetaMAP, the max a posteriori theta.

theta

Parameters for Corr

numlines

Number of sample paths to draw

outdims

Which output dimensions should be used?

zero

Should the sample paths start at y=0?

Value

Plot

Examples


CGGPplotcorr()
CGGPplotcorr(theta=c(-2,-1,0,1))

SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+sin(2*pi*x[2]*3)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotcorr(SG)

Heatmap of SG design depth

Description

The values on the diagonal are largest design depth for that dimension. The off-diagonal values are the largest design depth that both dimensions have been measured at simultaneously. A greater depth means that more points have been measured along that dimension or two-dimensional subspace.

Usage

CGGPplotheat(CGGP)

Arguments

CGGP

CGGP object

Value

A heat map made from ggplot2

References

https://stackoverflow.com/questions/14290364/heatmap-with-values-ggplot2

Examples


# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplotheat(SG)

# The first and fourth dimensions are most active and will have greater depth
SG <- CGGPcreate(d=5, batchsize=50)
f <- function(x) {cos(2*pi*x[1]*3) + exp(4*x[4])}
for (i in 1:1) {
  SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
  SG <- CGGPappend(CGGP=SG, batchsize=200)
}
# SG <- CGGPfit(SG, Y=apply(SG$design, 1, f))
CGGPplotheat(SG)

Histogram of measurements at each design depth of each input dimension

Description

A greater design depth signifies a more important dimension. Thus a larger right tail on the histogram are more important variables.

Usage

CGGPplothist(CGGP, ylog = TRUE)

Arguments

CGGP

CGGP object

ylog

Should the y axis be put on a log scale?

Value

Histogram plot made using ggplot2

Examples


# All dimensions should look similar
d <- 8
SG = CGGPcreate(d,201)
CGGPplothist(SG)
CGGPplothist(SG, ylog=FALSE)

# The first dimension is more active and will have greater depth
f <- function(x) {sin(x[1]^.6*5)}
SG <- CGGPcreate(d=5, batchsize=100)
SG <- CGGPfit(SG, apply(SG$design, 1, f))
SG <- CGGPappend(CGGP=SG, batchsize=1000)
CGGPplothist(SG)

Plot negative log posterior likelihood of samples

Description

Plot negative log posterior likelihood of samples

Usage

CGGPplotsamplesneglogpost(CGGP)

Arguments

CGGP

CGGP object

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplotsamplesneglogpost(gs)

CGGP slice plot

Description

Show prediction plots when varying over only one dimension. Most useful when setting all values to 0.5 because it will have the most points.

Usage

CGGPplotslice(
  CGGP,
  proj = 0.5,
  np = 300,
  color = "pink",
  outdims,
  scales = "free_y",
  facet = "grid"
)

Arguments

CGGP

CGGP object

proj

Point to project onto

np

Number of points to use along each dimension

color

Color to make error region

outdims

If multiple outputs, which of them should be plotted?

scales

Parameter passed to ggplot2::facet_grid()

facet

If "grid", will use ggplot2::facet_grid(), if "wrap" will use ggplot2::facet_wrap(). Only applicable for a single output dimension.

Value

ggplot2 object

Examples


d <- 5
f1 <- function(x){x[1]+x[2]^2 + cos(x[3]^2*2*pi*4) - 3.3}
s1 <- CGGPcreate(d, 200)
s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
#s1 <- CGGPappend(s1, 200)
#s1 <- CGGPfit(s1, apply(s1$design, 1, f1))
CGGPplotslice(s1)
CGGPplotslice(s1, 0.)
CGGPplotslice(s1, s1$design[nrow(s1$design),])

Plot theta samples

Description

Plot theta samples

Usage

CGGPplottheta(CGGP)

Arguments

CGGP

CGGP object

Value

ggplot2 object

Examples

gs <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(gs$design, 1, f)
gs <- CGGPfit(gs, Y=y)
CGGPplottheta(gs)

Plot something similar to a semivariogram

Description

It's not actually a variogram or semivariogram. It shows how the correlation function falls off as distance increases.

Usage

CGGPplotvariogram(CGGP, facet = 1, outdims = NULL)

Arguments

CGGP

CGGP object

facet

How should the plots be faceted? If 1, in a row, if 2, in a column, if 3, wrapped around.

outdims

Which output dimensions should be shown.

Value

ggplot2 object

Examples

SG <- CGGPcreate(d=3, batchsize=100)
f <- function(x){x[1]^1.2+x[3]^.4*sin(2*pi*x[2]^2*3) + .1*exp(3*x[3])}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, Y=y)
CGGPplotvariogram(SG)

Plot validation prediction errors for CGGP object

Description

Plot validation prediction errors for CGGP object

Usage

CGGPvalplot(CGGP, Xval, Yval, d = NULL)

Arguments

CGGP

CGGP object that has been fitted

Xval

X validation data

Yval

Y validation data

d

If output is multivariate, which column to use. Will do all if left as NULL.

Value

None, makes a plot

Examples

SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalplot(CGGP=SG, Xval=Xval, Yval=Yval)

Calculate stats for CGGP prediction on validation data

Description

Calculate stats for CGGP prediction on validation data

Usage

CGGPvalstats(CGGP, Xval, Yval, bydim = TRUE, ...)

Arguments

CGGP

CGGP object

Xval

X validation matrix

Yval

Y validation data

bydim

If multiple outputs, should it be done separately by dimension?

...

Passed to valstats, such as which stats to calculate.

Value

data frame

Examples


SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(SG$design, 1, f1)
SG <- CGGPfit(SG, y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- apply(Xval, 1, f1)
CGGPvalstats(CGGP=SG, Xval=Xval, Yval=Yval)

# Multiple outputs
SG <- CGGPcreate(d=3, batchsize=100)
f1 <- function(x){x[1]+x[2]^2}
f2 <- function(x){x[1]^1.3+.4*sin(6*x[2])+10}
y1 <- apply(SG$design, 1, f1)#+rnorm(1,0,.01)
y2 <- apply(SG$design, 1, f2)#+rnorm(1,0,.01)
y <- cbind(y1, y2)
SG <- CGGPfit(SG, Y=y)
Xval <- matrix(runif(3*100), ncol=3)
Yval <- cbind(apply(Xval, 1, f1),
              apply(Xval, 1, f2))
CGGPvalstats(SG, Xval, Yval)
CGGPvalstats(SG, Xval, Yval, bydim=FALSE)

S3 plot method for CGGP

Description

There are a few different plot functions for CGGP objects: 'CGGPplotblocks', 'CGGPplotblockselection', 'CGGPplotcorr', 'CGGPplotheat', 'CGGPplothist', 'CGGPvalplot', 'CGGPplotslice', 'CGGPplotslice', and 'CGGPplotvariogram'. Currently 'CGGPplotblocks' is the default plot object.

Usage

## S3 method for class 'CGGP'
plot(x, y, ...)

Arguments

x

CGGP object

y

Don't use

...

Passed to CGGPplotblocks

Value

Either makes plot or returns plot object

Examples

SG = CGGPcreate(3,100)
plot(SG)

S3 predict method for CGGP

Description

Passes to CGGPpred

Predict using SG with y values at xp? Shouldn't y values already be stored in SG?

Usage

## S3 method for class 'CGGP'
predict(object, xp, ...)

CGGPpred(CGGP, xp, theta = NULL, outdims = NULL)

Arguments

object

CGGP object

xp

x value to predict at

...

Other arguments passed to 'CGGPpred'

CGGP

SG object

theta

Leave as NULL unless you want to use a value other than thetaMAP. Much slower.

outdims

If multiple outputs fit without PCA and with separate parameters, you can predict just for certain dimensions to speed it up. Will leave other columns in the output, but they will be wrong.

Value

Predicted mean values

Examples

SG <- CGGPcreate(d=3, batchsize=100)
y <- apply(SG$design, 1, function(x){x[1]+x[2]^2+rnorm(1,0,.01)})
SG <- CGGPfit(SG, Y=y)
CGGPpred(SG, matrix(c(.1,.1,.1),1,3))
cbind(CGGPpred(SG, SG$design)$mean, y) # Should be near equal

Print CGGP object

Description

Default print as a list is bad since there's a lot of elements.

Usage

## S3 method for class 'CGGP'
print(x, ...)

Arguments

x

CGGP object

...

Passed to print

Value

String to be printed

Examples

SG = CGGPcreate(3,21)
print(SG)
f <- function(x) {x[1]+exp(x[2]) + log(x[3]+4)}
y <- apply(SG$design, 1, f)
SG <- CGGPfit(SG, y)
print(SG)

rcpp_fastmatclcr

Description

rcpp_fastmatclcr

Usage

rcpp_fastmatclcr(I, w, MSEmat, S, maxlevel)

Arguments

I

Matrix

w

vector

MSEmat

Matrix

S

Vector

maxlevel

Integer

Value

Nothing, void

rcpp_fastmatclcranddclcr

Description

rcpp_fastmatclcranddclcr

Usage

rcpp_fastmatclcranddclcr(I, w, MSEmat, dMSEmat, S, dS, maxlevel, numpara)

Arguments

I

Matrix

w

vector

MSEmat

Matrix

dMSEmat

Matrix

S

Vector

dS

Matrix

maxlevel

Integer

numpara

Integer

Value

Nothing, void

rcpp_kronDBS

Description

rcpp_kronDBS

Usage

rcpp_gkronDBS(A, dA, B, p)

Arguments

A

Vector

dA

Vector

B

Vector

p

Vector

Value

kronDBS calculation

Examples

rcpp_gkronDBS(c(1,1), c(0,0), c(.75), c(1,1))

rcpp_kronDBS

Description

rcpp_kronDBS

Usage

rcpp_kronDBS(A, B, p)

Arguments

A

Vector

B

Vector

p

Vector

Value

kronDBS calculation

Plot validation prediction errors

Description

Plot validation prediction errors

Usage

valplot(predmean, predvar, Yval, d = NULL)

Arguments

predmean

Predicted mean

predvar

Predicted variance

Yval

Y validation data

d

If output is multivariate, which column to use. Will do all if left as NULL.

Value

None, makes a plot

Examples

x <- matrix(runif(100*3), ncol=3)
f1 <- function(x){x[1]+x[2]^2}
y <- apply(x, 1, f1)
# Create a linear model on the data
mod <- lm(y ~ ., data.frame(x))
# Predict at validation data
Xval <- matrix(runif(3*100), ncol=3)
mod.pred <- predict.lm(mod, data.frame(Xval), se.fit=TRUE)
# Compare to true results
Yval <- apply(Xval, 1, f1)
valplot(mod.pred$fit, mod.pred$se.fit^2, Yval=Yval)

Calculate stats for prediction on validation data

Description

Calculate stats for prediction on validation data

Usage

valstats(
  predmean,
  predvar,
  Yval,
  bydim = TRUE,
  RMSE = TRUE,
  score = TRUE,
  CRPscore = TRUE,
  coverage = TRUE,
  corr = TRUE,
  R2 = TRUE,
  MAE = FALSE,
  MIS90 = FALSE,
  metrics,
  min_var = .Machine$double.eps
)

Arguments

predmean

Predicted mean

predvar

Predicted variance

Yval

Y validation data

bydim

If multiple outputs, should it be done separately by dimension?

RMSE

Should root mean squared error (RMSE) be included?

score

Should score be included?

CRPscore

Should CRP score be included?

coverage

Should coverage be included?

corr

Should correlation between predicted and true mean be included?

R2

Should R^2 be included?

MAE

Should mean absolute error (MAE) be included?

MIS90

Should mean interval score for 90% confidence be included? See Gneiting and Raftery (2007).

metrics

Optional additional metrics to be calculated. Should have same first three parameters as this function.

min_var

Minimum value of the predicted variance. Negative or zero variances can cause errors.

Value

data frame

References

Gneiting, Tilmann, and Adrian E. Raftery. "Strictly proper scoring rules, prediction, and estimation." Journal of the American Statistical Association 102.477 (2007): 359-378.

Examples

valstats(c(0,1,2), c(.01,.01,.01), c(0,1.1,1.9))
valstats(cbind(c(0,1,2), c(1,2,3)),
         cbind(c(.01,.01,.01),c(.1,.1,.1)),
         cbind(c(0,1.1,1.9),c(1,2,3)))
valstats(cbind(c(0,1,2), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(0,1.1,1.9),c(10,20,30)), bydim=FALSE)
valstats(cbind(c(.8,1.2,3.4), c(8,12,34)),
         cbind(c(.01,.01,.01),c(1.1,.81,1.1)),
         cbind(c(1,2,3),c(10,20,30)), bydim=FALSE)

CGGP: A package for running sparse grid computer experiments

Description

CGGP functions

Cauchy correlation function

Description

Usage

Arguments

Value

See Also

Examples

CauchySQ correlation function

Description

Usage

Arguments

Value

See Also

Examples

CauchySQT correlation function

Description

Usage

Arguments

Value

See Also

Examples

Gaussian correlation function

Description

Usage

Arguments

Details

Value

See Also

Examples

Matern 3/2 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Matern 5/2 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Power exponential correlation function

Description

Usage

Arguments

Value

See Also

Examples

Wendland0 (Triangle) correlation function

Description

Usage

Arguments

Value

See Also

Examples

Wendland1 1 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Wendland2 2 correlation function

Description

Usage

Arguments

Value

See Also

Examples

Calculate MSE prediction along a single dimension

Description

Usage

Arguments

Value

Examples