floodgate.Rd
This function infers the model-free variable importance, mMSE gap via floodgate.
floodgate( X, Y, i1, i2, nulls.list = NULL, gamma_X.list = NULL, sigma_X.list = NULL, Xmodel = "gaussian", funs, algo = "lasso", cv.rule = "min", one.sided = TRUE, alevel = 0.05, test = "z", verbose = TRUE )
X | a n by p matrix, containing all the covariates. |
---|---|
Y | a n by 1 matrix, containing the response variables. |
i1 | the index of training samples. |
i2 | the index of inference samples. |
nulls.list | a list of length p, whose element is a (|i2|*K)-dimensional vector, which contains K set of null samples. |
gamma_X.list | a list of length p, with each element being the linear coefficient of the given covariate on the other covariates (only relevant when Xmodel = "gaussian"; default: NULL). |
sigma_X.list | a list of length p, with each element being the variance of the conditional distribution. |
Xmodel | model of the covaraites (default: "gaussian"). |
funs | a list of three: train.fun, active.fun and predict.fun. |
algo | a fitting algorithm (default: "lasso"). |
cv.rule | indicates which rule should be used for the predict function, either "min" (the usual rule) or "1se" (the one-standard- error rule); default: "min"). See the glmnet help files for details. |
one.sided | whether to obtain LCB or p-values via the one-sided way (default: TRUE). |
alevel | confidence level (defaul: 0.05). |
test | type of the hypothesis test (defaul: "z"). |
verbose | whether to show intermediate progress (default: FALSE). |
A list of three objects. inf.out: a matrix of |S|-by-4, containing the p-values, LCI, UCI and the floodgate LCB for variable in S; S: a list of selected variables; cpu.time: computing time.
Zhang L, Janson L (2020). “Floodgate: Inference for Model-Free Variable Importance.” arXiv preprint arXiv:2007.01283.
Other methods:
calculate.V_mean()
,
calulate.mu_Xk()
,
fg.inference.binary()
,
fg.inference()
,
fit.mu()
,
floodgate.binary()
,
inference_general()