Title: | Power Simulation for Sequential Analyses and Multiple Hypotheses |
---|---|
Description: | Calculates, via simulation, power and appropriate stopping alpha boundaries (and/or futility bounds) for sequential analyses (i.e., group sequential design) as well as for multiple hypotheses (multiple tests included in an analysis), given any specified global error rate. This enables the sequential use of practically any significance test, as long as the underlying data can be simulated in advance to a reasonable approximation. Lukács (2022) <doi:10.21105/joss.04643>. |
Authors: | Gáspár Lukács [aut, cre] |
Maintainer: | Gáspár Lukács <[email protected]> |
License: | BSD_2_clause + file LICENSE |
Version: | 0.6.4 |
Built: | 2025-03-20 03:45:48 UTC |
Source: | https://github.com/gasparl/possa |
This function attempts to extract p values from certain tests where it could otherwise be complicated to do so. Please make sure, in case of each new test, whether the function actually returns the values you want.
get_p(x) ## S3 method for class 'aov' get_p(x) ## S3 method for class 'aovlist' get_p(x)
get_p(x) ## S3 method for class 'aov' get_p(x) ## S3 method for class 'aovlist' get_p(x)
x |
Test results. |
Supported functions: all tests that return the p value as p.value
(including most R stats
test functions, having htest
class),
and the aov()
function (aov
and aovlist
classes).
Returns either a single p value or, in case of multiple p values, a list or nested list with each p value.
get_p(aov)
: get_p method for class 'aov'
get_p(aovlist)
: get_p method for class 'aovlist'
get_p(t.test(extra ~ group, data = sleep)) # returns 0.07939414 # same as printed via t.test(extra ~ group, data = sleep) get_p(prop.test(c(83, 90, 129, 70), c(86, 93, 136, 82))) # returns 0.005585477, # same as printed prop.test(c(83, 90, 129, 70), c(86, 93, 136, 82)) get_p(aov(yield ~ block + N * P * K, npk)) # returns list of p values # corresponds to summary(aov(yield ~ block + N * P * K, npk)) get_p(aov(yield ~ N * P * K + Error(block), npk)) # returns nested list of p values (effects per error term) # again corresponds printed p values via summary()
get_p(t.test(extra ~ group, data = sleep)) # returns 0.07939414 # same as printed via t.test(extra ~ group, data = sleep) get_p(prop.test(c(83, 90, 129, 70), c(86, 93, 136, 82))) # returns 0.005585477, # same as printed prop.test(c(83, 90, 129, 70), c(86, 93, 136, 82)) get_p(aov(yield ~ block + N * P * K, npk)) # returns list of p values # corresponds to summary(aov(yield ~ block + N * P * K, npk)) get_p(aov(yield ~ N * P * K + Error(block), npk)) # returns nested list of p values (effects per error term) # again corresponds printed p values via summary()
Calculates power and local alphas based on simulated p values
(which should be provided as created by the
POSSA::sim
function). The calculation for
sequential testing involves a staircase procedure during which an initially
provided set of local alphas is continually adjusted until the (approximate)
specified global type 1 error rate (e.g., global alpha = .05) is reached:
the value of adjustment is decreasing while global type 1 error rate is
larger than specified, and increasing while global type 1 error rate is
smaller than specified; a smaller step is chosen whenever the direction
(increase vs. decrease) changes; the procedure stops when the global type 1
error rate is close enough to the specified one (e.g., matches it up to 4
fractional digits) or when the specified smallest step is passed. The
adjustment works via a dedicated ("adjust
") function that either
replaces missing (NA
) values with varying alternatives or (when there
are no missing values) in some manner varyingly modifies the initial values
(e.g. by addition or multiplication).
pow( p_values, alpha_locals = NULL, alpha_global = 0.05, adjust = TRUE, adj_init = NULL, staircase_steps = NULL, alpha_precision = 5, fut_locals = NULL, multi_logic_a = "all", multi_logic_fut = "all", multi_logic_global = "any", group_by = NULL, alpha_loc_nonstop = NULL, round_to = 5, iter_limit = 100, seed = 8, prog_bar = FALSE, hush = FALSE )
pow( p_values, alpha_locals = NULL, alpha_global = 0.05, adjust = TRUE, adj_init = NULL, staircase_steps = NULL, alpha_precision = 5, fut_locals = NULL, multi_logic_a = "all", multi_logic_fut = "all", multi_logic_global = "any", group_by = NULL, alpha_loc_nonstop = NULL, round_to = 5, iter_limit = 100, seed = 8, prog_bar = FALSE, hush = FALSE )
p_values |
A |
alpha_locals |
A number, a numeric vector, or a named |
alpha_global |
Global alpha (expected type 1 error rate in total);
|
adjust |
The function via which the initial vector local alphas is
modified with each step of the staircase procedure. Three arguments are
passed to it: |
adj_init |
The initial adjustment value that is used as the " |
staircase_steps |
Numeric vector that specifies the (normally decreasing)
sequence of step sizes for the staircase that narrows down on the specified
global error error by decreasing or increasing the adjustment value
(initially: |
alpha_precision |
During the staircase procedure, at any point when the
simulated global type 1 error rate first matches the given
|
fut_locals |
Specifies local futility bounds that may stop the experiment
at the given interim looks if the corresponding p value is above the given
futility bound value. When |
multi_logic_a |
When multiple p values are evaluated for local alpha
stopping rules, |
multi_logic_fut |
Same as |
multi_logic_global |
Similar as |
group_by |
When given as a character element or vector, specifies the
factors by which to group the analysis: the |
alpha_loc_nonstop |
Optional "non-stopper" alphas via which to evaluate p
values per look, but without stopping the data collection regardless of
statistical significance. Must be a list with names indicating p value
column name pairs, similarly as for the |
round_to |
Number of fractional digits (default: |
iter_limit |
In some specific cases of unideal/wrong input, the staircase
may get stuck at a given step's loop process. The |
seed |
Number for |
prog_bar |
Logical, |
hush |
Logical. If |
The returns a list
(with class "possa_pow_list"
)
that includes all details of the calculated power, T1ER, and sample
information. This list can be printed legibly (via POSSA's
print()
method).
For the replicability, in case the adjust
function uses any
randomization, set.seed
is executed in the beginning of this
function, each time it is called; see the seed
parameter.
This function uses, internally, the data.table
R package.
# below is a (very) minimal example # for more, see the vignettes via https://github.com/gasparl/possa#usage # create sampling function customSample = function(sampleSize) { list( sample1 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h0 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h1 = rnorm(sampleSize, mean = 5, sd = 10) ) } # create testing function customTest = function(sample1, sample2_h0, sample2_h1) { c( p_h0 = t.test(sample1, sample2_h0, 'less', var.equal = TRUE)$p.value, p_h1 = t.test(sample1, sample2_h1, 'less', var.equal = TRUE)$p.value ) } # run simulation dfPvals = sim( fun_obs = customSample, n_obs = 80, fun_test = customTest, n_iter = 1000 ) # get power info pow(dfPvals)
# below is a (very) minimal example # for more, see the vignettes via https://github.com/gasparl/possa#usage # create sampling function customSample = function(sampleSize) { list( sample1 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h0 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h1 = rnorm(sampleSize, mean = 5, sd = 10) ) } # create testing function customTest = function(sample1, sample2_h0, sample2_h1) { c( p_h0 = t.test(sample1, sample2_h0, 'less', var.equal = TRUE)$p.value, p_h1 = t.test(sample1, sample2_h1, 'less', var.equal = TRUE)$p.value ) } # run simulation dfPvals = sim( fun_obs = customSample, n_obs = 80, fun_test = customTest, n_iter = 1000 ) # get power info pow(dfPvals)
Prints, in a readable manner, the main information from any of
the data frames containing power information from the list created by the
POSSA::pow
function. This is an extension (method)
of the base R print
function, so it can be called simply as
print()
.
## S3 method for class 'possa_pow_df' print(x, round_to = 5, possa_title = TRUE, ...)
## S3 method for class 'possa_pow_df' print(x, round_to = 5, possa_title = TRUE, ...)
x |
Power information |
round_to |
Number of fractional digits to round to, for the displayed
numbers ( |
possa_title |
Set to |
... |
(Allow additional arguments for technical reasons.) |
Returns nothing (NULL
); this method serves only to print
information to the console.
Prints, in a readable manner, the main information from the list
created by the POSSA::pow
function, calling
print.possa_pow_df
for each of the POSSA power information
data frames in the list. This is an extension (method) of the base R
print
function, so it can be called simply as print()
.
## S3 method for class 'possa_pow_list' print(x, round_to = NA, ...)
## S3 method for class 'possa_pow_list' print(x, round_to = NA, ...)
x |
The |
round_to |
Number of fractional digits to round to, for the displayed
numbers. The default is the value passed from the
|
... |
(Allow additional arguments for technical reasons.) |
Returns nothing (NULL
); this method serves only to print
information to the console.
Prints information about the simulated p values created by the
POSSA::sim
function. This is an extension (method)
of the base R print
function, so it can be called simply as
print()
.
## S3 method for class 'possa_sim_df' print(x, group_by = NULL, descr_cols = TRUE, descr_func = summary, ...)
## S3 method for class 'possa_sim_df' print(x, group_by = NULL, descr_cols = TRUE, descr_func = summary, ...)
x |
The |
group_by |
When given as a character element or vector, specifies the
factors by which to group the descriptives: the |
descr_cols |
When given as a character element or vector, specifies the
factors for which descriptive data should be shown (by group, if
applicable). By default |
descr_func |
Function used for printing descriptives (see
|
... |
(Allow additional arguments for technical reasons.) |
Returns nothing (NULL
); this method serves only to print
information to the console.
This function performs the simulation procedure in order to get
the p values that will eventually serve for power calculations (via
pow
). The observation values ("sample") to be tested are
simulated via the given fun_obs
function, and the significance
testing is performed via the given fun_test
function. The numbers of
observations per look (for a sequential design) are specified in
n_obs
.
sim( fun_obs, n_obs, fun_test, n_iter = 45000, adjust_n = 1, seed = 8, pair = NULL, ignore_suffix = FALSE, prog_bar = FALSE, hush = FALSE )
sim( fun_obs, n_obs, fun_test, n_iter = 45000, adjust_n = 1, seed = 8, pair = NULL, ignore_suffix = FALSE, prog_bar = FALSE, hush = FALSE )
fun_obs |
A |
n_obs |
A numeric vector or a named list of numeric vectors. Specifies
the numbers of observations (i.e., samples sizes) that are to be generated
by |
fun_test |
The function for significance testing. The list of samples
returned by |
n_iter |
Number of iterations (default: 45000). |
adjust_n |
Adjust total number of observations via simple multiplication.
Might be useful in some specific cases, e.g. if for some reason multiple p
values are derived from the same sample without specifying grouping
( |
seed |
Number for |
pair |
Logical or |
ignore_suffix |
Set to |
prog_bar |
Logical, |
hush |
Logical, |
To specify a variable that differs depending on whether the null hypothesis
("H0") or the alternative hypothesis ("H1") is true, a pair of samples are
needed for fun_test
, for which the argument names should have an
identical root and "_h0
" and "_h1
" endings, such as
"var_x_h0
" (for sample in case of H0) and "var_x_h1
" (for sample
in case of H1). Then, since the observation number for this pair will always
be the same, as a convenience, parameters with "_h0
" and "_h1
"
endings specifically can be specified together in n_obs
with the last
"0"/"1" character dropped, hence ending with "_h
". So, for example,
"var_x_h = c(30, 60, 90)
" will be automatically adjusted to specify the
observation numbers for both "var_x_h0
" and "var_x_h1
". In that
case, fun_obs
must have a single argument "var_x_h
", while
fun_test
must have both full names as arguments ("var_x_h0
" and
"var_x_h1
").
Optionally, fun_obs
can be provided in list
format for
the convenience of exploring varying factors (e.g., different effect sizes,
correlations) at once, without writing a dedicated fun_obs
function for
each combination, and each time separately running the simulation and the
power calculation. In this case, the first element of the list must be the
actual function
, which contains certain parameters for
specifying varying factors, while the rest of the elements should contain the
various argument values for these parameters of the function as named elements
of the list (e.g., list(my_function, factor1=c(1, 2, 3), factor2=c(0,
5))
), with the name corresponding to the parameter name in the function, and
the varying values (numbers or strings). When so specified, a separate
simulation procedure will be run for each combination of the given factors
(or, if only one factor is given, for each element of that factor). The
POSSA::pow
function will be able to automatically
detect (by default) the factors generated this way in the present
POSSA::sim
function, in order to calculate power
separately for each factor combination.
Returns a data.frame
(with class "possa_sim_df"
)
that includes the columns .iter
(the iterations of the simulation
procedure numbered from 1
to n_iter
), .look
(the
interim "looks" numbered from 1
to the maximum number of looks,
including the final one), and the information returned by the
fun_test
function for H0 and H1 outcomes (mainly p values; but also
other, optional information, if any) and the corresponding observation
numbers, as well as the total observation number per each look under a
dedicated .n_total
column. When this data frame is printed to the
console (via POSSA's print()
method), the head (first few lines) of the data is shown, as well as, in
case of any varying factors included, summary information per factor
combination.
For the replicability (despite the randomization), set.seed
is
executed in the beginning of this function, each time it is called; see the
seed
parameter.
# below is a (very) minimal example # for more, see the vignettes via https://github.com/gasparl/possa#usage # create sampling function customSample = function(sampleSize) { list( sample1 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h0 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h1 = rnorm(sampleSize, mean = 5, sd = 10) ) } # create testing function customTest = function(sample1, sample2_h0, sample2_h1) { c( p_h0 = t.test(sample1, sample2_h0, 'less', var.equal = TRUE)$p.value, p_h1 = t.test(sample1, sample2_h1, 'less', var.equal = TRUE)$p.value ) } # run simulation dfPvals = sim( fun_obs = customSample, n_obs = 80, fun_test = customTest, n_iter = 1000 ) # get power info pow(dfPvals)
# below is a (very) minimal example # for more, see the vignettes via https://github.com/gasparl/possa#usage # create sampling function customSample = function(sampleSize) { list( sample1 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h0 = rnorm(sampleSize, mean = 0, sd = 10), sample2_h1 = rnorm(sampleSize, mean = 5, sd = 10) ) } # create testing function customTest = function(sample1, sample2_h0, sample2_h1) { c( p_h0 = t.test(sample1, sample2_h0, 'less', var.equal = TRUE)$p.value, p_h1 = t.test(sample1, sample2_h1, 'less', var.equal = TRUE)$p.value ) } # run simulation dfPvals = sim( fun_obs = customSample, n_obs = 80, fun_test = customTest, n_iter = 1000 ) # get power info pow(dfPvals)