Professional Documents
Culture Documents
ESTUDIANTE
YULI TATIANA SAAVEDRA ANGEL
DOCENTE
CARLOS DAVID MONCADA BELTRAN
Simulation studies are used to obtain empirical results about the performance of
statistical methods in certain scenarios, as opposed to more general analytic
(algebraic) results, which may cover many scenarios. It is not always possible, or
may be difficult, to obtain analytic results. Simulation studies come into their own
when methods make wrong assumptions or data are messy because they can
assess the resilience of methods in such situations. We use the term “data-
generating mechanism” to denote how random numbers are used to generate a
dataset. This is in preference to “data-generating model,” which implies parametric
models and so is a specific class of data-generating mechanism. It is not the
purpose of this article to explain how specific types of data should be generated.
Data may be generated by producing parametric draws from a known model (once
or many times), or by repeated resampling with replacement from a specific
dataset (where the true data-generating model is unknown). For resampling
studies, the true data-generating mechanism is unknown and resamples are used
to study the sampling distribution. While parametric simulation can explore many
different data-generating mechanisms (which may be completely unrealistic),
resampling typically explores only one mechanism (which will be relevant for at
least the study at hand).
Simulation studies are an invaluable tool for research into statistical methods,
evidenced by the large proportion. Because methods promoted may be used in
medical research (or many other scientific areas), transparent reporting of the
design and execution of simulation studies is critical. While simulation studies are
widely used, they tend to be poorly reported by those who publish their results.
There are many areas to be improved in the reporting of simulation studies.