One Preregistration to Rule Them All?

February 5th, 2018,
Posted in: Preregistration

One of the most frequently asked questions we receive about preregistration is “how do I preregister this complex study that will result in many different publications?” This question is particularly challenging if there is only a single data collection effort that will be used to support distinct parts of the larger study and subsequent papers.

First, let's describe where this is not problematic. Many papers describe a series of studies, each with a concise objective. If a round of data collection precedes each study, a researcher can preregister prior to each*, and report the results of confirmed findings and unexpected discoveries separately for each study. Nothing out of the ordinary there.

However, many large projects, particularly interdisciplinary collaborations, will collect data from a population of study participants. Then, each researcher will conduct a set of analyses, while collaborating to a greater or lesser extent with their peers. This is problematic. 

Why? Because complete reporting of proposed analyses is necessary in order to properly interpret the results. Let's imagine an extreme example to demonstrate the possible problem: we preregister 100 hypothesis tests but only report the results of the 5 that have a p-value of less than .05, claiming that other results will appear in subsequent publications. Even though we are sincere, when the results are disconnected from each other the reader is unable to easily interpret the true credibility of the reported findings. Five out of five significant results are very different than five out of 100.

The best solution to this problem is the same solution that underlies any preregistration: planning. Clearly state ahead of time which subset of results will be reported together. This guarantees that our results can be comprehensively reported. Even if we never get all the publications out of the project that we want, each individual set of results will be reported based on the grouping determine ahead of time, before the results could bias that decision.

Alternatively, it could also be beneficial to make more than one preregistration, even if the data collection effort is shared. If each preregistration is connected to the related ones (for example, they could be part of the same OSF project), this will help clarify the relationship between them. Then, each study simply needs to report the results of the analyses specified in the corresponding preregistration. Disclosing the existence of related research efforts is also helpful for providing the full context of the preregistration. For example, if we preregister work that analyzes a common dataset with two collaborators, note that in the preregistration and link to any other preregistration; this enhances the discoverability of all related analyses.

Finally, a reminder about the three rules for reporting the results of preregistered work. First, include a link to the preregistration in any resulting article. Second, report the results of all preregistered analyses (or if the preregistration specifies which results will be reported separately, note that). Third, any unexpected trends or discoveries that you make that were not part of your preregistration should be reported and described as preliminary findings that deserve additional confirmation.

*Commonly, just the final study will be preregistered, though we encourage preregistration prior to any data collection effort.

Recent Posts