Negative Controls and Boring Results

May 7, 2026 · Benjamin Kramer

research

I have started to trust boring results more than exciting ones. That sounds obvious, but it is harder to practice than it is to say. Exciting results create momentum. They make a figure feel alive. They give a paper a center of gravity. Boring results mostly slow everything down.

Negative controls are a good example. Nobody is excited to run the analysis that should produce nothing. It rarely becomes a main figure. It does not usually give you a sentence that sounds good in an abstract. But it is one of the cleanest ways to ask whether the pipeline is hallucinating structure.

In observational biomedical data, this matters a lot. If a model finds an association where biology gives you no plausible reason to expect one, that does not automatically invalidate the whole analysis. But it does change the burden of proof. It tells you that some combination of ascertainment, coding, selection, or model structure may be generating signal. That is not a nuisance detail. That is the difference between a result and an artifact.

The difficulty is that negative controls are only useful if they are chosen carefully. A lazy negative control is just a ritual. It makes the methods section look more responsible without actually testing the failure mode you care about. The control has to be connected to a specific worry. If the worry is healthcare utilization, choose an outcome that should reflect utilization but not the proposed biological mechanism. If the worry is family structure, use a comparison that should preserve the confounding but remove the exposure logic.

I think this is one reason computational biology can drift into overconfident language. The models are complex, the datasets are large, and the outputs are visually convincing. But the credibility of the result often depends on very plain checks: Does the effect survive an alternative phenotype definition? Does it appear in a place where it should not? Does it change when the comparison is made within families? Does the direction match anything known?

The best analyses are not the ones where every result is dramatic. They are the ones where the boring checks make the dramatic result harder to dismiss. That is the standard I am trying to hold myself to, especially in claims-data work. A surprising association is useful only after it survives the possibility that the pipeline was built to find surprises.

← Back to Writing