


Hi Stephen,
I would expect that bar(Q) is a noisier statistic than Q since it contains the variance of both Q and the normalisation of the weights and for this reason is less powerful for testing the null hypothesis. The first analogy that occurred to me was in the context of the standard importance sampling estimate which can be computed with or without normalisation of the weights. Although normalisation seems intuitively like it should be better it is easy to show that its variance is typically greater, as in e.g. Hesterberg 1992/4/5 (“Weighted Average Importance Sampling and Defensive …”).
Hope that steers you in a useful direction,
cheers,
Ewan.

Hello Ewan,
Thank you for pointing me in the right direction! I will take a look at the suggested paper.
Best,
Stephen