Resolution in Partial Wave Analysis

Resolution is the effect of detector. To Consider the resolution properly, We need to take a general look about the detector process. We can divide the process of detector into two parts. The first part is acceptance, with the probability for truth value \(x\) as \(\epsilon_{T} (x)\). The second part is resolution, it means the measurement value \(y\) will be a random number base on truth value \(x\). It is a conditional probability as \(R_{T}(y|x)\). The conditional probability is normalized as \(\int R_{T}(y|x) \mathrm{d} y = 1\). So, the total effect of detector is transition function

\[T(x,y) = R_{T}(y|x)\epsilon_{T} (x).\]

When we have a distribution of truth value with probability \(p(x)\), then we can get the distribution of measurement value with probability

\[p'(y)= \int p(x) T(x,y) \mathrm{d} x.\]

Using the Bayes Rule, we can rewrite \(T(x,y)\) as

\[T(x,y) = R(x|y) \epsilon_{R}(y),\]

where

\[\epsilon_{R}(y) = \int T(x,y) \mathrm{d} x, \ R(x|y) = \frac{T(x,y)}{\epsilon_{R}(y)}.\]

\(R(x|y)\) is the posterior probability, that means the probability of a certain \(y\) is from \(x\). \(\epsilon_{R}(y)\) is the projection of \(y\) for \(T(x,y)\), and is also the normalize factor for \(R(x|y)\).

Then, the probability \(p'(y)\) can be rewritten as

\[p'(y) = \epsilon_{R}(y) \int p(x) R(x|y) \mathrm{d} x.\]

To consider the resolution, we need to determine \(R(x|y)\). Generally, we use simulation to determine \(R(x|y)\). When \(p(x)=1\) is a flat distribution, then the joint distribution of \(x\) and \(y\) has the probability density \(T(x,y)\). We can build a model for this distribution. To get \(R(x|y)\), we only need to do a normalization for \(T(x,y)\).

In PWA, we usually use the MC to do the normalization for signal probability density. We need to calculate the integration of \(p'(y)\) as

\[\int p'(y) \mathrm{d} y = \int p(x) \epsilon_{T} (x) \int R_{T}(y|x) \mathrm{d} y \mathrm{d} x = \int p(x) \epsilon_{T} (x) \mathrm{d} x.\]

The final negative log-likelihood with considering resolution is

\[- \ln L = -\sum \ln \frac{p'(y)}{\int p'(y) \mathrm{d}y} = -\sum \ln \frac{\int p(x) R(x|y) \mathrm{d} x}{ \int p(x) \epsilon_{T} (x) \mathrm{d} x } - \sum \ln \epsilon_{R}(y).\]

The last part is a constant, we can ignore it in fit. In the numerical form, it can be written as

\[- \ln L = -\sum \ln \frac{1}{M}\sum_{x \sim R(x|y)} p(x) + N \ln \sum_{x \sim \epsilon_{T}(x)} p(x).\]

For the second part, which we already have MC sample with \(x \sim \epsilon_{T}(x)\), we can use MC sample to do the sum directly. For the first part, we can generate some \(x\) (\(M\) times) for every \(y\) (\(N\) events). Using the generated samples (\(MN\) events), we can calculate though the summation.

In addition, we can insert some importance information for the summation as

\[\int p(x) R(x|y) \mathrm{d} x \approx \frac{1}{\sum w_i} \sum_{x\sim \frac{R(x|y)}{w_i(x)}} w_i p(x).\]

We need to keep the normalization. For example, we can use Gauss-Hermite quadrature.

In a simple situation, we only use mass for the variable for resolution function. We can build the datasets by replacing the mass by random number based on the resolution function, keeping the same for other variables and using some constrains.

Once we get such datasets, we can use the likelihood method to fit the dataset with resolution. There is an example in checks.