Measurement and Bayesian Updating - The Bayesian Second Law of Ther- modynamics

The Bayesian Second Law of Ther- modynamics

3.2 Setup

3.2.2 Measurement and Bayesian Updating

The probability density on phase space can also change through Bayesian updates when a measurement is made: the experimenter modifies her probabilities to account for the new information. We will restrict ourselves to measurements performed at time τ, the end of the experiment, though it is simple to extend the results to more general measurement protocols. The measurement outcome is a random variableM that only depends on the state of the system at time τ, not on the prior history of the system.

The measurement is then characterized by the function

P(m|x⁰)≡P(M =m|X_τ =x⁰) (3.2.9)

= probability of measurement outcomem given state x⁰ at time τ . The updated phase space distribution at time τ is obtained by Bayes’s rule, which in this case takes the form

ρτ|m(x⁰)≡P(X_τ =x⁰|M =m) = P(m|x⁰)

P(m) ρ_τ(x⁰). (3.2.10) Here the denominator is P(m) ≡ ^R P(m|y⁰)ρτ(y⁰)dy⁰, and serves as a normalization factor.

If we know the transition function, we can also update the phase space distribution at any other time based on the measurement outcome at timeτ. Below we will make use of the updated initial distribution:

ρ_0|m(x)≡P(X₀ =x|M =m) = ρ₀(x)^R dx⁰ π_F(x→x⁰)P(m|x⁰)

P(m) . (3.2.11)

ρ

₀_|_m

(x) ρ

_τ_|_m

(x

) ρ

_τ

(x

)

ρ

₀

(x) ρ(x) ˜

˜ ρ

(x)

Measurement

Forward

experiment Reverse

experiment

(12) (11)

(15) (53)

(32) (5)

Figure 3.1. Relationships between the various distribution functions we define: the original distributionρ0(x), its time-evolved version ρτ(x⁰), their corresponding Bayesian-updated versionsρ_0|m(x) and ρ_τ|m(x⁰), and the cycled distributions ˜ρ(x) and ˜ρ_m(x) discussed in Sec- tions3.3.2and 3.4.3. Equation numbers refer to where the distributions are related to each other.

This reflects our best information about the initial state of the system given the outcome of the experiment; ρ_0|m(x) is the probability, given the original distribution ρ₀(x) and the measurement outcome m at time t = τ, that the system was in state x at time t= 0. For example, we may initially be ignorant about the value of an exactly conserved quantity. If we measure it at the end of the experiment then we know that it had to have the same value at the start; this could mean a big difference betweenρ₀ and ρ0|m, though often the effects will be more subtle. The various distribution functions we work with are summarized in Figure3.1 and listed in Table 3.1.

Finally, we can update the forward transition functions,

πF|m(x→x⁰)≡P(X_τ =x⁰|X₀ =x, M =m) = πF(x→x⁰)P(m|x⁰)

R dy⁰ π_F(x→y⁰)P(m|y⁰), (3.2.12) and the joint distributions,

P_F|m(x, x⁰)≡P(X₀ =x, X_τ =x⁰|M =m) = P(m|x⁰)

P(m) P_F(x, x⁰) = ρ0|m(x)π_F|m(x→x⁰), (3.2.13) based on the measurement outcome. As we would expect, the updated transition function evolves the updated distribution from the initial to the final time:

ρ_τ|m(x⁰) =

dx ρ_0|m(x)π_F_|m(x→x⁰). (3.2.14)

Distribution Name Definition

ρ₀(x) Initial Distribution 3.2.1

π_F(x→x⁰) Forward Transition Function 3.2.2

ρ_τ(x⁰) Final Distribution 3.2.4

PF(x, x⁰) Joint Forward Distribution 3.2.5

P(m|x) Measurement Function 3.2.9

ρτ|m(x⁰) Updated Final Distribution 3.2.10 ρ0|m(x) Updated Initial Distribution 3.2.11 πF|m(x→x⁰) Updated Forward Transition Function 3.2.12 P_F_|m(x, x⁰) Updated Joint Forward Distribution 3.2.13 π_R(x⁰ →x) Reverse Transition Function 3.2.17 PR(x, x⁰) Joint Reverse Distribution 3.2.18 P_R|m(x, x⁰) Updated Joint Reverse Distribution 3.2.18

ρ(x) Cycled Distribution 3.3.10

ρ_m(x) Updated Cycled Distribution 3.4.17

Table 3.1. List of named probability distributions and their defining equations. These are grouped according to whether they are updated and/or time-reversed.

It may seem odd to update the transition functions based on measurements, since in principle the original transition functions were completely determined by the stochastic dynamics of the system and this is a desirable property that one would like to preserve.

For this reason, the unupdated transition functions will play a special role below, while the updated ones are only used as an intermediate quantity in algebraic manipulations.

To illustrate these definitions, consider a simple toy model: a collection of N independent classical spins, each of which has a fixed probability to flip its state at each timestep. In this model it is most intuitive to work with a distribution function defined on macrostates (total number of up spins) rather than on microstates (ordered sequences of up/down spins).

The distribution functions relevant to our analysis are illustrated for this toy model with N = 100 spins in Fig. 3.2. To make the effects of evolution and updating most clear, we start with a bimodal initial distribution ρ₀(x), uniform on the intervals 0 ≤ x < 10 and 90 < x ≤ 100. The system is evolved for a short time τ, not long enough to attain the equilibrium distribution, which would be a binomial centered at x= N/2 = 50. The final distribution ρ_τ(x⁰) therefore has two small peaks just above and belowx⁰ = 50. We then perform a measurement, which simply asks whether most of the spins are up or down, obtaining the answer “mostly down.” This corresponds to

0 20 40 60 80 100 0.00

0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08

0.09 initial

final

measurement updated initial updated final

Figure 3.2. The various distribution functions illustrated within a toy model of 100 independent spins with a fixed chance of flipping at every timestep. The distributions are normalized functions on the space of the total numberx of up-spins. We consider an initial distribution (thick solid blue line) that is equally split between the intervals x < 10 and 90 < x. The system is evolved for enough time to come close to equilibrium but not quite reach it, as shown by the final distribution (thin solid red line). A measurement is performed, revealing that less than half of the spins are up (dot-dashed purple line). We can therefore update the post-measurement final distribution (dashed red line). The corresponding updated initial distribution (dotted blue line) is similar to the original initial distribution, but with a boost at lowx and a decrease at highx.

a measurement function

P(m|x) =







1 if x≤50,

0 if x >50. (3.2.15)

In Fig.3.2we have plotted the normalized versionP(m|x)/P(m). From this we can con- struct the updated final and updated initial distributions, using (3.2.10) and (3.2.11).

The updated final distribution is just the left half of the non-updated final distribution, suitably renormalized. The updated initial distribution is a re-weighted version of the non-updated initial distribution, indicating that there is a greater probability for the system to have started with very few up spins (which makes sense, since our final measurement found that the spins were mostly down). This toy model does not have especially intricate dynamics, but it suffices to show how our evolution-and-updating procedure works.

Dalam dokumen The Union of Quantum Field Theory and Non-equilibrium Thermodynamics (Halaman 62-66)