by jpfeifer » Thu Aug 21, 2014 4:55 pm
They are not set, but deduced. The Kalman smoother works backwards in time and finally ends up estimating the best guess for x_{0|T}, that is the best estimate for states at time 0 given all the observed data. In general, the estimate will be different from the steady state, because the data did not start in steady state.
Consider a two-period model where you have a mean 0 AR(1) process with autocorrelation 0.95 and a standard normal shock. Assume you observe a value of 3 in the second period and you do not know the initial value. What is more likely? That a three-sigma shock happened and the system was at steady state initially? Or that a small shock happened and the system was already above 0 when time started (essentially meaning there were positive shocks before time started and due to the persistence those effects can still be seen today)? The Kalman smoother for its class of problems optimally trades off these two considerations. If persistence is not high, the deviation from steady state due to the initial state will die out quickly.