Health policy evaluations estimate the response of population aggregate outcomes to interventions. However, clarity on the form of the expected causal relationship, the parameter identification strategy, and the mode of hypothesis testing is required to overcome a number of conceptual and methodological problems. ⋯ We discuss the identification options and show the sensitivity of estimates of the response function to different specifications of the stochastic and intervention components and to different modes of inference. Model misspecification is demonstrated by rolling Chow tests for structural breaks in repeated observations.