source: sasview/src/sas/sasgui/perspectives/fitting/media/residuals_help.rst @ 84ac3f1

magnetic_scattrelease-4.2.2ticket-1009ticket-1094-headlessticket-1242-2d-resolutionticket-1243ticket-1249unittest-saveload
Last change on this file since 84ac3f1 was 84ac3f1, checked in by butler, 6 years ago

More Edits

Clean up new documentation as well as fix older incorrect documention on
simultaneous/constrained fitting.

  • Property mode set to 100644
File size: 3.6 KB
RevLine 
[ad2ce4e]1.. residuals_help.rst
2
3
4.. ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ
5
6.. _Assessing_Fit_Quality:
7
8Assessing Fit Quality
9---------------------
10
11When performing model-fits to some experimental data it is helpful to be able to
12gauge how good an individual fit is, how it compares to a fit of the *same model*
13*to another set of data*, or how it compares to a fit of a *different model to the*
14*same data*.
15
16One way is obviously to just inspect the graph of the experimental data and to
17see how closely (or not!) the 'theory' calculation matches it. But *SasView*
18also provides two other measures of the quality of a fit:
19
[5ed76f8]20*  $\chi^2$ (or 'Chi2'; pronounced 'chi-squared')
[ad2ce4e]21*  *Residuals*
22
23.. ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ
24
25Chi2
26^^^^
27
[99ded31]28$\chi^2$ is a statistical parameter that quantifies the differences between
[a7c6f38]29an observed data set and an expected dataset (or 'theory') calculated as
[ad2ce4e]30
[5ed76f8]31.. math::
[ad2ce4e]32
[a7c6f38]33  \chi^2
34  =  \sum[(Y_i - \mathrm{theory}_i)^2 / \mathrm{error}_i^2]
[5ed76f8]35
[84ac3f1]36Fitting typically minimizes the value of $\chi^2$.  For assessing the quality of
37the model and its "fit" however, *SasView* displays the traditional reduced
38$\chi^2_R$ which normalizes this parameter by dividing it by the number of
39degrees of freedom (or DOF). The DOF is the number of data points being
40considered, $N_\mathrm{pts}$, reduced by the number of free (i.e. fitted)
41parameters, $N_\mathrm{par}$. Note that model parameters that are kept fixed do
42*not* contribute to the DOF (they are not "free"). This reduced value is then
43given as
[ad2ce4e]44
[99ded31]45.. math::
46
47  \chi^2_R
48  =  \sum[(Y_i - \mathrm{theory}_i)^2 / \mathrm{error}_i^2]
49  / [N_\mathrm{pts} - N_\mathrm{par}]
50
[84ac3f1]51Note that this means the displayed value will vary depending on the number of
52parameters used in the fit. In particular, when doing a calculation without a
53fit (e.g. manually changing a parameter) the DOF will now equal $N_\mathrm{pts}$
54and the $\chi^2_R$ will be the smallest possible for that combination of model,
55data set, and set of parameter values.
[a7c6f38]56
57When $N_\mathrm{pts} \gg N_\mathrm{par}$ as it should for proper fitting, the
58value of the reduced $\chi^2_R$ will not change very much.
[ad2ce4e]59
[99ded31]60For a good fit, $\chi^2_R$ tends to 1.
61
62$\chi^2_R$ is sometimes referred to as the 'goodness-of-fit' parameter.
[ad2ce4e]63
64.. ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ
65
66Residuals
67^^^^^^^^^
68
69A residual is the difference between an observed value and an estimate of that
[99ded31]70value, such as a 'theory' calculation (whereas the difference between an
71observed value and its *true* value is its error).
[ad2ce4e]72
[5ed76f8]73*SasView* calculates 'normalized residuals', $R_i$, for each data point in the
[ad2ce4e]74fit:
75
[5ed76f8]76.. math::
77
[99ded31]78  R_i = (Y_i - \mathrm{theory}_i) / \mathrm{error}_i
79
80Think of each normalized residual as the number of standard deviations
81between the measured value and the theory.  For a good fit, 68% of $R_i$
82will be within one standard deviation, which will show up in the Residuals
83plot as $R_i$ values between $-1$ and $+1$.  Almost all the values should
84be between $-3$ and $+3$.
[ad2ce4e]85
[99ded31]86Residuals values larger than $\pm 3$ indicate that the model
87is not fit correctly, the wrong model was chosen (e.g., because there is
88more than one phase in your system), or there are problems in
89the data reduction.  Since the goodness of fit is calculated from the
90sum-squared residuals, these extreme values will drive the choice of fit
91parameters.  Any uncertainties calculated for the fitting parameters will
92be meaningless.
[ad2ce4e]93
94.. ZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZZ
95
[99ded31]96*Document History*
97
98| 2015-06-08 Steve King
[84ac3f1]99| 2017-09-28 Paul Kienzle
100| 2018-03-04 Paul Butler
Note: See TracBrowser for help on using the repository browser.