I just need some help hence why I can't show any working, I just am not sure where to begin....
Perhaps taking a step backwards . . .
Suppose you have a MODEL for a process, but you can't predict the coefficients of the model. For instance, you believe
y should be proportional to A, but with a correction proportional to A^2, and another term depending on Z. Your model is
y=β1×A+β2×A2+β3×Z
So you set up an experiment to measure y for as wide a range of the parameters (A,Z) as you can manage. Note that A and Z are either controlled, or are found as part of the measurement. In either case they are known "perfectly" without any uncertainty. All of the uncertainty of each measurement is associated with
y, and is usually expressed as a standard deviation,
u.
1st measurement:
y1=β1A1+β2A12+β3Z1+u1
2nd
....................y2=β1A2+β2A22+β3Z2+u2
. . .
jth
......................yj=β1Aj+β2Aj2+β3Zj+uj
The next task is to find a set of coefficients
(β1 β2 β3) that make a "best fit" of the model to the data. One of the most frequently used procedures is "ordinary least squares," in which the sum of the squares of the differences between the measured and predicted values of
y is minimized. ["Ordinary" means the uncertainties
u are not propagated - my own preference is to do a "weighted" least squares with each datum weighted as
1/uj2.]
Let
.........S=j=1∑N(yj−y^)2=j=1∑N(yj−β1Aj−β2Aj2−β3Zj)2
then set
..∂βi∂S=0 for i=1 to k
which is now a set of
k equations in
k unknowns. The questions you are being asked have to do with the properties of this system of equations.
Does it help to start with a somewhat more explicit model?