\documentclass{article}
\usepackage[pdftex]{graphicx}
\usepackage{amsmath}
\usepackage{Sweave}
\addtolength{\textwidth}{1in}
\addtolength{\oddsidemargin}{-.5in}
\setlength{\evensidemargin}{\oddsidemargin}

\newcommand{\myfig}[1]{\resizebox{0.85\textwidth}{!}
                        {\includegraphics{#1.pdf}}}
\newcommand{\code}[1]{\texttt{#1}}
\newcommand{\sign}{{\rm sign}}
\SweaveOpts{keep.source=TRUE, width=7, height=5.2}
%\VignetteIndexEntry{Deming, Theil-Sen, and Passing-Bablok Regression}
%\VignetteDepends{deming}

\DefineVerbatimEnvironment{Sinput}{Verbatim}{xleftmargin=2em}
\DefineVerbatimEnvironment{Soutput}{Verbatim}{xleftmargin=2em}
\DefineVerbatimEnvironment{Scode}{Verbatim}{xleftmargin=2em}
\fvset{listparameters={\setlength{\topsep}{0pt}}}
\renewenvironment{Schunk}{\vspace{\topsep}}{\vspace{\topsep}}

\title{Total Least Squares: Deming, Theil-Sen, and Passing-Bablok Regression}
\author{Terry Therneau\\ Mayo Clinic}

\begin{document}
\maketitle

<<echo=FALSE>>=
library(deming)
options(continue=" ", width=60)
options(SweaveHooks=list(fig=function() par(mar=c(5.1, 4.1, .3, 1.1))))
@

\section{Introduction}
The methods in the \emph{deming} package are concerned with the problem of
comparing two assays, both of which are measured with error.
Let $x_i$ and $y_i$ be the two measurements of a compound whose true value
is $u_i$, and assume that both assays are linear:
\begin{align}
  x_i &= a + bu_i + \epsilon_i \label{basic1}\\
  y_i &= c + du_i + \delta_i \label{basic2}
\end{align}
where $\epsilon$ and $\delta$ are the errors.
We would like to find the calibration equation $y = \alpha + \beta x$ that
best maps between the two assays.

In this situation ordinary least squares applied to $x$ and $y$ is
unsatisfactory since it is asymmetric.
The fitted lines for $y \sim x$ and $x \sim y$ are not the same, and neither
has an expected slope of 1 when $\beta=1$.

\begin{figure}[t]
<<f1, fig=TRUE, echo=FALSE>>=
tdata <- data.frame(x=1:6, y=c(2.3,1.3, 4.1,3.5,6.3, 3))
lfit  <- lm(y ~ x, tdata)      # y on x
dfit  <- deming(y ~ x, tdata)  # Deming
lfit2 <- lm(x ~ y, tdata)      # x on y
with(tdata, plot(x, y, xlim=c(0,7), ylim=c(1,7)))
abline(lfit)
abline(-lfit2$coef[1]/lfit2$coef[2], 1/lfit2$coef[2], col=4, lty=2)
abline(dfit, col=2, lwd=2)
segments(tdata$x, tdata$y, tdata$x, predict(lfit), col=1, lty=1)
segments(tdata$x, tdata$y, predict(lfit2), tdata$y, col=4, lty=2)
@
\caption{Example of linear and Deming regression applied to a simple data
  set.  The ordinary linear regression of y on x (black) minimizes the sum of
  squared vertical distances.  The regression of x on y (blue, dashed)
  minimizes the sum of squared horizontal distances.  The Deming regression
  (red) minimizes the sum of squared orthogonal distances between the points
  and the line.}
\label{f1}
\end{figure}

Least squares regression of $y$ on $x$ assumes that the $x$ variate is
measured without error, and minimizes the sum of squared vertical distances
between the data points $y$ and the fitted regression line.
Regression of $x$ on $y$ minimizes the horizontal distances.
Adcock \cite{Adcock78} in 1878 suggested minimizing the sum of squared
horizontal plus vertical distances to the predicted values.
Adcock's idea remained largely unnoticed for more than 50 years, however,
until it was widely propagated in the book by Deming \cite{Deming43}.
The book has become so well known that ``Deming regression'' is now a common
label for the method in many fields.
Figure \ref{f1} shows a typical case.
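The attenuation of the ordinary least squares slope is easy to verify by
simulation.  The short sketch below is purely illustrative (the sample size,
error standard deviations, and random seed are arbitrary choices, not part of
the package examples): both assays equal the true value plus independent
noise, so the true calibration slope is 1.
<<>>=
set.seed(1953)                               # arbitrary seed, for reproducibility
sim   <- data.frame(u = runif(200, 1, 10))   # true values
sim$x <- sim$u + rnorm(200, sd=1)            # assay 1 = truth + error
sim$y <- sim$u + rnorm(200, sd=1)            # assay 2 = truth + error
slopes <- c(coef(lm(y ~ x, sim))[2],         # regression of y on x: biased below 1
            1/coef(lm(x ~ y, sim))[2],       # slope implied by x on y: biased above 1
            coef(deming(y ~ x, sim))[2])     # Deming: approximately 1
names(slopes) <- c("y on x", "x on y (inverted)", "Deming")
round(slopes, 3)
@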
An almost entirely separate discussion of the same issue is found under the
label ``total least squares'' (TLS), which is where one will find most of the
modern literature on this topic.
Markovsky and Van Huffel \cite{Markovsky07} present a good overview of the
area, which has a rich literature of algorithms and extensions.
They catalog multiple discoveries of the approach across different fields.
(Interestingly, Deming is not listed in their bibliography.)

The code for figure \ref{f1} is shown below.
<<eval=FALSE>>=
tdata <- data.frame(x=1:6, y=c(2.3,1.3, 4.1,3.5,6.3, 3))
lfit  <- lm(y ~ x, tdata)      # y on x
dfit  <- deming(y ~ x, tdata)  # Deming
lfit2 <- lm(x ~ y, tdata)      # x on y
with(tdata, plot(x, y, xlim=c(0,7), ylim=c(1,7)))
abline(lfit)
abline(-lfit2$coef[1]/lfit2$coef[2], 1/lfit2$coef[2], col=4, lty=2)
abline(dfit, col=2, lwd=2)
segments(tdata$x, tdata$y, tdata$x, predict(lfit), col=1, lty=1)
segments(tdata$x, tdata$y, predict(lfit2), tdata$y, col=4, lty=2)
@

\section{Generalized Deming regression}
There are a number of alternate ways to compute the Deming regression line.
The Deming line is determined by the first principal component of the
centered data: equivalently, the first eigenvector of $Z'Z$, where $Z$ is the
matrix whose two columns are the centered $x$ and $y$ vectors, or the first
component of a singular value decomposition or factor analysis of $Z$.
A partial least squares (PLS) or structural equation modeling (SEM) model fit
to $x$ and $y$ will also recover the Deming estimate of slope.

In the TLS literature both $X$ and $Y$ can be matrices, and the most
straightforward approach is to obtain the singular value decomposition
$$ (X, Y) = UDV' $$
where $D$ is the diagonal matrix of singular values.
Assume $X$ is of dimension $n$ by $p$ and partition $V$ as
\begin{equation}
  V = \left[ \begin{array}{cc} V_{11} & V_{12}\\
                               V_{21} & V_{22} \end{array} \right]
\end{equation}
where $V_{11}$ has $p$ rows and columns.
Then if $V_{22}$ is non-singular a solution exists, with
$\hat\beta = -V_{12}V_{22}^{-1}$.
A solution will not exist when a coefficient is infinite, i.e., if the best
fitting line is vertical in one of the $p$ dimensions.
The solution will be unique if $d_p > d_{p+1}$.
The classic counterexample to uniqueness is data that lies on a circle: every
regression line explains the same amount of variance, and $d_p = d_{p+1}$.
There would appear to be little need for yet another program to compute this
quantity, other than to provide a recognizable name to search for among the R
libraries.

For laboratory work, however, it is the generalized Deming method that is of
most interest.
Returning to our original definitions \eqref{basic1} and \eqref{basic2},
ordinary Deming regression is based on the assumption that the assay errors
$\epsilon$ and $\delta$ are equal in magnitude for the two assays and are
constant across the range of $u$.
The latter is rarely if ever true for biologic assays, and both assumptions
will normally be false in more general applications.

\begin{figure}
  \myfig{deming-f2}
  \caption{Bland-Altman plot of the ferritin data.}
  \label{ferritinb}
\end{figure}

Figure \ref{ferritinb} shows a Bland-Altman plot of paired assay results from
long-term monitoring of a ferritin assay.
Each time that a new lot of the principal reagent was brought into use, a
subset of currently available samples was assayed in duplicate using both the
old and new lots.
If the assumptions of standard Deming regression hold, we would expect to see
approximately constant vertical variation across the range of the horizontal
axis of the plot.
This is clearly not the case.
(The x-axis was plotted on a square root scale to spread out the data
somewhat, but this does not change the message.)
<<f2, fig=TRUE, include=FALSE>>=
f.ave  <- with(ferritin, (old.lot + new.lot)/2)
f.diff <- with(ferritin, old.lot - new.lot)
plot(sqrt(f.ave), f.diff, xaxt='n', xlab="Average", ylab="Difference")
temp <- 0:7*5
axis(1, temp, temp^2)
@

\begin{figure}
  \myfig{deming-f3}
  \caption{Revised variance plot for the ferritin data, on a coefficient of
    variation scale.}
  \label{ferritinb2}
\end{figure}

Figure \ref{ferritinb2} shows a revised plot with the average of the two
assay values on the horizontal axis and \code{abs(difference/mean)} on the
vertical axis, along with a lowess line.
A horizontal trend in this plot corresponds to a constant coefficient of
variation, which for this data set appears to be a reasonable assumption.
<<f3, fig=TRUE, include=FALSE>>=
plot(f.ave, abs(f.diff/f.ave), log='x',
     xlab="Average", ylab="Estimated CV")
lines(lowess(f.ave, abs(f.diff/f.ave)), col=2)
@

Linnet \cite{Linnet90} discusses fitting regression lines in the situation of
a constant coefficient of variation, and gives a more complete rationale.
We use an algorithm based on Ripley and Thompson \cite{Ripley87}, which
includes both ordinary Deming regression and Linnet's extension within a more
general framework.
Referring again to equations \eqref{basic1} and \eqref{basic2}, assume that
$x$ and $y$ both estimate the common unknown quantity $u$, and that the error
terms have standard deviations
\begin{align}
  {\rm sd}(x) &= \sigma[e + fu] \\
  {\rm sd}(y) &= \sigma[g + hu]
\end{align}
for known constants $e$, $f$, $g$, and $h$ and an unknown scale factor
$\sigma$, where $u$ is again the true value.
A value of $(e,f,g,h) = (1,0,1,0)$ corresponds to standard Deming regression,
and $(e,f,g,h) = (0,1,0,1)$ corresponds to the constant proportional errors
assumption of Linnet.
The \code{cv} argument of the \code{deming} function chooses between these
two cases, or all four constants can be supplied using the \code{stdpat}
argument.
A second alternative is for the user to directly supply values of
${\rm sd}(x)$ and ${\rm sd}(y)$ for each data point using the \code{xstd} and
\code{ystd} arguments.

The following produces a calibration equation for each of the 7 reagent
changes in the ferritin data set.
<<>>=
cmat <- matrix(0, nrow=3, ncol=7)
for (i in 1:7) {
    dfit <- deming(new.lot ~ old.lot, data=ferritin,
                   subset=(period==i), cv=TRUE)
    cmat[1:2,i] <- coef(dfit)
    cmat[3,i] <- coef(lm(new.lot ~ old.lot, ferritin, subset=(period==i),
                         weights=1/new.lot))[2]
}
dimnames(cmat) <- list(c("Intercept", "old.lot", "old.lot (LS)"), 1:7)
round(cmat,3)
@

For unweighted regression the Deming slope is always larger (in absolute
value) than the least squares slope, but in the constant CV case the
difference can go in either direction.
The difference in regression slopes for any given batch is small, but
corrections to the clinical assay must be cumulative over time.
From the first regression equation, results from assays after the first lot
change need to be modified with
\begin{center}
  corrected result = \Sexpr{round(cmat[1,1],3)} +
     \Sexpr{round(cmat[2,1], 3)} * value
\end{center}
in order to have them match prior reports.
Matching is important since a given patient may be followed sequentially over
many years.
The second assay change compounds this:
\begin{center}
  corrected result = \Sexpr{round(cmat[1,1],3)} + \Sexpr{round(cmat[2,1], 3)} *
     ( \Sexpr{round(cmat[1,2],3)} + \Sexpr{round(cmat[2,2], 3)} * value)
\end{center}
The cumulative effect under the Deming fits has a slope coefficient of
\Sexpr{round(prod(cmat[2,]), 3)}, the product of the 7 slopes, an estimated
loss in potency of about 21\%.

When the data has both a wide range and results near zero, it will often be
necessary for the error model to include both a constant and a proportional
component.
The arsenate data set contains results of two different methods for the
assessment of arsenate(V) in river waters; the resulting estimates range from
0 to 19.25 $\mu$g/l.
Constant proportional error (constant CV) is clearly untenable, since it
would predict infinite precision for the smallest values.
This data set contains an estimate of the precision of each point, which we
can use to obtain an appropriate fit.
<<>>=
afit <- deming(aas ~ aes, arsenate, xstd=se.aes, ystd=se.aas)
afit

dfit <- deming(aas ~ aes, arsenate)
lfit <- lm(aas ~ aes, arsenate)
temp <- cbind(coef(afit), coef(dfit), coef(lfit))
dimnames(temp)[[2]] <- c("weighted Deming", "unweighted Deming", "Linear")
round(temp,3)
@
For values less than 0.3 (about 10\% of the data) the constant part of the
error predominates, while for those above 2 the proportional part dominates.
Calibration fits that do and do not properly account for this error structure
differ by important amounts.

\section{Theil-Sen Regression}
One interesting way to characterize the slope of the least squares regression
line is that it is the solution of $\rho(x, r(\beta)) = 0$, where $\rho$ is
the Pearson correlation coefficient and $r(\beta)$ are the residuals from a
fitted line with slope $\beta$.
A non-parametric counterpart to this is Theil-Sen regression, which satisfies
$\tau(x, r(\beta)) = 0$ where $\tau$ is Kendall's tau, a rank-based
alternative to the correlation coefficient.
This was proposed by Theil \cite{Theil50}; Sen \cite{Sen68} extended the
results and added a confidence interval estimate.
The approach is well known in selected fields (e.g.\ astronomy), and almost
completely unknown in others.
It has strong resistance to outliers and high efficiency relative to linear
regression when the errors are Gaussian.

The standard way to calculate the Theil-Sen (TS) estimate is to first draw a
line segment between each of the $n(n-1)/2$ unique pairs of points in the
data; the TS slope estimate is the median of these $n(n-1)/2$ slope values.
Once the slope is established the intercept is chosen so that the median
residual is zero.

\begin{figure}
  \myfig{deming-f4}
  \caption{The geometry underlying the Theil-Sen estimator.  The set of
    values $x_i-x_j$ is plotted versus $y_i-y_j$ for all $i \ne j$, along
    with a vertical reference line $x=0$.  The red line, together with the
    reference line, divides the points into four equal groups; its slope is
    the Theil-Sen estimate.}
  \label{f4}
\end{figure}

<<f4, fig=TRUE, include=FALSE>>=
tdata <- data.frame(x=1:8, y=c(2,1.2, 4.1,3.5,6.3, 3, 7,6))
xx <- with(tdata, outer(x, x, "-"))
yy <- with(tdata, outer(y, y, "-"))
xx <- c(xx[row(xx) != col(xx)])   # don't compare a point with itself
yy <- c(yy[row(yy) != col(yy)])
plot(xx, yy, xlab="Paired x difference", ylab="Paired y difference")
abline(v=0)
fit <- theilsen(y ~ x, tdata)
abline(0, coef(fit)[2], col=2)
text(c(2 ,4, -2, -3), c(5, -4, -5, 4), c("I", "II", "III", "IV"))
@

Figure \ref{f4} shows a plot of $x_i - x_j$ versus $y_i-y_j$ for all
$8 \times 7 = 56$ ordered pairs from a small set of 8 data points.
A line from the origin to each point in this plot has the same angle as the
line connecting the corresponding pair of points in a plot of the 8 original
$(x,y)$ values.
Each pair of points $i,j$ appears twice in the paired plot, once as
$(x_i-x_j,\; y_i-y_j)$ and once as $(x_j-x_i,\; y_j-y_i)$.
The Theil-Sen estimate of slope is defined by that line through the origin
such that quadrants I--IV of the plot, formed by this line and the vertical
axis, each contain the same number of points.
The solution angle is simply \code{median(atan(dy/dx))}, where \code{dy} and
\code{dx} are the paired $y$ and $x$ differences, respectively; the tangent
of this angle is the estimated slope.
Since $(y_i-y_j)/(x_i-x_j) = (y_j-y_i)/(x_j-x_i)$, the computer program uses
only the $n(n-1)/2$ unique values, removing any that lie exactly on the
vertical axis since they would count equally in two quadrants and thus
cancel.
Theil-Sen regression of $x$ on $y$ would use the horizontal axis, rather than
the vertical, as the second reference line for forming quadrants.

\section{Passing-Bablok Regression}
The Theil-Sen slope, like the ordinary least squares slope, is biased towards
zero if there is error in both $x$ and $y$, and it is not symmetric in $x$
and $y$.
Passing and Bablok proposed variations on the Theil-Sen estimate to address
these concerns.
Their method is well known in the field of laboratory testing but almost
unheard of outside of that domain.
There are actually 3 estimators, proposed in a series of papers in 1983,
1984, and 1988.

\begin{figure}
  \myfig{deming-f5}
  \caption{The geometry underlying the Passing-Bablok estimate.  The set of
    values $x_i-x_j$ is plotted versus $y_i-y_j$ for all $i \ne j$.  A
    reference line with slope $-1$ (black) and a line with the estimated PB
    slope (red) divide the points into four equal groups.}
  \label{f5}
\end{figure}

<<f5, fig=TRUE, include=FALSE>>=
tdata <- data.frame(x=1:8, y=c(2,1.2, 4.1,3.5,6.3, 3, 7,6))
xx <- with(tdata, outer(x, x, "-"))
yy <- with(tdata, outer(y, y, "-"))
xx <- c(xx[row(xx) != col(xx)])   # don't compare a point with itself
yy <- c(yy[row(yy) != col(yy)])
plot(xx, yy, xlab="Paired x difference", ylab="Paired y difference")
abline(0, -1)
fit <- pbreg(y ~ x, tdata)
abline(0, coef(fit)[2], col=2)
text(c(0 ,5, 0, -6), c(5, -1, -5, 2), c("I", "II", "III", "IV"))
@

The first Passing-Bablok method (PB1) is described in their 1983 paper
\cite{Passing83}.
It modifies the Theil-Sen estimate so as to make the procedure symmetric
about the line $y=x$ instead of about the horizontal axis.
Where a Theil-Sen regression of $y$ on $x$ uses the regression line plus the
vertical axis to partition the points, and TS regression of $x$ on $y$ would
use the regression line plus the horizontal axis, the Passing-Bablok method
chooses that regression line such that it and the line $y = -x$ separate the
data points into four equal portions.
This is illustrated in figure \ref{f5}.
Computationally, it suffices to modify the $\arctan$ function so as to return
angles in the range $(-\pi/4, 3\pi/4)$ instead of the default
$(-\pi/2, \pi/2)$.
The kernel of the R code is three lines:
\begin{verbatim}
    theta <- atan(dy/dx)
    theta <- ifelse(theta < -pi/4, theta + pi, theta)
    slope <- tan(median(theta))
\end{verbatim}
where \code{dy} and \code{dx} are the paired differences in $y$ and $x$.
Points where the angle is exactly $-\pi/4$ would count equally in two
quadrants and so can be ignored.
(Since both $x$ and $y$ are measured with error such values should be rare in
real data.)
As with the Theil-Sen estimate, the underlying R routine only evaluates and
uses the $n(n-1)/2$ unique pairs.
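To make this concrete, the sketch below applies the kernel to the unique
paired differences of the small \code{tdata} example used for figures
\ref{f4} and \ref{f5}.  It is an illustration rather than the package's own
code; apart from the handling of ties and of angles falling exactly at
$-\pi/4$, the result should closely agree with the slope reported by
\code{pbreg}.
<<eval=FALSE>>=
tdata <- data.frame(x=1:8, y=c(2,1.2, 4.1,3.5,6.3, 3, 7,6))
dx <- with(tdata, outer(x, x, "-"))        # all paired x differences
dy <- with(tdata, outer(y, y, "-"))        # all paired y differences
keep  <- upper.tri(dx)                     # the n(n-1)/2 unique pairs
theta <- atan(dy[keep]/dx[keep])           # angle of each pairwise segment
theta <- ifelse(theta < -pi/4, theta + pi, theta)  # map into (-pi/4, 3pi/4)
theta <- theta[abs(theta + pi/4) > 1e-8]   # drop angles exactly at -pi/4
c(kernel = tan(median(theta)),
  pbreg  = unname(coef(pbreg(y ~ x, tdata))[2]))
@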
For a two-sided confidence interval Passing and Bablok use the same formula
as that derived for Theil-Sen regression, namely the $k$th angles above and
below the median value, where
\begin{align*}
  k  &= (z_{\alpha/2}/2) \sqrt{V_n} \\
  V_n &= n(n-1)(2n+5)/18
\end{align*}
In the second paper of their series \cite{Passing84} they show that this
method has excellent power, nearly as good as Deming regression when the data
has Gaussian errors, while gaining resistance to outliers.

A second approach to the Passing-Bablok estimate, and one that is more
informative with respect to extending the method, is based on another
property of Deming regression: the slope of the Deming regression line
corresponds to that rotation of the original data such that a least-squares
regression on the rotated data has a slope of zero.
A symmetric Theil-Sen (STS) estimate can then be defined as that rotation of
the original data set such that the Theil-Sen estimate of slope is zero.
A simple iterative algorithm to compute this is to compute the TS estimate,
rotate the original data by the resulting angle, and continue refitting and
rotating until convergence.
Geometrically, the STS estimate corresponds to a pair of orthogonal lines
that partition the points of figure \ref{f4} equally.

The PB1 estimate can be viewed as a one-step approximation to the STS
estimate above: start with a clockwise rotation of $\pi/4$ (45 degrees) and
then do a single iteration of refinement.
Since it is based on a single Theil-Sen regression, the theoretical
justification for the Theil-Sen confidence interval formula translates
directly to the Passing-Bablok estimate.
For all of the data sets considered thus far the STS algorithm converges in 2
or 3 iterations, and the PB1 estimator reaches the same or very nearly the
same value as the fully iterated estimate.

Neither the Deming, STS, nor PB1 estimates of slope are scale invariant.
Starting with a data set whose slope estimate is $\hat\beta$, multiplication
of all the $y$ values by some constant $k$ does not necessarily lead to an
estimated slope of $k \hat\beta$.
In the third paper of their series \cite{Passing88} two further estimators,
PB2 and PB3, are proposed which are scale invariant while still retaining
symmetry in $x$ and $y$.
For the PB2 estimate, first find the value $m$ which is the median of the
angles in the lower right portion of figure \ref{f5}, i.e.\ points with
$dy < 0$ and $dx > 0$.
The estimated regression line is defined such that it and a line of angle $m$
partition the points equally.
It can also be viewed as a one-step STS estimator using $\pi/2 + m$ as the
initial clockwise rotation.

The PB3 estimate is defined by two lines.
Referring again to figure \ref{f4} or \ref{f5}, a pair of lines at angles
$\theta$ and $-\theta$ are opened and shut like a pair of scissors about the
$x$-axis until they evenly partition the data points, and the resulting
$\theta$ is taken as the angle of the estimated regression line.
Passing and Bablok describe an iterative estimation procedure; however, it is
easy to see that \code{median(abs(theta))} provides a direct solution.
The PB3 estimator is not a simple one-step approximation to the symmetric
Theil-Sen (STS) estimate.

\section{Other notes}
Unlike the other estimates found in the package, the STS estimate can have
multiple zeros.
For a data set like the arsenate study, where the overall data clusters
tightly around a line, multiple solutions are uncommon, and when they do
occur they normally form a small, tight cluster of values.
The other extreme is a set of points evenly distributed in a circle about the
origin, for which there will be $n$ solutions.
When multiple solutions occur the program returns the one with the smallest
median absolute deviation (MAD) of the residuals.
The output structure includes an additional component \code{angle} containing
the full set of solutions.

For the PB2, PB3 and STS methods it is not at all certain that the Sen
estimator of the confidence limits is valid.
Since they are iterative, the 1--1 mapping between the slope and Kendall's
tau, which forms the basis for Sen's argument, no longer holds.
Second, extending the Sen variance formula to data with case weights is far
from clear.
The \code{pbreg} and \code{theilsen} routines therefore also include an
option for bootstrap confidence intervals, and we recommend using it whenever
there are case weights or for the STS, PB2, and PB3 estimators.
Due to the excessive number of ties that would be generated by ordinary
bootstrap sampling, the wild bootstrap method \cite{Wu86} is used.

\section{Which method is best?}
\begin{figure}
  \myfig{deming-f7}
  \caption{Ferritin data with outliers, along with OLS (dashed), Deming
    (dotted), and Passing-Bablok (solid) regression lines.}
  \label{f7}
\end{figure}

The two primary advantages of the robust methods in laboratory studies are
that they give a reliable estimate of the slope in the presence of outliers
and that they are less sensitive to the choice of variance specification.
Figure \ref{f7} shows the result on a data set with outliers: one of the two
laboratory methods has had 3 assay failures.
The PB regression line tracks the main body of the data, while the other two
lines are pulled away.
<<f7, fig=TRUE, include=FALSE>>=
plot(new.lot ~ old.lot, data=ferritin2, subset=(period==2),
     xlab="Old lot", ylab="New lot")
dfit <- deming(new.lot ~ old.lot, ferritin2, subset=(period==2), cv=TRUE)
lfit <- lm(new.lot ~ old.lot, ferritin2, subset=(period==2))
pfit <- pbreg(new.lot ~ old.lot, ferritin2, subset=(period==2))
abline(pfit, col=1)
abline(lfit, lty=2)
abline(dfit, lty=3)
@

A discussion by St{\o}ckl, Dewitte, and Thienpont \cite{Stockl98} provides a
useful counterpoint.
Essentially, if the data is good, all of the methods will agree on that fact.
If there are assay issues, outliers in particular, then the actual source of
the problem needs to be investigated rather than just reaching for a
``better'' regression tool.
Understanding data requires more than pushing a button.
They argue further, and I think incorrectly, that ordinary least squares can
suffice.
The ferritin data is a counterexample.
In order to provide long-term calibration of the assay for the purposes of
patient care, the calibration correction used by the lab is the cumulative
composition of the individual regression equations, whose slope is the
product of the individual slopes.
If OLS were used at each stage the downward bias, even though it is small for
any given reagent change, would accumulate over time.

\begin{thebibliography}{9}
\bibitem{Adcock78} Adcock, R. J. (1878).  A problem in least squares.
  The Analyst (Annals of Mathematics) 5:53--54.

\bibitem{Deming43} Deming, W. E. (1943).  Statistical adjustment of data.
  Wiley, NY.

\bibitem{Linnet90} Linnet, K. (1990).  Estimation of the linear relationship
  between the measurements of two methods with proportional errors.
  Statistics in Medicine 9:1463--1473.

\bibitem{Markovsky07} Markovsky, I. and Van Huffel, S. (2007).  Overview of
  total least squares methods.  Signal Processing 87:2283--2302.

\bibitem{Passing83} Passing, H. and Bablok, W. (1983).
  A new biometrical procedure for testing the equality of measurements from
  two different analytical methods.  Application of linear regression
  procedures for method comparison studies in Clinical Chemistry, Part I.
  J. Clin. Chem. Clin. Biochem. 21:709--720.

\bibitem{Passing84} Passing, H. and Bablok, W. (1984).  Comparison of several
  regression procedures for method comparison studies and determination of
  sample size.  Application of linear regression procedures for method
  comparison studies in Clinical Chemistry, Part II.
  J. Clin. Chem. Clin. Biochem. 22:431--435.

\bibitem{Passing88} Bablok, W., Passing, H., Bender, R. and Schneider, B.
  (1988).  A general regression procedure for method transformations.
  Application of linear regression procedures for method comparison studies
  in Clinical Chemistry, Part III.  J. Clin. Chem. Clin. Biochem.
  26:783--790.

\bibitem{Ripley87} Ripley, B. D. and Thompson, M. (1987).  Regression
  techniques for the detection of analytical bias.  Analyst 112:377--383.

\bibitem{Sen68} Sen, P. K. (1968).  Estimates of the regression coefficient
  based on Kendall's tau.  Journal of the American Statistical Association
  63:1379--1389.

\bibitem{Stockl98} St{\o}ckl, D., Dewitte, K. and Thienpont, L. M. (1998).
  Validity of linear regression in method comparison studies: is it limited
  by the statistical model or the quality of the analytical input data?
  Clinical Chemistry 44:2340--2346.

\bibitem{Theil50} Theil, H. (1950).  A rank-invariant method of linear and
  polynomial regression analysis, I, II, III.  Nederl. Akad. Wetensch. Proc.
  53:386--392, 521--525, 1397--1412.

\bibitem{Wu86} Wu, C. F. J. (1986).  Jackknife, bootstrap and other
  resampling methods in regression analysis (with discussion).
  Annals of Statistics 14:1261--1350.
\end{thebibliography}
\end{document}