Talk:Least absolute deviations
This article is rated Start-class on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||||||
|
This article links to one or more target anchors that no longer exist.
Please help fix the broken anchors. You can remove this template after fixing the problems. | Reporting errors |
Least absolute deviations
edit(This contrib copied from Wikipedia talk:WikiProject Statistics) Melcombe (talk) 09:40, 9 March 2010 (UTC)
The article about Least absolute deviations (LAD), in the section "Solving methods", omits a simple transformation that casts LAD problems as Linear Programs (LP), which can in turn be reliably and efficiently solved by general purpose LP packages (for the transformation, see p. 294 of Boyd and Vandenberghe's book "Convex Optimization", freely available at http://www.stanford.edu/~boyd/cvxbook/). Most people would be better served by doing this simple and intuitive transformation and then applying one of the many Linear Programming packages available instead of trying to code their own solution based on Barrodale and Roberts' paper.
I have never contributed to Wikipedia before, so I don't know what is the "adequate" way of doing this. I don't know if I have to ask permission from someone, so I decided to post here before and see if there was any feedback. I would be glad to write this myself.
Reference to quantile regression article?
editFrom what I understand, least absolute deviation is equivalent to a special case of quantile regression (with 0.5-th quantile). At the moment, there is no reference to the "Quantile Regression" article (nor there is a reference from the latter).
Moreover, I am surprised that the article does not mention that what you are actually modelling with LAD is the median (as opposed to the mean, as in least squares). Am I missing something?
Finally, while the reduction to linear programming is excellent (clean and simple), it is not clear, why is it so much different from the reduction to LP in the "Quantile Regression" article. 84.52.37.109 (talk) 20:47, 21 November 2011 (UTC)
It might be that some analytical communities are using the term LAD to refer to a model of the median. However, I would not presume that this is the dominant use of the term LAD, since there is much analysis in which LAD is just modelling to the mean, that is, taking the deviations from the mean. Kitpuppy (talk) 13:38, 13 July 2014 (UTC)
Before reading this discussion I had already put the relation to quantile regression into the section on "Variations, extensions, specializations". I hope that is o.k. Delius (talk) 15:58, 7 May 2021 (UTC)
Does any reliable source claim that LAD "ignores" outliers. See Gorard papers.
editBased on one reading of Gorard (author of hundreds of papers and over a dozen books), it seems hard to believe that the following is reliable, educated and correct:
quote: Least absolute deviations is robust in that it is resistant to outliers in the data. This may be helpful in studies where outliers may be safely and effectively ignored. If it is important to pay attention to any and all outliers, the method of least squares is a better choice.
This obviously implies that LAD strips out from its sum of residuals those that come from outliers. On what is the phrase "safely ignored" based? Who is saying that LAD ignores residuals from outliers? Reading Gorard's published avaialable academic papers, links below, it seems mistaken. In reading Gorard or any textbook, when OLS squares the errors, it is giving more weight to the observations that have greater errors (if the error is 3, the square is 09); that is, it is giving more weight to outliers.
Since LAD does not do that (it is called absolute deviation), it may be more robust, as the article rightly notes, but why? who would claiming that LAD is more robust because it ignores outliers? What reliable sources advocate removing the outliers in a process they are trying to call LAD? Gorard suggests LAD may be more robust because it gives outliers that same importance as all observations. OLS might be less robust not because it gives outliers equal importance, but because it gives outliers inflated weight.
http://www.econ.uiuc.edu/~roger/research/rq/QRJEP.pdf
http://www.leeds.ac.uk/educol/documents/00003759.htm Kitpuppy (talk) 13:46, 13 July 2014 (UTC)