Talk:Likelihood-ratio test

Learn more about this page

This is the talk page for discussing improvements to the Likelihood-ratio test article.
This is not a forum for general discussion of the article's subject.

Put new text under old text. Click here to start a new topic.
New to Wikipedia? Welcome! Learn to edit; get help.

Article policies

Find sources: Google (books · news · scholar · free images · WP refs) · FENS · JSTOR · TWL

Statistics High‑importance

	This article is within the scope of WikiProject Statistics, a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.StatisticsWikipedia:WikiProject StatisticsTemplate:WikiProject StatisticsStatistics articles
High	This article has been rated as High-importance on the importance scale.

Mathematics High‑priority

	Mathematics portal This article is within the scope of WikiProject Mathematics, a collaborative effort to improve the coverage of mathematics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.MathematicsWikipedia:WikiProject MathematicsTemplate:WikiProject Mathematicsmathematics articles
High	This article has been rated as High-priority on the project's priority scale.

Economics Mid‑importance

	Business and economics portal This article is within the scope of WikiProject Economics, a collaborative effort to improve the coverage of Economics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.EconomicsWikipedia:WikiProject EconomicsTemplate:WikiProject EconomicsEconomics articles
Mid	This article has been rated as Mid-importance on the project's importance scale.

Added General Audience Introduction and Created Examples Contents

Latest comment: 14 years ago4 comments4 people in discussion

The instructions for creating less technical articles suggest starting with a simplier explanation upfront and then get into the technical details later. With a table of contents, the instructions indicate that provides for having something accessible for those that haven't extensively studied this topic; while at the same time, leaving a meaty article for those interested in something more sophisticated. I dont know if I pulled it off perfectly, but I think it improves the article in a way in which who ever put the "to technical" banner would approve.

At the same time I moved the example into its own contents tab to seperate it from the theory portion.

Jeremiahrounds 18:59, 20 June 2007 (UTC)Reply

Would it be possible for any one to add a proof of why the test follows a chi squared distribution ? —Preceding unsigned comment added by Thedreamshaper (talk • contribs) 20:52, 17 February 2010 (UTC)Reply

I think the introduction might benifit from a re-write, perhaps this formula would be more appropriate than the asymptotic version: $\Lambda (x)={\frac {\sup\{\,L(\theta \mid x):\theta \in \Theta _{0}\,\}}{\sup\{\,L(\theta \mid x):\theta \in \Theta \,\}}}.$ --131.111.243.37 (talk) 10:18, 25 May 2010 (UTC)Reply

I added a non-technical description about when these tests arise in practice to the first paragraph. Not an expert, but using this page without something like that was not helpful. —Preceding unsigned comment added by 98.143.103.218 (talk) 04:07, 29 September 2010 (UTC)Reply

difficult take on the likelihood viewpoint?

Latest comment: 17 years ago3 comments3 people in discussion

I believe this essentially obscures the idea here:

\Lambda (x)={\frac {\sup\{\,L(\theta \mid x):\theta \in \Theta _{0}\,\}}{\sup\{\,L(\theta \mid x):\theta \in \Theta \,\}}}.

The likelihood ratio test is the ratio of the probability of the result GIVEN the maximum likelihood estimator in the domain of the null and alternative hypothesis.

The supremums in that equation sort of combine the maximum likelihood method into the theory of likelihood ratios.

I am not making this up. For example, the text Hoel, Introduction of Statistical Theory uses L(x| theta0) / L(x | theta) where each theta is the maximum likelihood estimate applicable to each hypothesis.

You can more simply state it as Hoel does and just note that the thetas are produced by maximum likelihood estimates. So the supremum doesnt need to appear in the theory of likelihood ratios. Then you get a ratio of probabilities that is easier to read and even think about.

I actually initially called the offered equation an error. But that is a bridge to far I think. Putting the supremums in the context where you appear to be maximizing something after the data is taken isnt very useful for understanding the actual method though.

Jeremiahrounds 12:11, 20 June 2007 (UTC)Reply

I don't think there is any maximum involved in the Likelihood-ratio test, you just have to make the ratio of the likelihood under hypothesis H0 and H1. I'm not an expert in statistics but I think this equation introduces a confusion between Likelihood-ratio test and maximum likelihood estimation. I have never seen it presented this way anyway... Sylenius 14:45, 27 June 2007 (UTC)Reply

I think Jeremiahrounds is mistaken. In case the MLEs actually exist, the likelihood-ratio test statistic is in fact equal to what Hoel's book says it is, and also it is equal to the expression in TeX above, which appears in this article. But the likelihood-ratio test statistic can exist even in cases where MLEs don't exist, simply because the sup exists and the max does not, i.e. the sup is not actually attained. Moreover, the problem of non-unique MLEs doesn't matter, since it is only the value of the sup rather than the value of θ where the sup occurs that matters. Michael Hardy 19:05, 27 June 2007 (UTC)Reply

Untitled

Latest comment: 16 years ago2 comments2 people in discussion

Can someone please replace the awful ascii-art in this article with TeX, please?

I may get to that if someone doesn't beat me to it. Hundreds of articles here are in need of TeX to replace what was used here before 2003. Michael Hardy 22:57 Feb 2, 2003 (UTC)

The article uses λ in some places, and Λ in others -- is this intentional, or should they all be one or the other?

This article needs thorough checking and copyediting.

(Capital) Λ is the most frequently used notation for the test statistic. Michael Hardy 20:12 Feb 4, 2003 (UTC)

Can the Likelihoor ratio test be used in place of the F-test for a fixed effects models. Any diffrences from the F-test in this case? What about using LRT for testing fixed effects in mixed model?

The F-test is the likelihood ratio test in such models. Michael Hardy 22:30, 3 September 2005 (UTC)Reply

Hi. I may be misguided or mistaken here, I'm hardly expert. But I think the definition of the test statistic given is inconsistent with the test statistic given. The unrestricted numerator will be larger than the restricted denominator, so the ratio will be greater than 1, and its log will be positive, so -2 log Λ will be negative and can hardly be chi-square distributed. I think that either the ratio should be inverted, or the test statistic multiplied by negative 1, to keep things consistent. (My apologies again if I'm making a basic mistake, a possibility of which the likelihood is high.) Stevewaldman (talk) 00:58, 20 January 2008 (UTC)Reply

"asymptotically"

Latest comment: 17 years ago2 comments2 people in discussion

"If the null hypothesis is true, then −2 log Λ will be asymptotically χ2 distributed" The validity conditions of this theorem should be given. "asymptotically" when what tends to what value ?

I have now answered this question in the article. Michael Hardy 02:36, 28 October 2005 (UTC)Reply

There's really no further restriction on the random variables ("n independent identically distributed random variables")? Dchudz 15:22, 13 July 2007 (UTC)Reply

References

Latest comment: 16 years ago1 comment1 person in discussion

This article lacks references. For a instance, who proved that the likelihood ratio has density function is $\chi ^{2}+O_{p}(n^{-1})$ ?

I believe the critical paper is WILKS, SS (1938): "The Large Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses," Annals of Mathematical Statistics, 9, 60-62.

Freely available online at http://projecteuclid.org/euclid.aoms/1177732360 —Preceding unsigned comment added by 61.18.170.102 (talk) 18:15, 6 April 2008 (UTC)Reply

Coins

Latest comment: 17 years ago1 comment1 person in discussion

Hi, I think your example of the coins is fine but needs elaborating.

You haven't defined m_ij which I assume is the probability of event j when the two coins have the same probability of event j. It might be better calling it m_j then m_ij.
I think you should put in the equation for the likelihood ratio lambda, then follow it with the -2 log lambda (-2LL) equation
I'm not sure your -2LL equation is right, though I may be wrong. It looks to me as if your -2LL equation converts to lambda squared equals the ratio of the max likelihood of the data for the two hypotheses.

Desmond D.Campbell@iop.kcl.ac.uk 89.241.126.245 01:36, 24 March 2007 (UTC)Reply

This page is FUNDAMENTALLY WRONG.

Latest comment: 7 years ago7 comments6 people in discussion

Where to begin? A likelihood ratio test is for simple-vs-simple hypothesis. The test statistic given is a generalized, or maximum, likelihood ratio statistic. It may be commonly referred to in conversation as an LRT, but no competent mathematical statistics text will refer to it as such.

The distinction is critical. For example, the Neyman-Pearson lemma, mentioned in the article, is only directly applicable to the simple-vs-simple test. It may be extended to some composite alternatives (UMP test) through eg. Monotone likelihood ratios. For most practical composite hypotheses, the best results are generally more restrictive, eg. UMPU.

As for the flag about "too technical for a general audience". Blah. No choice, but one has to understand some mathematical statistics to have a chance of understanding LRT's. Conversely, a "General Audience" will have little concern over LRT's.

Anyway, another page for the "expert needed" flag... --Zaqrfv (talk) 09:37, 25 August 2008 (UTC)Reply

"Zaqrfv", your main point is wrong. It is true that the LR test referred to in the Neyman–Pearson lemma is for simple-versus-simple. But to say that no respectable text will use this term for the generalized version is wrong. It's quite commonplace. Michael Hardy (talk) 20:51, 17 March 2009 (UTC)Reply

Blah? I wholeheartedly disagree that this article can't be directed at a more general audience (e.g. physicians wanting to interpret the diagnostic validity of a test). The more complex stuff is fine towards the end of the article, but let's put the accessible stuff up front. Currently, the Wikipedia article is one of the least accessible articles on LRs on the web. After having read this article, I still have no idea what they are.164.111.16.221 (talk) 13:50, 5 November 2008 (UTC)Reply

Hi, I think that the definition given for the ratio is wrong: "The numerator corresponds to the maximum probability of an observed result under the null hypothesis. The denominator corresponds to the maximum probability of an observed result under the alternative hypothesis." I was checking in some books, Mathematics and Statistics for science page 157 for example and the definition is the other way around. Then the interpretation needs another review also. —Preceding unsigned comment added by Isapedraza (talk • contribs) 12:52, 26 February 2009 (UTC)Reply

It can be done either way; you just need to say that in one case you reject the null if the ratio is too big and in the other case if it's too small; the test is the same either way (in the sense that any dataset will lead to rejection in one case if and only if it leads to rejection in the other case). Michael Hardy (talk) 20:51, 17 March 2009 (UTC)Reply

A problem might be in the Criticism > Practical paragraph, which states that a disease is present if the likelihood ratio is large. This would be the other way around if we want to be consistent with the definition given. Jonas Wagner (talk) 13:16, 10 June 2010 (UTC)Reply

We just reviewed this article here in Biostats at Vanderbilt and there are several misleading statements, have to agree with "Zaqrfv". Professor Blume (worked with Royall and Goodman) and I would like to take a crack at cleaning it up. ShawnGarbett (talk) 18:42, 16 March 2017 (UTC)Reply

What is f(.)

Latest comment: 15 years ago2 comments2 people in discussion

Are we talking cdf or pdf? The probability that x is observed exactly as is? or that x or something more extreme than x was observed cancan101 (talk) 03:02, 18 February 2009 (UTC)Reply

In standard usage a lower-case ƒ is the pdf, and capital F is the cdf. Michael Hardy (talk) 22:09, 20 March 2009 (UTC)Reply

Dubious

Latest comment: 15 years ago2 comments2 people in discussion

I have revised the section, including the para marked dubious. Is it better/good enough? Otherwise give details of apparent problem points. Melcombe (talk) 10:16, 18 February 2009 (UTC)Reply

I have revised the section, the paragraph does not seem to hold good under the revised definition and hence is omitted. Kniwor (talk) 18:58, 23 August 2009 (UTC)Reply

Inconsistencies (Revised for improvement)

Latest comment: 15 years ago1 comment1 person in discussion

The sections and the definitions(though correct) seem inconsistent to me, and thoroughly confusing for a reader unfamiliar with the topic. I have revised and rewritten the first two sections to avoid any confusion and make things clear and consistent. Please point out any errors. Kniwor (talk) 18:57, 23 August 2009 (UTC)Reply

The ratio

Latest comment: 13 years ago1 comment1 person in discussion

Since the test is for nested ones, so it is better to state in the following way: $D=2\left(L\left(unconstrained\right)-L\left(constrained\right)\right)$

rather than the original articulation in the article.

For the non-logarithmized one, $D={\frac {l\left(unconstrained\right)}{l\left(constrained\right)}}$

Jackzhp (talk) 22:18, 9 February 2011 (UTC)Reply

Wilks's theorem

Latest comment: 9 years ago3 comments3 people in discussion

can someone put a reference so we can see where to look for its precise format. Jackzhp (talk) 22:21, 9 February 2011 (UTC)Reply

I've added the relevant ref: Wilks, S. S. (1938). "The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses". The Annals of Mathematical Statistics. 9: 60–62. doi:10.1214/aoms/1177732360. --Qwfp (talk) 19:37, 11 February 2011 (UTC)Reply

It seems like Wilks's theorem deserves its own article. Is there a good reason to keep it constrained to just this sub-subsection? Zhermes (talk) 18:41, 27 June 2015 (UTC)Reply

Background?

Latest comment: 12 years ago3 comments3 people in discussion

Wasn't the likelihood ratio test a result of Søren Johansen's work, or am I mistaken? Shouldn't he be mentioned in the Background section? I've only ever heard this referred to as Johansen's likelihood ratio and Johansen's likelihood ratio test. — Preceding unsigned comment added by 64.71.89.15 (talk) 19:34, 29 November 2011 (UTC)Reply

I think Johansen just developed a specific likelihood ratio test for cointegration. He was born in 1939 and Wilks' theorem about the asymptotic distribution of the log-likelihood ratio dates from 1938, so it seems improbable that likelihood ratio tests in general are a result of Johansen's work. Qwfp (talk) 20:10, 29 November 2011 (UTC)Reply

Thanks! I understand now. Apologies for forgetting to login and sign my previous post - didn't realize I was logged out. John Shandy` • talk 20:57, 29 November 2011 (UTC)Reply

Definition of Deviance is wrong

Latest comment: 11 years ago2 comments2 people in discussion

The definition of deviance on the page now, 25. of January 2013 is wrong.

It should be: -2ln [ likelihood of fitted model / likelihood of saturated model ] Which is the correct definition from Hosmer and Lemeshow's Applied logistic regression p. 13. — Preceding unsigned comment added by 62.242.0.66 (talk) 11:29, 25 January 2013 (UTC)Reply

Deviance is not mentioned in this article at all. There is a different quantity denoted by D. 81.98.35.149 (talk) 19:19, 25 January 2013 (UTC)Reply

Coin toss example issue

Latest comment: 7 years ago1 comment1 person in discussion

There seems to be an error in the coin toss example: in the last equation describing the likelihood ratio, the likelihood ratio is stated as n_ij/m_ij. But n_ij and m_ij are described in the text as the maximum likelihood *estimates* for the parameter of interest under the non-null and null models, respectively. We don't want to take the ratio of the log of the maximum likelihood estimates of the parameter, we want the ratio of the log of the actual likelihoods, no? — Preceding unsigned comment added by 164.107.189.130 (talk) 09:51, 20 October 2017 (UTC)Reply

Inconsistencies in definition of LR

Latest comment: 6 years ago2 comments1 person in discussion

A recent change switched the numerator and denominator in the definition of LR. As I understand it (I am not an expert) it occurs both ways in the literature, so either way is OK, but now there are inconsistencies, I believe, with other parts of the article. e.g.: in the 'Simple hypotheses' section: 'In the form stated here, the likelihood ratio is small if the alternative model is better than the null model.', and in the 'Interpretation' section: 'The likelihood ratio test rejects the null hypothesis if the value of this statistic is too small.', and 'The numerator corresponds to the likelihood of an observed outcome under the null hypothesis. The denominator ...'. And maybe there are other inconsistencies. I don't know whether is would be better to revert to null model likelihood in numerator, or fix the statements inconsistent with the current version. I leave it to someone with greater expertise/stronger opinion to do that.tom fisher-york (talk) 18:38, 22 December 2017 (UTC)Reply

I reverted to fix this inconsistency - if there is strong preference for the other way (null in denominatortom fisher-york (talk) 23:00, 31 December 2017 (UTC)) then the above mentioned statement need to be revised also to be consistent.tom fisher-york (talk) 19:06, 22 December 2017 (UTC)Reply

Separating LR test from Wilks' theorem

Latest comment: 6 years ago2 comments1 person in discussion

Wilks' theorem is an asymptotic method to estimate the LR test under some conditions. One could also use either analytical or numerical solutions. I think Wilks' theorem should get it's own article, and LR test should be extended with some other methods. I'll start making this change, if others disagree, feel free to explain why or revert. Tal Galili (talk) 19:29, 19 August 2018 (UTC)Reply

ok, I've now created Wilks' theorem. The current article on LR tests should have a section showing how to do LR test on simple cases (such as comparing two simple hypothesis on n bernulli trials), and also discuss the difference between LR test and "generalized likelihood ratio test". I hope someone else can continue this a bit further. Tal Galili (talk) 20:00, 19 August 2018 (UTC)Reply

explanation needed for capital P in equation for significance level alpha

Latest comment: 3 years ago1 comment1 person in discussion

hi there, as I see it it is not really clear what is meant by P here https://wikimedia.org/api/rest_v1/media/math/render/svg/a865919a1124526b5c890302848283418e2ddb6b. is it a probability density? but if so what do the arguments mean?

hope some can explain this in more detail. 31.17.204.117 (talk) 06:20, 21 March 2021 (UTC)Reply

suggestion:add an example for uses in nuisance parameters case

Latest comment: 2 years ago1 comment1 person in discussion

suggestion:add an example for uses in nuisance parameters case

show people how to use the lRT 68.134.243.51 (talk) 20:34, 5 August 2022 (UTC)Reply

Add topic