Numbers war: How Bayesian vs frequentist statistics influence AI • The Register Forums

Thursday 22nd June 2017 09:24 GMT Anonymous Coward

Maybe a daft question but would the bayesian method not have been applied to original 99% infection rates?

Thursday 22nd June 2017 09:36 GMT DavCrav

If there is no infection at all then 1% of the population will still test positive.

In mathematics (and statistics), a good way to test your reasoning is to go for the extreme values and see if it still works. So you should test 50/50, no infection and complete infection, and see what happens.

13 0 Reply

Thursday 22nd June 2017 09:50 GMT Paul Kinsler

If there is no infection at all then 1% of the population will still test positive.

Your sentence is the wrong way around. The data is (would be) that 1% of the population have been tested and came back positive, which means that ... <insert deduction about infection rates here>

2 0 Reply
1. Thursday 22nd June 2017 20:50 GMT sad_loser
  
  Re: If there is no infection at all then 1% of the population will still test positive.
  
  This gets into understanding how medical tests work (IAAD, and this is a bit of a niche) and which tests you should use.
  
  If there is a very low infection rate, and you are looking to 'rule in' you need a test that is very specific (is only true in disease) otherwise you suffer from too many false positives.
  
  Equally, once the infection is fairly widespread, it is generally safe to assume that anyone who looks like a zombie, is a zombie.
  
  We generally use medical tests with a 1% 'error' rate because in practice that is usually good enough.
  
  To look at this problem in a better way, Likelihood Ratios are the way to go.
  
  1 0 Reply
  1. Thursday 22nd June 2017 22:24 GMT Adam 52
    
    Re: If there is no infection at all then 1% of the population will still test positive.
    
    "it is generally safe to assume that anyone who looks like a zombie, is a zombie."
    
    Unfortunately it's not safe. Which is why we have so many people being sent home by their GP and then dying of meningitis / pulmonary embolism / cancer.
    
    0 0 Reply
    1. Friday 23rd June 2017 23:03 GMT John Presland
      
      Re: If there is no infection at all then 1% of the population will still test positive.
      
      It is not safe to asume that anyone who does not look like a zombie is a zombie.
      
      0 0 Reply

Thursday 22nd June 2017 09:38 GMT Anonymous Coward

Hmm. Think the communications theory profs upstairs are Bayesians. The field seems to have achieved a lot that way. But I'm not going to check, as I left my ear defenders at home.

3 0 Reply

Thursday 22nd June 2017 09:58 GMT Anonymous Coward

"How can you possibly do statistics on a guess!?"

Isn't it how pollsters work today??

5 0 Reply

Thursday 22nd June 2017 10:10 GMT Anonymous Coward

Re: "How can you possibly do statistics on a guess!?"

"Isn't it how pollsters work today??"

No. TFA explains the problem pretty well.

If different age groups have a different probability of actually voting and this is affected by factors like weather, it's very hard for polls to produce a single meaningful number.

The "Corbyn surge" in the under-25s and the 35-45 age group made a big difference. ComRes seems to have called it better than the other polls*. But if you were doing a poll to sell to the Torygraph, and assumption (1) gives May a majority of 100 while assumption (2) gives a hung Parliament, which assumption would the boss wish you to make? At that point he thinks about the cheque and mutters about turnout being guesswork.

Note that I am not making a party political point, but one about not wanting to be the messenger who's going to be shot. I detest roughly equal numbers of politicians on both sides. But our Press is mostly so partisan that it's part of the problem, not the solution.

*I first suspected that they might be right when Johnson tried to spin the BBC audience as composed of "lefties". You can pretty much assume that if Johnson opens his mouth he's either lying or evading the truth, so it was evidence that they had indeed picked a representative sample and he was getting worried.

8 1 Reply
1. Thursday 22nd June 2017 10:31 GMT Anonymous Coward
  
  Re: "How can you possibly do statistics on a guess!?"
  
  "But if you were doing a poll to sell to the Torygraph"
  
  ... well, the Guardian uses (I think) ICM who were consitently giving the highest Concervative lead - though of course, to the "true believer" the Guardian is one of the leaders of the MSM anti-Corbyn conspiracy
  
  0 2 Reply
  1. Thursday 22nd June 2017 11:07 GMT BebopWeBop
    
    Re: "How can you possibly do statistics on a guess!?"
    
    Well conspiracy aside, the Grauniad is not exactly filled with joy about the prospect Corbyn (not that I am either - Hobson's choice to some extent) and a lot of the pieces they have published have been somewhat ignorant of facts and statements.
    
    Labour give an impression about being equally gung-ho (if a little reserved on a few issues) on cutting or ties with Europe as a number of the hard-liners in the Conservative party.
    
    And I say that with regret - we have just achieved a larger European-based workforce (OK by 1 it is a small workforce) than UK.
    
    1 0 Reply
    1. Friday 23rd June 2017 09:42 GMT Anonymous Coward
      
      Re: choice
      
      Why choose the lesser evil; vote Cthulhu
      
      1 0 Reply
  2. Thursday 22nd June 2017 13:11 GMT Anonymous Coward
    
    Re: "How can you possibly do statistics on a guess!?"
    
    "well, the Guardian uses (I think) ICM who were consitently giving the highest Concervative lead "
    
    The Guardian is basically Lib Dem these days; telling the faithful that voting Labour was a wasted vote is no skin off their noses.
    
    0 0 Reply
Thursday 22nd June 2017 12:20 GMT IamStillIan

Re: "How can you possibly do statistics on a guess!?"

I think the response is "How can you possible do statistics without a guess!?". Given that there are no comprehensive models of the world, and practically nothing is truely independant, you always assume something, whether you realise it or not.

3 0 Reply
1. Thursday 22nd June 2017 18:03 GMT Ken Hagan
  
  Re: Given that there are no comprehensive models of the world...
  
  I think that's the key point. Everyone brings a prior (guess). The frequentists insist that the only legitimate prior is one that expresses total ignorance. The Bayesians are willing to start from somewhere else. Once enough evidence actually turns up to make the prior unimportant, both parties agree. Until then, you don't actually have enough evidence.
  
  1 0 Reply

Thursday 22nd June 2017 09:59 GMT Michael H.F. Wilkinson

I am Bayesian ...

probably, but certainly not religiously

4 0 Reply

Thursday 22nd June 2017 10:33 GMT simon_c

But, in the example given don't we also have to take into account the effects of those false results ?

A false positive : Some pool innocent gets shoved into the quarantine hospital room/jail cell/pit (depending on the stage of the overall infection) until they are eaten by the other recently turned.

A false negative : soon-to-be zombie killer goes back to their family to consume them at a later date.

How is that figured out when using stats to save the world ?

1 0 Reply

Thursday 22nd June 2017 10:37 GMT Third Man

Now we're into Decision Theory

It's the utility of the outcomes that will underpin your decision, often this makes the Bayes/Frequentist views entirely irrelevant...as observed.

4 0 Reply
Thursday 22nd June 2017 11:04 GMT Tony Haines

//How is that figured out when using stats to save the world ?//

good question. I think stats would help.

Let us assume we have a certain amount of money to spend on the issue, which essentially can be converted into putting people into isolation, quarantine and/or providing good separation between arbitrarily sized groups in the community (i.e. building walls and guarding access-gates). Plus further testing beyond the initial screen.

Depending on the relative costs of those, and the estimate of the infection rate (which we can quickly obtain given the known test error rates, once the population screen is complete), and the cross-infection rate (how many latent cases an infective zombie causes) an equation could be derived to optimise the number of people saved.

0 0 Reply
1. Thursday 22nd June 2017 11:08 GMT BebopWeBop
  
  Let us assume we have a certain amount of money to spend on the issue, which essentially can be converted into putting people into isolation, quarantine and/or providing good separation between arbitrarily sized groups in the community (i.e. building walls and guarding access-gates). Plus further testing beyond the initial screen.
  
  Ahh you don't fool me - I've seen all of the Zombie films and the inevitable consequence of locking them out....
  
  1 0 Reply
2. Thursday 22nd June 2017 11:16 GMT Anonymous Coward
  
  I recall someone saying that there are lies, damn lies and statistics and anyone that relies on statistics is a fool.
  
  In the real world t is better to rely on actual facts (numbers) rather than some guesswork based on assumptions that mat not be provable.
  
  0 0 Reply
  1. Thursday 22nd June 2017 12:47 GMT Tony Haines
    
    //In the real world t is better to rely on actual facts (numbers) rather than some guesswork based on assumptions that mat not be provable.//
    
    Well obviously. If you have the data you want, you don't need statistics. Unfortunately, the real world is not always so obliging.
    
    If you don't have the information already, what are you going to do - give up?
    
    5 0 Reply
  2. Thursday 22nd June 2017 13:57 GMT Anonymous Coward
    
    _{"In the real world t is better to rely on actual facts"}
    
    The real world doesn't work like that. Note that you missed out a letter which is almost certainly "i" to make "it". All readers of your missive will have had to apply some form of deductive (probability based) reasoning to fill in the gap. Those with a shaky grasp of English may have even got it wrong. That was easy to correct but this error is nearly parse-able without correction to get a different result than that which you intended:
    
    _{"based on assumptions that mat not be provable"}
    
    0 0 Reply
    1. Thursday 22nd June 2017 18:00 GMT Alistair
      
      @gerdesj
      
      I not sure why he's subbing in a variable. He appears to have more than one real world.
      
      0 0 Reply

Thursday 22nd June 2017 11:25 GMT Tom 7

Both methods have merits which are mathematically based.

So I can only think the arguments between the two sides are to try and confuse the punter to ensure they dont get far enough into the subject to realise when they are on the wrong side of a dutch book.

0 0 Reply

Thursday 22nd June 2017 11:26 GMT Destroy All Monsters

There is always more to read

The origins and legacy of Kolmogorov's. Grundbegriffe

and

Bayes and Frequentism: a Particle Physicist's perspective

2 0 Reply

Thursday 22nd June 2017 13:49 GMT EnviableOne

Statistics acan mean whatever you want them to, if you select the right test, right sample size and right sample.

So does it really matter which camp you live in?

1 1 Reply

Thursday 22nd June 2017 21:08 GMT Anonymous Coward

"Statistics acan mean whatever you want them to, if you select the right test, right sample size and right sample."

That basically negates the entire point of statistical analysis, so no. What you are describing is more or less "anecdotal evidence".

1 0 Reply

Friday 23rd June 2017 08:46 GMT jamesthorniley

Not an example of Bayesian inference

Sympathise with the motive of this article but generally not impressed.

The example given has nothing to do with "Bayesian" logic. The author doesn't seem to understand the difference between Bayes theorem, a mathematical result which no one disputes, and Bayesian inference, which is a controversial stance in philosophy of science and stands in contrast to frequentist inference.

No frequentist would dispute the logic of the given example - there is nothing objectionable to a sane frequentist about using population frequencies and Bayes theorem in this way. Nor is there anything wrong with using your best guess of the population frequency if you don't have it exactly (though you can do much more nuanced stats than a single guess, there's no need for a short article to go into that).

The controversy is over the extent to which one can equate "belief" and "probability". It's a thorny topic and this comment is long enough already.

The author is right this is a religious war that will bite you if you jump into ML or stats, but yet another "explainer" from someone who doesn't really understand the subject isn't much help.

0 1 Reply

Friday 23rd June 2017 10:24 GMT Penelope Gwendolyn

Re: Not an example of Bayesian inference

Errr, I think the author makes it very clear that he does understand the difference. The article opens with a discussion of conditional probabilities which, he makes clear, are related to Bayes Theorem.

You title your comment with “Not an example of Bayesian inference”. Where in the article does it say it IS an example of Bayesian inference? Answer - it doesn’t. The word inference is never used. The article is more generally about the Bayesian World.

Strangely, after that, your views seem to mirror his all the way through.

You say “No frequentist would dispute the logic of the given example - there is nothing objectionable to a sane frequentist about using population frequencies and Bayes theorem in this way.”

He says “Now I hope I have convinced you that …. no one would be stupid enough NOT to take it into account if they knew what it was because it clearly makes a difference.”

So you agree there.

He goes on to say “However, a dyed-in-the-wool frequentist would say, "But you don't know the actual number. How can you possibly do statistics on a guess!?”

You say “Nor is there anything wrong with using your best guess of the population frequency if you don't have it exactly.”

Violent agreement there as well. You both agree that most people WOULD use the guess. You say that the “sane” frequentists would use the numbers (implying that some insane ones would not) and the author makes the point that there are some zealots out there who would not if they don’t have the exact figure.

Finally, you say that “The controversy is over the extent to which one can equate "belief" and "probability".

He says “If you read more about the frequentist and Bayesian views of the world it turns out that they diverge much further and the debate becomes much more of a philosophical one about how you view the world.”

I don’t think you have presented any information to suggest that he doesn’t understand the topic; the two of you seem to have essentially identical views.

0 0 Reply

Friday 23rd June 2017 13:54 GMT TW12

Bayes Rule is uncontroversial and unobjectionable to Frequentists.

What the latter object to is applying the Rule to the parameters of a probability distribution so that, say, the mean of a normally distributed random variable now becomes a random variable itself rather than a real albeit unknown constant. Welcome to Bayesianism.

Then the prior can only be interpreted as degrees of belief in its different values; a deviation from objectivism and pure empiricism represented by the undiluted information contained in a study sample. Since many forms of prior make the maths intractable, it is traditional to use a conjugate prior so that the posterior distribution can be more easily calculated, In other words a fudge on top of a fudged method.

Of course we only hear of Bayesians successes though Nate Silver's recent poll forecasting would suggest that what used to be called inverse probability is due a more sober press.

0 1 Reply

Saturday 24th June 2017 02:13 GMT swm

Monty Hall

Monty Hall tells his contestant that there are three curtains and behind one is a car and behind two are goats. The contestant chooses one curtain and Monty hall opens another curtain revealing a goat. Should the contestant switch?

0 0 Reply

Topics

Special Features

Vendor Voice

Resources

COMMENTS

If there is no infection at all then 1% of the population will still test positive.

Re: If there is no infection at all then 1% of the population will still test positive.

Re: If there is no infection at all then 1% of the population will still test positive.

Re: If there is no infection at all then 1% of the population will still test positive.

"How can you possibly do statistics on a guess!?"

Re: "How can you possibly do statistics on a guess!?"

Re: "How can you possibly do statistics on a guess!?"

Re: "How can you possibly do statistics on a guess!?"

Re: choice

Re: "How can you possibly do statistics on a guess!?"

Re: "How can you possibly do statistics on a guess!?"

Re: Given that there are no comprehensive models of the world...

I am Bayesian ...

Now we're into Decision Theory

Both methods have merits which are mathematically based.

There is always more to read

Not an example of Bayesian inference

Re: Not an example of Bayesian inference

Monty Hall

POST COMMENT House rules

Enter your comment

Add an icon

Other stories you might like

Arm flexes silicon muscles to push generative AI at the edge

Developers are calling the shots on AI planning, judging by your experience

Why making pretend people with AGI is a waste of energy

Belgian beer study acquires taste for machine learning

CNCF boss talks 'irrational exuberance' in an AI-heavy Kubecon keynote

New York Times: OpenAI’s claim we 'hacked' its products both 'irrelevant' and 'false'

Nvidia rival Cerebras says it's revived Moore's Law with third-gen waferscale chips

Can AI shorten PC replacement cycles? Dell seems to think so

Meta seeks ASIC designers for ML accelerators and datacenter SoCs

Quilter's AI design service nabs $10M to make circuit board design easier

What is Model Collapse and how to avoid it

IBM Japan and NTT think they can make datacenter aircon adjust to different workloads

About Us

Our Websites

Your Privacy