Mailing List complex-science@necsi.org Message #6439

From: Peter McBurney <p.j.mcburney at csc.liv.ac.uk>
Sender: <yaneer at necsi.org> (Yaneer Bar-Yam)
Subject: Re: Uncertainty
Date: Mon, 23 Feb 2004 23:27:12 -0500
To: complex-science
Full Headers
Undecoded Letter
Aleks --


I hope you understand that the body of work on this subject is huge, and I
have no pretension to be fair when reducing it to two paragraphs. I also
presume that not everyone on this list is an expert in probability and
uncertainty. Finally, the topic is polarized, and most of those involved
have a derogatory opinion of other poles. Therefore, I'd appreciate brief
arguments more than either exclamations "I disagree" or pointers to books
describing one's own point of view. I admit, however, that my post was
written from a probabilistic point of view, while my discussion of the
relationship between probability and reality comes from the overlap between
statistics and physics (Jaynes, etc.).


You presented what I believe was an inaccurate view of the differences between the two main approaches to Probability. I responded with what I thought was a more accurate description of the differences. The two books I cited are not books which describe my own point of view, but books which present these differences neutrally -- in the first case (Gillies), with respect to the underlying philosophies, in the second case (Klir and Wierman), with respect to mathematical formalisms.

Of course, if you reject any alternative to Bayesianism, as Jaynes does, then you would consider any book which even describes these other approaches as biased.





Your characterization of Bayesian statistics focuses on the subjective
component,


Because this is the underlying difference between Bayesian approaches and frequentist approaches.



which makes it seem that Bayesian statistics is all subjectivity,
and frequentistic statistics is all objectivity. The prior knowledge, made
explicit in Bayesian statistics, is a tool that can be used, lay unused or
be abused.


Or it can be considered a totally inappropriate aspect to statistical inference, as most frequentists believe.

The point here is that these 2 approaches are different paradigms. What one focuses on, the other thinks is invalid.




Frequentists do not use them, but the consequence is equivalent
to putting the prior probability of 1 to their chosen modeling technique.


Not at all. It is only "equivalent" inside a Bayesian framework. If you reject this framework, it is not equivalent.





De Finetti's definition of subjective probability is "whatever helps us make
good decisions". Basically, probability is justified through decision
theory. There are several utility functions that achieve the maximum
precisely where the subjective probability matches the objective
probability. Some examples are Brier score, Kullback-Leibler divergence, and
expected zero-one loss. This kind of freedom allows you to apply probability to many more problems
than you could by using the narrow frequentistic definition. At the same
time, with the above utility functions, the optimal subjective probability
is equivalent to the frequentistic probability.


And, likewise, there are problems to which the Bayesian approach fails you. I refer you to the Gillies book, and to the books of Deborah Mayo. (Her books *are* strong propoponents for the frequentist viewpoint, unlike Gillies.)





So, frequentist statistics assumes vague reality, but crisp models.
Again, I disagree. This dimension (crisp-vague) is orthogonal to that of the frequentist-Bayesian dimension.


In what way?



Because the key difference, IMO, between the two approaches is not the assumption of crisp or vague reality, but whether or not it is valid to consider probabilities of statements for which no objective evidence may be produced. I guess we disagree on this point.




This is not correct. Bayes wrote one paper, which was published after he died. There is no evidence that he understood very much about uncertainty, and certainly not the subtleties of imprecise probabilities. I deplore the hagiography to which Bayesians seem prone.


Bayes worked with probabilities of probabilities. Isn't this an example of
imprecise probability?


Your post said:

Here,
there is a lot of reinvention of concepts that have already been understood
already by Reverend Bayes!


I stand by my comment in response.




Here's the good reason: Probability theory does not represent all forms of uncertainty. It assumes the law of the excluded middle, yet in many decision domains this is not applicable. It assumes that every domain may be represented by a finite or countably infinite set of propositions. It assumes that all the possible outcomes of a random process may be articulated in advance, thus eliminating most problems in medicine, business strategy, and public policy. It does not represent second- and higher-order uncertainty (uncertainties about uncertainties) in a way which is humanly-intuitive, unlike (say) Dempster-Shafer theory.


If one disses Bayesian statistics as subjective and "not science" on the
basis of priors,



I did not "diss" Bayesian statistics on the basis of priors. I criticized it for accepting as valid, statements for which there can be no objective evidence produced. In any reasonable definition of science, we would expect experiments to be repeatable, and claims not to depend upon the internal, subjective beliefs of the person making the claim. In that sense, Bayesian approaches are not scientific, since they lack such objectivity.

If I said to you that I believe that the probability of reincarnation was 95%, what experiment could you conduct to enable this prior probability to be revised? Why then would my statement be accepted as scientific?

There are still problems if we consider a statement for which evidence could be produced: eg, I believe that the probability of life being found on Mars within the next 300 years is 30%. Why should my prior beliefs have anything to do with the act of inference? Basing a theory of inference on the subjective beliefs of the person doing the inferencing seems contrary to commonly-understood notions of science, IMHO.

And, what happens when scientists have different prior beliefs? Where is the Bayesian theory for the coherent combination of different prior probabilities? The usual approach is to rely on theorems which show that different prior probabilities will converge to the same posterior probability in the case when all participants receive the same information and process it in the same way. But this rarely if ever happens, and, in any case, the mathematics (again!) relies on very restrictive assumptions.

I conclude from this that Bayesianism has yet to demonstrate, beyond reasonable doubt, that it is a science.



it probably doesn't matter that there are good Bayesian
solutions to all the above issues. As I mentioned before, probability has
*always* (since the times of De Moivre and Laplace) been applied to
continuous variables: consider the Gaussian distribution with a countably
infinite set of propositions. If two events may happen at the same time, you
need two variables, not one: the argument of excluded middle is hence
irrelevant.


You misunderstand the criticism re LEM. Here's an example:

You are policeman investigating a crime. You have some evidence that person A did the crime. You also have some other evidence that person A did not do the crime. The evidence you have points to both these statements ("A did the crime" and "A did not do the crime") being true simultaneously. This contradicts LEM. In order to decide which statement to believe, you need to compare the two sets of evidence. But what if the evidence itself has uncertain elements, over which one could place probabilities? The recent alternatives to Probability -- Possibility theory, Dempster-Shafer theory -- cope with this situation better than does Probability theory.

It is possible to formalize these types of problem in second- or higher-order Probability theory (probabilities about probabilities). But this quickly becomes unintuitive, as I mentioned. Human decision makers understand Possibility and D-S approaches more readily than they do second-order probabilities.





I don't base any of my arguments on Cox's theorem. De Finetti's reference to
infinitely many interactions refers to expected utility: of course you can
do fewer and end up with a good approximation. De Finetti's justification
was asymptotic.



Both Cox and di Finetti seek to justify the use of Probability theory by mathematical arguments. In both cases, they make assumptions which do not apply in all application domains. As I said, the point made by the critics of Probability is that it is a model of reality, and, as with any model, it captures some aspects of reality and misses others. Strong protagonists of Probability theory, such as Cox and Jaynes, seem not to understand this. I think this idea is fundamental in AI, which is perhaps why people in AI have led the search for alternatives to classical Probability.






It took another field entirely -- AI -- to actually develop theories which formalize uncertainty in ways different to probability theory. I think it is meaningful that it was engineers like Zadeh, concerned with building things in the real world, who recognized and tried to solve these problems. It is not the first time that theoreticians have disparaged engineers.


I work in the field of AI, and I'm an engineer by education.



I was not criticizing you. I was trying to explain why Lotfi Zadeh is held in low regard by many statisticians, and yet revered by many AI people and control engineers.






-- Peter

Subscribe (FEED) Subscribe (DIGEST) Subscribe (INDEX) Unsubscribe Mail to ListMaster