[lit-ideas] Re: Grice v Grice

From: "" <dmarc-noreply@xxxxxxxxxxxxx> (Redacted sender "Jlsperanza" for DMARC)
To: lit-ideas@xxxxxxxxxxxxx
Date: Fri, 11 Dec 2015 07:35:14 -0500

What is the best conceptual analysis of 'onus probandi'?

To what extent is the conceptual analysis of the ordinary language and
legalese 'proof' rooted on what the Romans had behind the 'probandi'? How
'diachronic' should 'conceptual analysis' of "legal proof" be?

In a recent post, re: Grice v Grice, McEvoy was considering 'proof',
'provable', and 'provability', which may be a good occasion to see if we can
analyse this, since he is assuming a problem-solving approach that may leave,
for a Griceian, certain items 'unclarified'.

McEvoy was approaching legal proof from a problem-solving perspective; but
surely this perspective 'presupposes' a constellation of a conceptual
analysis (to echo Collingwood). Consider the proof for murder and the proof
for
theft (I'm simplifying legalese). It may be argued that since the
conceptual analysis of theft includes more elements (or necessary and
sufficient
conditions) than that of murder, the standard of proof would vary. Some legal
philosophers see this as a paradox but not one that rejects conceptual
analysis.

There are different conceptual analysis of proof'. The more philosophically
sound (qua legal philosophy) is the one more akin to the conceptual
analysis of proof in general. But we shall see.

Problem-solving does not feature much in what analytic legal philosophers
have considered about legal proof; instead, alternative conceptual analysis,
and rejection of alleged counter-examples have. So one has to proceed
carefully.

First we should start with a bit of an axiom.
i. Senses should not be multiplied beyond necessity.
ii. "Proof" has only one sense.
The idea that there is 'legal proof' and other types of proof would then
be judged spurious or tendentious.
LEGALESE has a further item of cognate vocabulary -- since we should start
with 'linguistic botanising': "probative". Thus, In reaching the verdict,
the trier of fact has to assess the so-called "probative value" (or
"probative force") of each individual item of evidence which has been received
at
the trial.
An analysis of the concept of "probative value" may play a role at a prior
stage when the judge has to make a ruling on whether to receive the
evidence
In many legal systems, if the judge finds this "probative value"
(concerning a proposed item of evidence) to be low and substantially
outweighed by
countervailing considerations, such as the risk of causing unfair prejudice
or confusion, the judge can refuse to let the jury consider the evidence.
The concept of "probative value" is sometimes confusingly referred to in
legalese (there is legalese and there is legalese) "probative force" -- M.
A. E. Dummett would be delighted: he loved force! The concept of "probative
value" has been analysed in terms of a likelihood ratio. Evidence
(including hearsay evidence) can be more or less PROBATIVE depending on the
value of
the likelihood ratio. The probative value of a blood type match may be
1.0:0.5 (or 2:1) as 50% of the suspect population may have the same blood type
as the accused. But if the blood type is less common and only 25% of the
suspect population has it, the probative value of the evidence is now
1.0:0.25 (or 4:1). The probative value is greater in the latter than in the
former scenario.

There may be an alternative conceptual analysis for "probative value". The
probative value of an item of evidence is assessed contextually. The
probative value of E may be low given one state of the other evidence and
substantial given a different body of other evidence. If some evidence shows
that
a woman has died from falling down an escalator at a mall while she was
out shopping, her husband’s history of spousal battery is unlikely to have
any probative value in proving that he was responsible for her death. But if
the other evidence shows that the wife had died of injuries in the
matrimonial home, the question is whether the injuries were sustained from an
accidental fall from the stairs or inflicted by the husband. The same evidence
of spousal battery will now have significant probative value. On this
alternative conceptual analysis, the "probative value" of an item of evidence
E
is NOT measured simply by the objective context-free likelihood ratio. The
concept of "probative value" is, now, rather, analysed as the degree to
which E increases (or decreases) the probability of the proposition or
hypothesis H in support of or against which E is led. The probative value of E
is
defined, conceptually, as any difference between the probability of H given
E (the posterior probability) and the probability of H absent E (the prior
probability). The probative value of E = P(H|E)−P(H)P(H|E) (the posterior
probability) is derived by applying Bayes’ theorem — that is, by
multiplying the prior probability by the likelihood ratio.

On this second conceptual analysis, the likelihood ratio does not itself
constitute the probative value of E, even it is nevertheless a crucial
component -- a necessary condition -- in the analysis of the concept. A major
difficulty with both analysis of probative value is that for most evidence,
obtaining the figures necessary for computing the likelihood ratio is
problematic (Grice's example, "When did you last see your father?").
Exceptionally, quantitative base rates data exist, as in human blood types.
Where
objective data is unavailable, the fact-finder has to draw on background
experience and knowledge to come up with subjective values. With blood types,
a
critical factor in computing the likelihood ratio is the percentage of the “
suspect population” who has the same blood type as the accused. "Reference
class" is the general statistical term (as "most grices are extinct,
statistically") for the role that the suspect population plays in this
conceptual analysis. But how should the reference class of “suspect
population” be
defined? Should we look at the population of the country -- say, of
Ruritania -- as a whole? Or of the town or the street where the palace is
situated
and the the alleged murder of the King of Ruritania occurred? What if it
occurred at an international airport where most the people around are
foreign visitors? Or what if it is shown that both the accused and the victim
were at the time of the alleged murder inmates of the same prison? Should we
then take the prison population as the reference class? The distribution of
blood types may differ according to which reference class is selected.

Sceptics of mathematical modelling of probative value emphasise that data
from different reference classes will have different explanatory power and
the choice of the reference class is open to — and should be subjected to —
contextual argument and requires the exercise of judgment. There would
be, contra H. L. A. Hart, no a-priori, purely analytic, way of determining
the correct reference class. Some legal philosophers have proposed
quantifiable ways of selecting, or assisting in the selection, of the
appropriate
reference class. On one suggestion, the court does not HAVE to search for the
OPTIMAL reference class. A general characteristic of an adversarial system
of trial is that the judge plays a passive role. It is up to the parties
to come up with the arguments on which they want to rely and to produce
evidence in support of their respective arguments. The adversarial setting
makes the "reference class" problem more manageable as the court need only to
decide which of the reference classes relied upon by the parties is to be
preferred. And this can be done by applying one of a variety of technical
criteria that statisticians have developed for comparing and selecting
statistical models.

Another suggestion is to use the statistical method of "feature selection"
instead. The ideal reference class is thus conceptually analysed as the
intersection of all relevant features of the case, and a feature is relevant
if it is correlated to the matter under enquiry. E.g. if the amount of drug
likely to be smuggled is reasonably believed to co-vary with the airport
through which it is smuggled, the country of origin and the time period, and
there is no evidence that any other feature is relevant on which data is
available, the ideal reference class is the class of drug smugglers passing
through that airport originating from that country and during that time
period. Both suggestions have self-acknowledged limitations: not least, they
depend on the availability of suitable data. Now, while statistical methods
have advice to offer on how courts should judge quantitative evidence, they
do so “in a way that supplements normal intuitive legal argument rather
than replacing it by a formula.

The "reference class" problem is not confined to the probabilistic
assessment of the probative value of individual items of evidence. It is a
general
difficulty with a mathematical approach to legal PROOF, and that's why it
is of particular interest to the analytic legal philosopher. The same
problem arises on a probabilistic interpretation of the standard of legal
proof
when a court has to determine whether the standard is met based on all the
evidence adduced in the case. How does the "reference-class" problem can
arises in this connection? Let it be that the plaintiff sues Blue Bus Company
to recover compensation for injuries sustained in an accident. The
plaintiff testifies and the court believes on the basis of the plaintiff's
testimony, that the plaintiff was run down by a recklessly-driven bus. It was,
alas, dark and the plaintiff can NOT really tell whether the bus belonged to
The Blue Bus Company. Assume that there is evidence which establishes that
The Blue Bus Company owns 75% of the buses in the capital of Ruritania, where
the accident occurred, and the remaining 25% is owned by The Red Bus
Company. To use the data as the basis for inferring that there is p = 0.75
that
the bus involved in the accident was owned by The Blue Bus Company would
seem to privilege the "reference class" of "buses operating in the capital of
Ruritania" over other possible reference classes such as "buses plying the
street in the capital of Ruritania where the accident occurred" or "buses
operating at the time in question". I.e. A different "reference class" may
produce a very different likelihood ratio. It is crucial how the "reference
class" is CHOSEN and this is ultimately a matter of argument and judgment.
Any choice of a "reference class" other than the class that shares every
feature of the particular incident, which is, in effect, the unique incident
itself, is in principle contestable.

Critics of the mathematisation of legal "proof" raise this point of the
arbitrariness of the 'reference class' as an example of inherent limitations
to the axiomatic modelling of probative value. But there is an alternative:
an explanatory analysis of legal "proof". This explanatory analysis of
'legal proof' has the advantage of avoiding the "reference class" problem
because it does not attempt to quantify probative value. Suppose a man is
accused of killing his wife. Evidence is produced of the man's extra-marital
affair. The unique "probative value" of the accused’s infidelity can NOT be
mathematically computed from statistical base rates of infidelity and
uxoricides. In assessing its "probative value", the court looks instead at how
strongly the evidence of infidelity supports the explanation of the events put
forward by the side adducing the evidence and how strongly it challenges
the explanation offered by the opponent. The prosecution may be producing the
evidence to buttress its case that the accused wanted to get rid of his
wife so that he could marry his mistress. The defence may be advancing the
hypothesis that the couple was unusual in that they condoned extra-marital
affairs and had never let it affect their marriage. How much "probative
value" the evidence of infidelity has depends on the strength of the
explanatory
connections between it and the competing hypotheses and this is not
something that can be quantified.

But the disagreement in this debate is not as wide as it might appear. The
critics concede that axiomatic models for evaluating evidence in law may
be useful. What they object to is scholarship arguing that such models
establish the correct or accurate probative value of evidence, and thus
implying
that any deviations from such models lead to inaccurate or irrational
outcomes. On the other side, it is acknowledged that there are limits to
mathematical formalisation of evidential reasoning in law and that context,
argument and judgment do play a role in identifying the "reference class".

We have thus far concentrated on "probative value" of an individual item
of evidence. But the conceptual analysis should extend to the TOTAL body of
evidence presented at the trial. The law assigns the legal "onus probandi"
between parties to a dispute. E.g. at a criminal trial, the accused is
presumed innocent; "onus probandi" is on the prosecution to prove that the
accused is guilty as charged. To secure a conviction, the body of evidence
presented at the trial must be sufficient to meet the standard of "proof". A
verdict will be given in favour of the side bearing the "onus probandi" iff,
having considered all of the evidence, the fact-finder is satisfied that the
applicable standard of "proof" is met.

Now, the standard of "proof" may receive different conceptual analyses. On
one such analysis, the standard of "proof" is a probabilistic threshold.
In civil cases, the standard is the "balance of probabilities" or, the
"preponderance of evidence". The plaintiff may satisfy this standard and
succeed
in his claim only if there is, on all the evidence adduced in the case,
more than 0.5 probability of his claim being true. At a criminal trial, the
standard for a guilty verdict is -- to use a very 'dogmatic' or rather
anti-sceptical turn of phrase which is SO LEGALESE it hurts: "legal proof
beyond
a reasonable doubt". Here the probabilistic threshold is thought to be
much HIGHER than 0.5, but courts have eschewed any attempt at authoritative
quantification. Typically a notional value such as 0.9 or 0.95 is assumed by
the legal philosopher for the sake of his analysis. For the prosecution to
secure a guilty verdict, the evidence adduced at the trial must establish
the criminal charge to a degree of probability that crosses this threshold.
Where there is an intermediate standard of “clear and convincing evidence”
which is reserved for special cases, the probabilistic threshold is said
to lie somewhere between 0.5 and the threshold for "proof beyond reasonable
doubt".

Some conceptual-analytic legal philosophers employ decision-theory to
develop a framework for setting the probabilistic threshold that represents
the
standard of legal proof. Since the attention in this area of the law tends
to be on the avoidance of errors and their undesirable consequences, it is
convenient to focus on disutility. The expected DISutility of an outcome
is the product of the DISutility -- the social costs -- of that outcome and
the probability of that outcome. Only two options are generally available
to the court. In criminal cases, it must either convict or acquit the
accused; in civil cases it has to give judgment either for the plaintiff or for

the defendant. At a criminal trial the decision should be made to convict
where the expected DISutility of a decision to acquit is greater than the
expected DISutility of a decision to convict. This is so as to minimize the
expected DISutilities. If we formulate the conceptual analysis axiomatically,
we have:

pDag>(1−P)Dci -- where "p" is the probability that the accused is guilty
on the basis of all the evidence adduced in the case; "Dag" is the
DISutility of acquitting a guilty person and Dci" is the DISsutility of
convicting
an innocent person. A similar conceptual analysis applies to civil cases:
the defendant should be found liable where the expected DISutility of finding
him NOT liable when he is in fact liable exceeds the expected DISutility
of finding him liable when he is in fact not liable. On this conceptual
analysis, a person should be convicted of a crime only where p is greater than:

11+DagDci. A similar conceptual analysis applies in civil cases, except
that the two DISutilities (Dag and Dci) are replaced by their civil
equivalents, framed in terms of the DISutility of awarding the judgment to a
plaintiff who in fact does not deserve it and the DISutility of awarding the
judgment to a defendant who in fact does not deserve it. On this conceptual
analysis, the crucial determinant of the standard of "legal proof" becomes the
ratio of the two DISutilities. In the civil context, the DISutility of an
error in one direction is deemed equal to the DISutility of an error in the
other direction. Hence, a probability of liability of greater than 0.5 would
suffice for a decision to enter judgment against the defendant. The
situation is somewhat different at a criminal trial: Dci, the DISutility of
convicting an innocent person is considered far greater than Dag, the
disutility
of acquitting a guilty person. Hence, the probability threshold for a
conviction should be much higher than 0.5.

An objection to this conceptual analysis may be that it is incomplete, and
that it allows for alleged counter-examples, as Grice would put it. Thus,
it is not enough to compare the costs of erroneous verdicts. The utility of
an accurate conviction and the utility of an accurate acquittal should
also be considered and factored into the equation. This results in the
following modification of the conceptual analysis for setting the standard of
legal "proof":

11+Ucg−UagUai−Uci -- where "Ucg" is the utility of convicting the guilty,
"Uag" is the utility of acquitting the guilty, "Uai" is the utility of
acquitting the innocent and "Uci" the utility of convicting the innocent.
Since the relevant utilities depend on the individual circumstances, such as
the seriousness of the crime and the severity of the punishment, this
decision-theoretic conceptual analysis of the standard of legal "proof" leads
to
the conclusion that the probabilistic threshold should vary from case to case
-- which should not please all legal philosophers (it pleased H. L. A.
Hart). In other words, the standard of "legal proof" is flexible or as Hart
has it, "floating". This conceptual analysis is perceived by some to be
problematic. First, the conceptual analysis falls short descriptively. The law
is alleged to require the court to apply a FIXED standard of "legal proof"
for ALL cases within the relevant category. All criminal cases are governed
by the same high standard; all civil cases are governed by the same lower
standard. That said, it is unclear whether fact-finders in reality adhere
strictly to a FIXED standard of "legal proof" (but this does not mean that
'proof' now has two senses!).

The conceptual analysis includes a valuational component: it advances a
claim about what the law OUGHT to be, is DEEMED to be, and not what it
perhaps alas is. The standard of "legal proof" ought to vary from case to
case.
But this conceptual analysisl faces a second objection. In principle, civil
litigants have the same two rights that we shall identify. Moral harm
arises as an objective moral FACT when a person is erroneously convicted of a
crime. Moral harm is distinguished from, to use McEvoy's favourite adjective,
"mere" harm (in the form of pain, frustration, deprivation of liberty and
so forth) that is suffered by a wrongfully convicted and punished person.
While an accused person does have a right not to be convicted if innocent, an
accused person does NOT have the right to the most accurate procedure
possible for ascertaining their guilt or innocence. However, an accused person
does have the right that a certain weight or importance be attached to the
risk of moral harm in the design of procedural and evidential rules that
affect the level of accuracy. An accused person has, further, the right to a
consistent weighting of the importance of moral harm and this further right
stems from their right to equal concern and respect. Such a conceptual
analysis carries an implication. It is arguable that to adopt a "floating"
standard of "legal proof" offends the second right insofar as it means
treating an accused person differently with respect to the evaluation of the
importance of avoiding moral harm. This difference in treatment is reflected
in
the different level of the risk of moral harm to which an accused person is
exposed.

There is a still another objection to the "floating" standard of legal
"proof". Fact-finding is a theoretical exercise that engages the question of
what to believe about the disputed facts. What counts as "reasonable" for
the purpose of applying the standard of legal "proof beyond reasonable doubt"
is, accordingly a matter for theoretical (or as Grice prefers, 'alethic'),
not practical, reasoning. Only reasons for BELIEF are germane in "alethic"
reasoning. While considerations that bear on the assessment of utility and
disutility provide rather "practical" reasons for action, and thus,
analytically, not a reason TO BELIEVE in the accused’s guilt. Thus, it is
alleged, a decision-theoretical conceptual analysis cannot therefore be used
to
support a variable application of the standard of legal "proof beyond
reasonable doubt".

Another criticism of a conceptual analysis of a flexible standard of
"legal proof" is that it would seem that the maximisation of expected utility
is
a criterion for selecting the appropriate probabilistic threshold to apply
but that it should play no further role in deciding whether that
threshold, once selected, is met on the evidence adduced in the particular
case. It
is not incompatible with the decision-theoretic analysis to insist that the
question of whether the selected threshold is met should be governed
wholly by "alethic" reasons. However, it is arguable that what counts as good
or
strong enough theoretical reason for judging, and hence alethically
BELIEVING, that something is true is dependent on the context, such as what is
at
stake in believing that it is true. Intuitively, as far as ordinary
language goes (to stick with Grice's methodology), more is at stake at a trial
involving the death penalty than in a case of petty shop-lifting.
Accordingly, there should be stronger "alethic" justification for a finding of
guilt
in a trial involving the death penalty (Grice is killed) than in a case of
petty shop-lifting (a grice, an extinct Scottish pig, is stolen). The
conceptual-analytic literature on alethic contextualism and on
interest-relative
accounts of knowledge sd justified true belief can thus be drawn upon to
support a variant standard of legal "proof".

Behind this criticism to this type of conceptual analysis is that the
trier of fact has to make a finding on a disputed factual proposition based on
his alethic BELIEF in the proposition. This is contentious. It may be
argued that, as far as ordinary language goes, some beliefs are involuntary.
It
would seem that we cannot believe something by simply DECIDING to believe
it. The dominant view is that beliefs are context-independent. At any given
moment, we cannot believe something in one context and not believe it in
another. On the other hand, legal fact-finding involves choice and decision
making and it is dependent on the context. E.g. evidence that is strong
enough to justify a finding of fact in a civil case may not be strong enough
to
justify the same finding in a criminal case where the standard of "legal
proof" is higher. The fact-finder has to base his findings, allegedly, not
on what he believes but what he accepts. Belief and acceptance are what
Grice calls 'psychological attitudes". They are different psychological
attitudes that one can have in relation to a proposition. To *accept* that p is
to
have or adopt a policy of deeming, positing or postulating that p, i.e. of
including that proposition or rule among one’s premises for deciding what
to do or think in a particular context.

So perhaps we should go back to an axiomatic conceptual analysis of 'legal
proof'. Alas, understanding standards of "legal proof" in terms of
mathematical probabilities has been found to be controversial. It is said to
raise
a number of paradoxes. E.g. The defendant, Blue Bus Company, owns 75% of
the buses in the capital of Ruritania where the plaintiff was injured by a
recklessly driven bus and the remaining 25% is owned by The Red Bus Company.
No other evidence is presented. Leaving aside the reference class problem
there is a p = 0.75 that the accident was caused by a bus owned by the
defendant. On the probabilistic interpretation of the applicable standard of
"legal proof", i.e. the balance of probabilities, the evidence should be
sufficient to justify a verdict in the plaintiff’s favour. But lawyers seem to
think that the evidence is insufficient. The puzzle is why this is so.
Various attempts have been made to solve this puzzle. On one solution, the
statistical evidence -- the 75% ownership of buses -- is not CAUSALLY
CONNECTED
with the fact sought to be proved (the accident) and as such cannot
justify belief in or knowledge of the fact (vide Grice "The Causal Theory of
Perception"). But it is questionable that the court should aim at knowledge of
the disputed fact and not simply at accuracy in its finding. Another paradox
in the mathematical interpretation of the standard of "legal proof" is the
"conjunction paradox". To succeed in a civil claim or a criminal
prosecution, the plaintiff or the prosecution will have to PROVE the facts --
or
"elements," as legalese goes (legalese can be corpuscularianistic) -- that
constitute the civil claim or criminal charge that is before the court.
Imagine a claim under the law of negligence that rests on two elements: a
breach
of duty of care by the defendant (element A) and causation of harm to the
plaintiff (element B). To win the case, the plaintiff is legally required to
PROVE "A and B". Let "A and B" be mutually independent events. Suppose the
evidence establishes "A" to a p = 0.6 and "B" to a p = 0.7. On the
mathematical interpretation of the standard of "legal proof", the plaintiff
should
succeed in his claim since the probability with respect to each of the
elements exceeds 0.5. However, according to the multiplication rule of
conventional probability calculus, the probability that "A and B" are both
true is
the product of their respective probabilities. In this example, p s only
0.42 (= 0.6 x 0.7). Thus, the overall probability is greater that the
defendant deserves to win than that the plaintiff deserves to win and yet the
verdict is awarded in favour of the plaintiff. One way of avoiding this
"conjunction" paradox is to take the position that it should not be enough for
each "element" to cross the probabilistic threshold. The plaintiff or the
prosecution should win iff the probability of the plaintiff’s or prosecution’
s case as a whole exceeds the applicable probabilistic threshold. So, the
plaintiff should lose since the overall p < 0.5. But this suggested solution
may not satisfy all. The required level of overall probability would then
turn on how many "elements" the civil claim or criminal charge happens to
have. The greater the number of elements, the higher the level of probability
to which, on average, each of them must be PROVED. This is thought to be
arbitrary and hence objectionable. As commentators have noted, the legal
conceptual anaysis of "theft" contains more "elements" than the legal
conceptual analysis of "murder". Criminal law is not the same in all countries
--
never mind Ruritania. We may take the following as a convenient
approximation of what the law is in some countries, including Ruritania.

"X has murdered Y" iff (i) X's act caused the death of Y & (ii) that was
done with the intention of causing the death.

"X has robbed Y" iff (i) X intends to take Y's property & (ii) X is
dishones in taking the Y's property & (iii) X removes Y's property from Y's
possession & (iv) Y lacks consent

Since the conceptual analysis of the offence of theft contains twice the
number of "elements" (or necessary and sufficient conditions, or 'throngs',
to use Grice's jargon) as compared to the conceptual analysis of the offence
of murder, the individual elements for theft would have to be PROVED to a
much higher level of probability, in order for the probability of the
"conjunction" of (i) & (ii) & (iii) & (iv) to cross the overall threshold than
the individual elements for the much more serious crime of murder. This is
intuitively unacceptable, even for a thief, if not a murderer!

Fortunately, there is another conceptual analysis we can bring in to
resolve the "conjunction" paradox, which, admittedly, moves away from thinking
of the standard of "legal proof" as a quantified threshold of absolute
probability. We may analyse it, instead, as a probability ratio. The
fact-finder
has to compare the probability of the evidence adduced at the trial under
the plaintiff’s theory of the case with the probability of the evidence
under the defendant’s theory of the case (the two need not add to 1), and
award
the verdict to the side with a higher probability. One criticism of this
interpretation of the standard of "legal proof" is that it ignores, and does
not provide a basis for ignoring, the margin by which one probability
exceeds the other, and the difference in probability may vary significantly
for
different elements of the case.

But one may allege there is a deeper problem with the probabilistic
conception of the standard of "legal proof", and it is that there does not
seem
to be a satisfactory interpretation of probability that suits the forensic
context. The only plausible candidate is the subjective probability
according to which probability is construed as the strength of alethic belief.
The
evidence is sufficient to satisfy the legal standard of proof on a disputed
question of fact—for example, it is sufficient to justify the positive
finding of fact that the accused killed the victim—only if the fact-finder,
having considered the evidence, forms a sufficiently strong belief that the
accused killed the victim. Guidance on how to process evidence and form
beliefs can be found in a mathematical theorem known as Bayes’ theorem; it is
the method by which an ideal rational fact-finder would revise or update his
beliefs in the light of new evidence. To return to our earlier hypothetical
scenario, suppose the fact-finder initially believes the odds of the
accused being guilty is 1:1 (prior odds) or, putting this differently, that
there is a 0.5 probability of guilt. The fact-finder then receives evidence
that blood of type A was found at the scene of the crime and that the accused
has type A blood. 50% of the population has this blood type. On the
Bayesian approach, the posterior odds are calculated by multiplying the prior
odds
(1:1) by the likelihood ratio which is 2:1. The fact-finder’s belief in
the odds of guilt should now be revised to 2:1. The probability of guilt is
now increased to 0.67. The subjectivist Bayesian theory of legal
fact-finding has come under attack.

1) ascertainment of the likelihood ratios is highly problematic.
2) Bayesi's theory is not sensitive to the weight of evidence which,
roughly put, is the amount of evidence that is available.
3) While Bayes's theorem offers a method for updating probabilities in the
light of new evidence, it is silent on what the initial probability should
be. In a trial setting, the initial probability cannot be set at zero
since this means certainty in the innocence of the accused. No new evidence
can
then make any difference. Whatever the likelihood ratio of the evidence,
multiplying it by zero (the prior probability) will still end up with a
posterior probability of zero. On the other hand, starting with an initial
probability is also problematic. This is especially so in a criminal case. To
start a trial with some probability of guilt is to have the fact-finder
harbouring some initial belief that the accused is guilty and this is not easy
to reconcile with the presumption of innocence.The suggestion of starting
the trial with prior odds of 50:50 can and has be criticised.
4) we have thus far relied for ease of illustration on highly simplified—
and therefore unrealistic—examples. In real cases, there are normally
multiple and dependent items of evidence and the probabilities of all possible
conjunctions of these items, which are numerous, will have to be computed.
These computations are far too complex to be undertaken by human beings. The
impossibility of complying with the Bayesian model undermines its
prescriptive value.
5) Bayes's theory has it the wrong way round. What matters is not the
strength of the fact-finder’s belief itself. The standard of proof should be
understood instead in terms of what it is reasonable for the fact-finder to
believe in the light of the evidence presented, and this is a matter of the
degree to which the belief is warranted by the evidence. Evidence is
legally sufficient where it warrants the contested factual claim to the degree
required by law. Whether a factual claim is warranted by the evidence turns
on how strongly the evidence supports the claim, on how independently
secure the evidence is, and on how much of the relevant evidence is available
to
the fact-finder, i.e. , the comprehensiveness of the evidence. Some are
against identifying degrees of warrant with mathematical probabilities.
Degrees of warrant do not conform to the axioms of the standard probability
calculus. For instance, where the evidence is weak, neither p nor not-p may be
warranted; in contrast, the probability of p and the probability of not-p
must add up to 1. Further, where the probability of p and the probability of
q are both less than 1, the probability of p and q, being the product of
the probability of p and the probability of q, is less than the probability
of either. On the other hand, the degree of warrant for the conjunction of
p and q may be higher than the warrant for either. We can have a legal
application of a general theory of epistemology.
6) Research in experimental psychology suggests that fact-finders do not
evaluate pieces of evidence one-by-one and in the unidirectional manner
required under the mathematical model. A holistic approach is taken instead
where the discrete items of evidence are integrated into large cognitive
structures variously labelled as mental models, stories, narratives and
theories
of the case, and they are assessed globally against the legal definition
of the crime or civil claim that is in dispute. The reasoning -- vide Grice,
"Aspects of reason" -- does not progress linearly from evidence to a
conclusion; it is bi-directional, going forward and backward: as the
fact-finder’
s consideration of the evidence inclines him towards a particular verdict,
his leaning towards that conclusion will often produce a revision of his
original perception and his assessment of the evidence.

The holistic nature of evidential reasoning as revealed by these studies
has inspired alternative conceptual analyses that are of a non-mathematical
nature. One alternative, already mentioned, is the explanatory or relative
plausibility one. This analysis contends that fact-finders do not reason in
the fashion portrayed by the Bayesian model. Instead, they engage in
generating explanations or hypotheses on the available evidence by a process
of
abductive reasoning or drawing “inferences to the best explanation”, and
these competing explanations or hypotheses are compared in the light of the
evidence. The comparison is not of a hypothesis with the negation of that
hypothesis, where the probability of a hypothesis is compared with the
probability of its negation. Instead, the comparison is of one hypothesis with
one or more particular alternative hypotheses as advocated by a party or as
constructed by the fact-finder himself. On this approach, the plausibility
of X, the factual account of the case that establishes the accused’s guilt
or defendant’s liability, is compared with the plausibility of a hypothesis
Y, a specific alternative account that points to the accused’s innocence
or the defendant’s non-liability, and there may be more than one such
specific alternative account.

On this theory, the evidence is sufficient to satisfy the preponderance of
proof standard when the best-available hypothesis that explains the
evidence and the underlying events include all of the elements of the claim.
Thus, in a negligence case, the best-available hypothesis would have to
include
a breach of duty of care by the plaintiff and causation of harm to the
defendant as these are the elements that must be proved to succeed in the
legal claim. For the intermediate clear-and-convincing standard of legal
proof,
the best-available explanation must be substantially better than the
alternatives.To establish the standard of proof beyond reasonable doubt, there
must be a plausible explanation of the evidence that includes all of the
elements of the crime and, in addition, there must be no plausible explanation
that is consistent with innocence. Now, the relative plausibility theory
itself is perceived to have a number of shortcomings.
1) the theory portrays the assessment of plausibility as an exercise of
judgment that involves employment of various criteria such as coherence,
consistency, simplicity, consilience and more. However, the theory is sketchy
on the meaning of plausibility and the criteria for evaluating plausibility
are left largely unanalyzed.
2) Despite the purported utilisation of “inference to the best explanation”
reasoning, the verdict is not controlled by the best explanation. For
instance, even if the prosecution’s hypothesis is better than the defence’s
hypothesis, neither may be very good. In these circumstances, the court must
reject the prosecution’s hypothesis even though it is the best of
alternatives. One suggested mitigation of this criticism is to place some
demand on
the epistemic effort that the trier of fact must take (for example, by
being sufficiently diligent and thorough) in constructing the set of
hypotheses from which the best is to be chosen
3) While it may be descriptively true that fact-finders decide verdicts by
holistic evaluation of the plausibility of competing explanations,
hypotheses, narratives or factual theories that are generated from the
evidence,
such forms of reasoning may conceal bias and prejudice that stand greater
chances of exposure under a systematic approach such as Bayesian analysis. A
hypothesis constructed by the fact-finder may be shaped subconsciously by a
prejudicial generalisation or background belief about the accused based on
a certain feature, say, his race or sexual history. Individuating this
feature and subjecting it to Bayesian scrutiny has the desirable effect of
putting the generalisation or background belief under the spotlight and
forcing the fact-finder to confront the problem of prejudice.

But problems are fine, if legal philosophy's problems were all solved,
legal philosophy, understood as the conceptual analysis of legalese -- as this
legalese is akin to ordinary language (alla Grice or Hart) -- would be
dead!

Cheers,

Speranza

REFERENCES:

Grice, The causal theory of perception, Aristotelian Society.
Grice, Way of Words
Grice, Aspects of Reason: the John Locke Lectures, Oxford: Clarendon
Hart, The concept of law
Toulmin, Probability, in Flew, "Conceptual Analysis".

------------------------------------------------------------------
To change your Lit-Ideas settings (subscribe/unsub, vacation on/off,
digest on/off), visit www.andreas.com/faq-lit-ideas.html

[lit-ideas] Re: Grice v Grice

Other related posts: