(3)
Houses
in course oferection inUrban and Rural
Districts.Introduction xxxv
(4) Imbecility
and
Deaf-Mutism.xxxvi Tables for Statisticians and Biometricians [XVII — XX
and
thereforelog
P = 1739602,
and P =
3-179/10™.In thefirst
and
third casesadifferenttreatmentmust
be used.For ^ = 14393
we
use Table XII.We
have forn=
4:P =
-801253 +
-4393 [-228846] -
$ (-4393) (-5607)
[+
48064]=
-6948.Had we worked
from TableXVII by Formula
(i),we
should have hadP =
-6950.For x
2=
-7080,we
can use Table XII,remembering
that for^
2=
0,P =
l.We have
P =
1-000,000+
-708[-
198,747]-
} ('708) (-292)[-30,099]
=
-8624.Had we worked
from TableXVII by Formula
(i),we
should have hadP =
*865, closeenough
for practical purposes.The
true value ofP worked
fromIV27Tiv
•~ W * + ?S?
#~***)
by
using Table IIisP =
"8713.See
p. xxxviii.Examining
the values ofP we
see that having regardto the errors ofrandom sampling we
can only say that there isno
relationbetween
ruraland
urban districtsand
houses building or built; there is clearly no 'distinct association,' forin69
out of100
cases insampling
from independent materialwe
should getmore
highly associated results.There
is likewise no association on the given materialintheDatura
characters.The
other three cases have clearly verymarked
association, quite independent of
any
influence ofrandom
sampling. Ifwe
regardthese three tables the order of ascending association
judged by
either<f> orC
2is (4), (5), (2), as againstMr
Yule's(2), (4), (5). Ifwe
disregard the non-significanceand
take merely intensity of association,without regard torandom
sampling, the orderis (3), (4), (1), (5), (2), as againstMr
Yule's order(1), (2), (3), (4), (5).The
bestmethod
of inquiry at present for relative association in the case of four-fold tables is, I hold, first to investigateP and throw
out as not associated those cases like the 'Houses, builtand
building' above.Then
to use either"
tetrachoric rt" or
C
2 according aswe
are justified in considering the variates as continuous or not. rP (see p. xxxvii)may
be used as control.Tables XVIII— XX
(pp.31—32)
Tables
for
determining the Equiprobable Tetrachoric CorrelationrP. (Pearsonand
Bell:On
aNovel Method
ofregardingthe Association oftwo Variates classedXVIII— XX] Introduction xxxvu
solely in Alternate Categories. Draper's'
Company
Research Memoirs, Biometric Series,vm. Dulau &
Co.)We have
seenunder
the discussion of theprevious Tablehow
to find ameasure
of the improbabilityoftwo
variates being independent,when
theyare classed in alternate categories.The
difficulty in such cases is to appreciate the relative importance ofvery large inverse powers of10.The
object of thepresent tables is to enable us to deduce a tetrachoric correlation, rt, of which the improbability
is the
same
as that of the given system supposing it to arise,when
thetwo
variates have thesame
marginal frequencies but are reallyindependent. In order to do thiswe
have to determine <rr for the given marginal frequencies, i.e. the standard deviation ofrt on theassumption that risreally zero. Thismay
be easily found fromAbac Diagram XXI
or from TableXXIV
(see below). TableXVIII
then gives us the value of
(—
logP)
foreach value of rtand a>. Ifwe now
turn to our original tableand
calculate its rf, this aswe
have seen will correspond to a given(—
log P).We now make
the(—
logP)
from our^
2correspond to the(— logP)
from ourrtand
„a,.,this givesus a value of rtwhich
has thesame
degree of improbability as our observed table. In other words, instead of trying to appreciate themeaning
of inverse high powers of 10,we
say that a table of thesame
marginal frequencywould
be as improbable ifithad
a tetrachoric correlation rt arising fromrandom
sampling of independent variates.Thus we
read our improbability on a scale of tetrachoric correlation.We
useourcorrelation merely as a scale tomeasure
probability on.As log%
2 provides amore
satisfactory basis for interpolation,and
asmany
readers use logarithm tables
and
not calculators,log^
2will be the form inwhich
X
1will be often presented. TableXX
provides the value of rt corresponding to givena
rand
given log%
2.We
willassume
forthe present that „cr,.can be readilyfound from the marginal totals: see p. xli below.Illustration. Obtain the values of rP for the five tables given above on pp. xxxiv
—
v.The
valuesoflog^
2and
crrare as follows:xxxviii Tables for Statisticians and Biometriciaus [XVIII — XX
regard to the spacings of the correlation curves, the value of the equiprobable correlation is under "0:3, say '027. In other words no significant association can be asserted.
In the case of Oov
=
1941we
arethrown
back on the original formulae*. In the firstplace
we must
findP
for the given value of^
2, i.e. -7080
(see p. xxxv).But
forw'=
4 from formula (xxix),=
2 (-200,0578+
-280,0088x "84142}=
-871,3256.To
obtain rwe have
to use the formula below,where
Oov=
"1941,and
m—
A (•
—
8), the /*„, fit, /is being the
normal moment
functions of Table IX.W2irJ;
e X
P =
-^5=
[jMo
(V2«0 - &
(V2//H-)}- lj ^ (V2m) -
to(^2mr)j
-
sss-
!/"c(V ^"
}" ^
(V2Vir)1+ m&
!ms(V2
~'7t)~ *
(vi ™
r>5j (xxxii).Substituting the valuesof na,.
=
-1941and
V2wt=
4-852,107,we
have for r=03, P=
-90550,r
=
-04,P=
-86501.Whence
forP =
-87133,we have
r=
-038.We now
turn to the three cases which fall inside TableXX.
(2) Eye-colour, Father
and
Son.log
x
2=
2-1249 „<7,=
-0514,r
=
0-5log x"
-
2-0942n0-,
=
0o^
=
of)j^ ^ =
22748
r-06
logx
2=
2-1239^,-•06
5=0
.7 iog%2=
2-2935.Linear differences will suffice
„<r,
=
-05 r=
0-5+.^[1] =
0-517,o£r).
=
.OG r=
0-G+;^[-l] = 0G01.
Hence
Oo-r=
-0514 givesr
=
-517+
^x-084
=
•517+
-012=
-529.* Drapers' Company Research Memoirs. Biometrie Series VII.
"A
Novel Method," etc.: see pp. 12, 13.XVIII— XX] Introduction XXXIX
Interpolating for <7,. first,
,•
=
•5 „<r,.=-05U
logx"=
2-0737,r
=
-6 O0V=
-051-1- logx'=
2-2537.Hence
for log x~=
2-1249:•0512 ri, .__
r"
=
'O+
-1800 [
'
1]
=
"°28-We
conclude that the equiprobable correlationis -53.(4) Imbecility
and
Deaf-mutism.logx"
=
3-9039 cr,.=
-Ol75,r
=
0-95, Oo-,=
-01, logx
2= *3673
; „°v
=
"02,log
x
2= 37660.
Hence:
r=
0'J5, Oo-,=
-0175,log
x
2= 39163.
Again
:r
=
090, „ct,.=
-01,log
x
2=
-i-2207 ; „o-,.=
-02, log%
2=
3-6197.Hence: r-090,
<r,.=
-0175, logX
*=
3-7699.Interpolatinglogx'-
= 39039 between 39163 and 37699, we
find rP=
0-946.(5) Developmental Defects
and
Dullness.log
x
*=
3-5128, o-,= 0201.r
=
0-8,<r,.
-
-02,log
X
2=
3'4097; <x,.
=
03,logtf
=
3-0598.Hence
: „o-,=
0201,log
X
2=
3"4062.r
=
0<), ,«r-"02, logx
2=
3-6197; O o-,.=
-03,log %=
=
3-2690.Hence
: logtf=
3-6162, for„cr,.=
-0201.Thus,
by
interpolatinglogx
s=
3-5128between
3'4062and
3-6162,we
findr>=-851.
We
have accordingly the followingresults:xl
Tables for Statisticians and Biometrieians [XXIII — XXIV
p.xxxiv.
Both
rPand Q
give veryhigh results for (4)and
(5),and thisis inaccord- ance with the view elsewhere expressed that forextreme
dichotomiesQ
is not tobe trusted. It