Directory UMM :Data Elmu:jurnal:M:Mathematical Biosciences:Vol163.Issue1.Jan2000:

(1)

Incorporating observability thresholds of tumors into the

two-stage carcinogenesis model

Marjo V. Smith

a,*

, Christopher J. Portier

b a

Analytical Sciences, Inc., Suite 200, 2605 Meridian Parkway, Durham, NC 27713, USA b

National Institute of Environmental Health Sciences, Laboratory of Computational Biology and Risk Assessment, P.O. Box 12233, Research Triangle Park, NC 27709, USA

Received 19 October 1998; received in revised form 18 June 1999; accepted 30 July 1999

Abstract

This paper discusses a general way of incorporating the growth kinetics of malignant tumors with the two-stage carcinogenesis model. The model is presented using time-homogeneous rate parameters. In that case, the dierential equations comprising the model are straightforward to solve using standard numerical techniques and software. An extension of the method to time-dependent rate parameters is included in Appendix A. Allowing the rate parameters to be time-dependent does incur computational cost. An ex-pression is given for the expected time without visible tumor, a generalization of the expected time to an observable tumor that includes the possibility of tumor regression. The model is illustrated using incidental liver tumor data in control rats from NTP rodent carcinogenicity studies, using linear birth±death kinetics of tumors combined with a non-absorbing detection limit. The approach is also shown to be potentially useful with tumor observability thresholds having more complicated features. Ó _{2000 Published by}

Keywords:Rodent tumor analysis; Non-homogeneous coecients; Backward Kolmogorov equation

1. Introduction

The two-stage model of carcinogenesis developed by Moolgavkar and Venson [1] and Moo-lgavkar and Knudson [2] models the progression of normal cells to malignant cells via two stages. The ®rst stage follows an alteration of normal cells to precancerous cells or stage-one cells. The second stage follows the transformation of stage-one cells to malignant cells. The normal cell

*_{Corresponding author. Tel. +1-919 544 8500, ext.165; fax: +1-919 544 7507.} E-mail address:[email protected] (M.V. Smith).

(2)

population may be modeled as constant or as a linear birth and death process. The stage-one cell population is normally modeled as a linear birth and death process. Original applications of the model dealt with the malignant growth subsequent to the ®rst appearance of a malignant cell in a deterministic fashion, assuming a non-negative constant time interval to a tumor of observable size. Later work has called for more stochastic descriptions of the malignant growth process [3±5].

Exactly how the observability threshold should be modeled may be expected to dier with tumor and circumstance. Tsodikov et al. [5] considered the progression of human breast cancer tumors to observable size. As human tumors are removed soon after they are observed, it made sense to model the tumor size threshold of observability as an absorbing, random barrier to the pure birth process of tumor progression. The NTP rodent carcinogenesis bioassays provide a very dierent situation. As part of experimental protocol, some rats are sacri®ced before the experi-ment is completed, some die due to the presence of large tumors that may or may not be of the tumor type of interest, while others die accidentally. The studies last about two years, whereupon the surviving rats are sacri®ced. Regardless of how the rats died, tissues are sectioned and ex-amined microscopically for the presence of tumors. For some tumor types, e.g. liver tumors in rats, the appearance of malignant clones is suciently late that presence of the tumor may be assumed incidental to death. Since the tissues can only be examined after death, a tumor may have reached observable size some time before it is actually observed. It is possible that immune system response or increased apoptosis could shrink an observable tumor to below the observability threshold before death and tissue examination.

For tumor data of this type, the observability threshold may be assumed approximately con-stant but not absorbing, as the process does not stop when the microscopic observability threshold is reached. Slow growth of the tumors means that the time between the appearance of the ®rst malignant cell and a tumor of observable size may still have an appreciable impact, even though the microscopic observability threshold is much smaller than that for human cancers. The question of incorporating an observability threshold into models of this type has been addressed before [6], but the forward Kolmogorov equations (leading to partial dierential equations) or ®ltered Poisson processes [7] were used. The theoretical development of these methods is attractive but not easily implemented using standard software.

In this paper, an approach initiated by Portier et al. [8] is further developed to extend the standard two-stage carcinogenesis model to include a malignant growth process. The equations for the model are developed using `®rst step analysis' (see e.g. [9, pp. 79±88]), that also corre-sponds to the backward Kolmogorov equations. The formulation is ¯exible but simple, especially in the time-homogeneous case, and may be used with other models of observability thresholds (See Section 4). The backward Kolmogorov equations can be developed for time-dependent rate parameters as well [10,11], although there is a computational cost. The extension of the model developed in this paper to time-dependent rate parameters is given in Appendix A.

(3)

rats [13] is used to provide parameter values for the ®rst stages of the model. The paper closes with a discussion about using the equations with observability thresholds of dierent types.

2. The model

2.1. General equations

Mathematically the two-stage model may be thought of as linked birth and death processes. The populations of normal cells and initiated or stage-one cells may each be assumed to be birth and death processes. Each malignant cell will initiate another process, describing the growth of that tumor. The system is assumed to be time-homogenous with cells acting independently. In

accordance with the usual development, in a small period of time_Dtfollowing any timet, only the

following events are assumed possible at the following probabilities:

For the growth process of the tumors, we will use:

The tumor is considered observable if it containsNor more cells. The following probabilities

are then de®ned:

W0(t)Pr(no tumor is visible attj1 normal cell, no stage-one cells, no malignant cells at 0). W1(t)Pr(no tumor is visible attjno normal cells, 1 stage-one cell, no malignant cells at 0). W2(t)Pr(no tumor is visible attjno normal cells, no stage-one cells, 1 malignant cell at 0).

If we can ®nd an expression for W0(t), then by the assumption of independence of cells, the

probability of a tumor being visible at timetcan be written as 1ÿW0tM;where the system starts

withM normal cells at time 0.

The strategy for computing W0(t) is to use the above assumptions to derive a system of two

dierential equations, havingW0(t) as part of the solution. The technique, ®rst step analysis [9, pp.

79±88], is used to derive the dierential equations over the interval 0;t. Dividing the interval

0;t_Dt into the two subintervals 0;_Dt and _Dt;t_Dt allows using the conditional in the

Dtb2 birth of a malignant cell

Dtd2 death of a malignant cell

Dtb₀ birth of a normal cell from one normal cell

Dtd0 death or dierentiation of a normal cell

Dtl₁ formation of a stage-one cell during a normal cell division

1ÿDt b₀d0l1 nothing happens to a normal cell

Dtb1 birth of a stage-one cell

Dtd1 death or dierentiation of a stage-one cell

Dtl₂ formation of a malignant cell during a stage-one cell division

(4)

de®nition ofW0(t) (set at time 0) together with the above assumptions over the interval0;Dt. The

solution over the remaining intervalDt;tDtmay be written in terms of the solutions over0;t

using time-homogeneity, since these two intervals have the same length. The backward Kolmogorov equations are derived using this same technique.

Beginning with a single normal cell at time 0, only four events are possible over the interval

0;_Dt: (1) nothing might happen, so that there is only one normal cell at time_Dt, (2) the normal cell might divide into two normal cells, (3) the normal cell may die, or (4) the normal cell may divide into one normal and one stage-one cell. W0tDtmay be written as follows:

W0tDt Prno tumor is visible att

_Dtj1 normal cell; no stage-one cells; no malignant cells at 0 Prno tumor is visible att

Dtj1 normal cell; no stage-one cells; no malignant cells at_Dt Pr1 normal cell; no stage-one cells; no malignant cells at

Dtj1 normal cell; no stage-one cells; no malignant cells at 0 Prno tumor is visible att

_Dtj2 normal cells; no stage-one cell; no malignant cell at_Dt Pr2 normal cells; no stage-one cell; no malignant cell at

Dtjno normal cells; no stage-one cells; no malignant cells at_Dt Prno normal cells; no stage-one cells; no malignant cells at

_Dtj1 normal cell; 1 stage-one cell; no malignant cells at_Dt Pr1 normal cell; 1 stage-one cell; no malignant cells at_Dt Prno tumor is visible att

_Dtj1 normal cell; no stage-one cells; no malignant cells at_Dt 1ÿ_Dtb0

ÿDtd0ÿDtl1 Prno tumor is visible att

_Dtj2 normal cells; no stage-one cell; no malignant cell at_Dt _Dtb₀

Prno tumor is visible att

Dtjno normal cells; no stage-one cells; no malignant cells at_Dt Dtd0

Prno tumor is visible att

Dtj1 normal cell; 1 stage-one cell; no malignant cells at_Dt Dtl₁:

Using the above de®nitions forW0(t) andW1(t) and the assumptions of time homogeneity and

independence of cells,

W0tDt W0t 1ÿDtb0ÿDtd0ÿDtl1 W 2

0t Dtb01Dtd0W0

(5)

By subtractingW0(t) from both sides, taking the limit asDtgoes to zero, and rearranging terms,

dW0t=dtW02t b0ÿ l11ÿW1t b0d0 W0t d0: 1

In a similar way, the corresponding equation for W1t is

dW1t=dtW12t b1ÿ l21ÿW2t b1d1 W1t d1: 2

Clearly from the de®nitions,W00W10 1.

These same equations (with W2t 0), were derived by Portier et al. [8], using probability

generating functions.

2.2. Malignant growth kinetics

In order to solve the above dierential equations for W0(t), an expression for 1ÿW2t, the

Prseeing a tumor attj1 malignant cell at 0is needed. Note that up to this time, no assumptions about the growth kinetics of the tumor, other than time-homogeneity have been made. Each malignant cell is now assumed to progress as a time-homogeneous linear birth and death process. Bailey ([14, p. 94]) gives a closed form expression for the density of a birth±death process at timet, given a single individual at time t, for the case that b₂ 6d2. In terms of the parameters de®ned

above fornP1,

Prtumor hasncells attj1 malignant cell at 0 1ÿA1ÿBBnÿ1_{, where}

Ad2e

b2ÿd2tÿ₁

b2eb2ÿd2tÿd2

and

Bb2e

b2ÿd2tÿ₁

b₂eb2ÿd2tÿd₂ :

The Prseeing a tumor attj1 malignant cell at 0 is then

X1

nN

Prtumor of ncells attj1 malignant cell at 0 1ÿA1ÿBX

1

nN Bnÿ1:

Since forb2>d2, the expressionBremains less than 1 for all positivet, the formula for geometric

series can be used to write

Prseeing a tumor attj1 malignant cell at 0 b2ÿd2b

Nÿ1

2 1ÿeÿb2ÿd2t Nÿ1

b₂ÿd2eÿb2ÿd2t

N :

Note that if b2ÿd2 and the observation times, t, are large, then the quantity expÿb2ÿd2t

may be negligible. Then the probability of seeing a tumor at t can be approximated by

b₂ÿd2=b2. In that case, the model assumption of instant observability of a tumor at the creation

of a single malignant cell, would be a good approximation (as expected). The mutation rate l2

would simply include the factor, b₂ÿd2=b2. Note that the closer d2 is to b2, the bigger the

(6)

For the limiting case thatb2 d2, Bailey gives

Prtumor hasncells attj1 malignant cell at 0 b2t

nÿ1

1b₂tn1; for n P1:

As before,

Prseeing a tumor attj1 malignant cell at time 0

X

1

nN

Prtumor hasncells attj1 malignant cell at 0 b2t

Nÿ1

1b₂tN:

For large t, this quantity approaches 0.

Substituting either of the above expressions for 1ÿW2t; would allow solving forW0(t), the

probability of not seeing a tumor from a single normal cell. Again, assuming the experiment starts withMnormal cells at time 0, the probability of observing a tumor at timetwould be 1ÿW0tM:

If the number of normal cells can be assumed to be approximately constantb₀d00, then

Eq. (1) reduces to a separable equation that can be solved in terms of an integral ofW1(t). In this

case, only the product of the number of normal cells,M, and the mutation rate for a single cell,l1,

is identi®able. Thus Eq. (1) may be written as

dW0t=dt ÿMl11ÿW1t W0t: 3

Since the solution to this equation already includes the exponent,M, the solutionW0(t) would be

Prno visible tumor attjM normal cells; no stage-one cells; no malignant cells at 0. The proba-bility of observing a tumor at time t would be 1ÿW0t.

2.3. Average time to observable tumor

In the special case that malignant growth is irreversible or the observability threshold is

absorbing, a random variable, T can be de®ned as the time a malignant tumor ®rst

be-comes visible. In that case, W0tM Prno tumor is visible by timetjMnormal cells; no

stage-one cells; no malignant cells at 0 can be written as PrT >tjMnormal cells; no stage-one cells; no malignant cells at 0. Thus 1ÿW0tM is the distribution function of Tand the expected value

ofT can be written as

ET

Z 1

0

W0tMdt; provided the infinite integral exists:

In the case of reversible growth kinetics without an absorbing threshold, the time to tumor observability becomes less easily de®ned, as it is possible that a tumor may temporarily regress in size from larger than the observability threshold to smaller than the observability threshold. In that case, the random process,

Vt 0 if there is a visible tumor att

1 if there is no visible tumor att

may be de®ned. Then, PrVt 1 W0t M

. Now consider any ®nite time interval,0;Tf. Let TvTf RTf

(7)

visible tumor. Then the average amount of time in any given ®nite interval,0;Tfwithout visible

The average total time without tumor may be found by taking the limit as Tf goes to in®nity, provided the limit exists. In that case,

ETv lim

The methodology is illustrated by modeling liver tumors in male F344 rats. Historical control data sets from 60 NTP long-term carcinogenicity studies were combined to produce a tumor incidence data set for 3514 male F344 rats of which 86 had tumors of this type. The NTP car-cinogenicity studies only count the animals in which at least one tumor of a given type is observed. Thus no information on multiple tumors, tumor size or tumor lethality was available. Animals were regularly sacri®ced throughout the studies. The earliest death occurred at 11 days; the last death occurred at 736 days. After death, the liver tissue was sectioned and examined under a microscope for tumors. A tumor may be visible at approximately 50 cells, but only recognizable as a malignant tumor at several thousand cells. The term `observable' in this paper refers to being observable (and recognized) as a malignant tumor. The ®rst tumor was observed in a rat that died on the 457th day. Thus there is a very low incidence of liver tumors in this data set, with no early tumors observed.

Rat liver tumors are preceded by hepatic lesions, which may be identi®ed with stage-one cells. Precancerous foci are relatively common in rat livers, while malignant tumors are relatively rare. Parameter values cited by a hepatic lesion simulation study [13] were used for the ®rst part of the two-stage model, and the NTP incidence data used only to estimate the parameters involved with malignant growth. In this example, the number of normal liver cells is assumed to be approximately constant and suciently large that stochastic ¯uctuations are not important. In deriving a mu-tation rate ofl1mutations per day, the normal cell division rate is assumed to be about the same as

the cited normal cell death rate, 1:210ÿ3 _{divisions/cell/day. The mutation rate per normal cell}

division was given as 3:510ÿ8_{mutations per division, so that the mutation rate of a normal cell is}

3:510ÿ8_{mutations=division}_1:2₁₀ÿ3_{divisions=day}_4:2₁₀ÿ11_{mutations/day.}

Carthew et al. [15] give 15:2108_{as the number of normal hepatic cells, so that Eq. (3) is used}

with Ml₁ 0:06384 mutations of normal cells/day. Finally, Conolly and Kimball [13] give the

(8)

3.2. Estimation procedure

The observability thresholdN, the second mutation ratel1, and the malignant birth and death

ratesb₂andd2were estimated by the maximum likelihood method. No mortality was modeled for

the animals themselves. In particular, it was assumed that none of the liver tumors were instru-mental in the death of an animal. The low overall incidence and especially the late onset of the liver tumors in this data set make this assumption plausible. In cases where this assumption would not be reasonable a more elaborate likelihood expression would have to be used (see e.g. [16]).

With the above assumption, no distinction between natural deaths and sacri®ces needs to be made, and the setfti; i1;. . .;mg, may denote the days on which at least one rat died, either by

sacri®ce or naturally. Letcianddirespectively refer to the numbers of animals that died on dayti,

having or not having observable liver tumors. For any given set of parameter values, Eqs. (2) and (3) can be numerically solved by any dierential equation solver. For this paper the estimation was done using Fortran IMSL subroutines, with Matlab used to check the results [17,18]. Thus

W0ti Prno tumor is visible by timetjMnormal cells; no stage-one or malignant cells at 0 can

be found for anyi1;. . .;m. The likelihood expression may then be written as

Ym

i1

W0tidi1ÿW0tici;

though the logarithm of this expression was actually maximized. The IMSL routine, DIVPAG was used to solve the dierential equations and the routine, DBCONF, was used to optimize the loglikelihood.

The data was ®t to the model by ®xing the ratio,d2/b2, successively at 0.0, 0.1, 0.3, 0.5, 0.7, 0.9,

0.95, 1.0 and the observability threshold, N, at 1, 5, 15, 50, 100, 500, 1000, and 10 000 cells. The

maximum likelihood estimates for b2 and l2 were found for each such combination.

3.3. Results

The values of the loglikelihood expressions are given in Table 1. The values shown do not dier statistically from each other, but do vary in a consistent way across the combinations of death±

birth ratios and observability thresholds. Corresponding estimated values for b2 and l2 are

presented in Tables 2 and 3.

Examination of Table 1, shows the optimization surface to be very shallow, with no relative maximum. The relative ¯atness of the loglikelihood surface indicates some identi®ability or es-timability problems. Both numerical and theoretical non-identi®ability were checked in the fol-lowing way. First, the numerical integration was carefully checked by two dierent dierential equation packages. By adjusting tolerance levels, the solutions to the dierential equations could be made to agree on 7±10 signi®cant digits. Less accurate tolerance levels led to only minor dierences in the table entries and no qualitative dierences.

The con®dence in the numerical solutions to the dierential equations meant that these equations could be used to check theoretical non-identi®ability. The probability of observing a tumor was computed using the equations, using the estimated parameter values from Tables 2 and

(9)

Table 3

Mutation rates to malignant cells times 107a

Minimum cells in observable clones

Ratios of death±birth rates of malignant cells,d2=b2

0.0 0.1 0.3 0.5 0.7 0.9 0.95 1.0 50 9.0 10.1 12.9 18.0 29.9 88.2 174.3 1206 100 9.1 10.1 13.0 18.1 30.1 89.3 176.5 2432 500 9.1 10.1 13.0 18.2 30.3 90.6 180.3 12 240 1000 9.1 10.1 13.0 18.2 30.4 90.8 181.1 24 510 10 000 9.1 10.1 13.0 18.3 30.4 90.8 182.3 245 100

a

The mutation rates to malignant cells increase sharply with increasing death rates. In the case that a single cell is observable, the mutation rate may equal either 4:9110ÿ7_or₄_:₉₁₌₁_ÿ_d

2=b2 10ÿ7corresponding to two dierent

solutions with very large or small malignant birth rates, respectively. See text. Table 2

Birth rates of malignant cells times 100a

0.0 0.1 0.3 0.5 0.7 0.9 0.95 1.0 1 N/A Large or Large or Large or Large or Large or Large or Small

small small small small small small

5 1.89 1.97 2.23 2.47 2.41 1.64 1.48 1.34 15 2.92 3.18 3.76 4.59 6.12 7.93 6.51 5.04 50 4.02 4.37 5.34 6.87 9.88 19.59 26.20 18.01 100 4.68 5.10 6.21 8.13 11.99 25.97 39.26 36.53 500 6.18 6.77 8.35 11.07 16.89 40.63 68.72 184.75 1000 6.82 7.48 9.27 12.36 18.95 46.98 81.34 369.74 10 000 8.95 9.70 12.31 16.48 26.03 68.43 123.25 370.73

a_{The single cell observability threshold allows two solutions with equal likelihood values corresponding to unnaturally}

large or small malignant cell birth rates. See text. Table 1

Loglikelihood values plus 391a

0.0 0.1 0.3 0.5 0.7 0.9 0.95 1.0

a_{The loglikelihood values re¯ect a loglikelihood surface without a relative maximum. Instead the loglikelihood}

(10)

proportion of tumor bearing animals could then be computed at a set of time points. This arti®cial data set was then used exactly as the data set of the illustrative example to estimate the parameter values by optimizing the loglikelihood. A shallow likelihood surfaces similar to the one shown in Table 1 was constructed, but with a clear relative maximum at the parameter values used to construct the surface. Under true theoretical non-identi®ability, such a relative maximum would not likely appear at the right spot. Thus there does not seem to be true theoretical non-identi®-ability.

In the absence of numerical or theoretical non-identi®ability, it seems likely that the very shallow likelihood surface found was generated by the particular pattern of the data set used to compute that likelihood. The data set used had very few tumor bearing animals and most of those there were detected quite late. Such a pattern of observed tumors could be explained either by higher or lower detection thresholds, with corresponding and partially compensating malignant birth rates.

Examination of the results in Tables 1±3, shows the likelihood surface increases with increasing threshold size. For a given threshold size the likelihood increases for both smaller (towards 0.0) and larger (towards 1.0) ratios of malignant death±birth rates (d2=b2). However, the larger values

ofd2=b2 require much higher (non-biological) birth and mutation rates for threshold sizes above

100 cells (See Tables 2 and 3). For the lower values of d2=b2, the birth and mutation rates stay

plausible for much higher observation thresholds. In fact, some of the birth rates and mutation rates corresponding to values ofd2=b2less than 0.5 for an observation threshold of 10 000 cells are

seen as still plausible. However, there is no longer any distinction in likelihood between the values ofd2=b2. These results are reasonable, given that tumors are microscopically identi®ed at several

thousand cells. The indication is that the death±birth ratio is relatively small, based on the plausibility of values in Table 2.

The case that a single cell is observable and the death rate is zero (ratio,d2=b2 0) is equivalent

to the original model with a zero length time interval between the ®rst malignant cell and the observability threshold. In that case, the expression for the probability of an observable tumor at t, given a single malignant cell at 0, reduces to 1. The model is then independent of the birth rate

b2. In that case, the optimal estimated value for the mutation rate to malignancy is 4:9110ÿ7/

cell/day.

The case of a reversible observability threshold of one cell is more complicated. Note that in this casel₂1ÿW2tmay be written as

l₂ 1ÿf

1ÿf eÿb21ÿft;

wheref d2=b2 and is assumed to be ®xed at one of the levels of Table 1. The assumption that a

single malignant cell is equivalent to an observable tumor assumes a very aggressive tumor with a

very high birth rate. Fitting the model to the data under the condition thatN 1, results in very

high (in fact unreasonable) estimates of b₂, so that the expression l₂1ÿW2tcan be

approx-imated by l₂1ÿf. The estimate of this quantity is 4:9110ÿ7 _{mutation/cell/day, as in the}

original model described above. For each f, the corresponding estimates for l₂ are

4:9110ÿ7₌₁_ÿ_f _{mutations/cell/day, as shown in Table 3. Mathematically equivalent, and}

(11)

then consistently estimated as 4:9110ÿ7 _{mutation/cell/day as shown in Table 1. The two}

so-lutions are the result of ®tting the model under the condition thatN1 and that the death±birth

ratio is ®xed.

The expected time without visible tumor within the duration of the experiment (736 days) was about 730 days for all combinations of threshold size and values ofd2=b2. The expected time

without visible tumor overall was about 1300 days for all cases, much longer than the life of the animal, and a reasonable result from a data set with so few tumor bearing animals and no modeled mortality.

4. Discussion

In this paper, a framework of two dierential equations with a solution that leads to the direct computation of the probability of observing a tumor at a given time has been presented for the time-homogeneous case. Using time-independent rate parameters is clearly not ideal biologically, but does signi®cantly simplify the numerical computation of the model, and thereby also its application to real data sets. Furthermore, frequently not enough is known about the rate pa-rameters to adequately model them as functions of time. For these reasons, the same model for the case of time-dependent rate parameters has been developed in Appendix A only. The equa-tions derived in Appendix A are very similar to those presented in the text, but are not easily numerically integrated except in one (quite useful) special case. In addition the equations must be numerically integrated backwards, from each observation time point back to zero, increasing the computational burden of the method.

The equations of the model presented in the text may be used with other types of observability threshold and even other growth kinetics of tumors. The only requirement is that an expression be

found for the probability of observing a tumor at timet, given one malignant cell at time 0. The

equations work well with the usual assumption that the tumor growth follows a linear birth and death process. Under those conditions, and with the assumption of fast growing tumors, the traditional two-stage model is recovered.

In the text tumor growth and observability were assumed to follow a linear birth and death process with a non-absorbing threshold. This allowed the use of a formula for the probability

density of tumor size to ®nd an expression for 1ÿW2t. Combining reversible tumor kinetics

with an absorbing observability threshold is much more dicult. Saaty [19] can be used to derive

an approximate expression for 1ÿW2t for very large absorbing thresholds. The expression

b₂

is very similar to that for the non-absorbing threshold case, adding only the third term in the ®rst factor. For absorbing observability thresholds of more moderate size, Sherman and Portier [20], using the probability generating function approach to deriving the equations, append a system of equations corresponding to malignant clones of increasing size to the above derived system.

(12)

used. The microscopic tissue examination could detect tumors with a moderate number of cells, a

non-absorbing observability threshold say N. Tumors that aect the mortality of the rodent

would typically be much larger, with an absorbing endpoint of sayMcells. IfMis large enough,

Saaty [19] gives indications for an approximation to the probability density of tumor size that

could be summed fromNtoM in a similar way to the example in this paper. In such a case, the

likelihood expression would also need to be changed to re¯ect the possibility of a tumor aecting mortality [16].

Note that if the growth kinetics of the tumor are approximated by a pure birth process (d20),

that there is no distinction between an absorbing barrier and a non-absorbing one. Thus the observability threshold of human breast cancer described as an absorbing but random barrier by Tsodikov et al. [5], can also be described using the equations in this paper. In the Tsodikov paper,

the number of cells in an observable tumor is written asN cv, wherevis the volume of a tumor

andcthe concentration of tumor cells per unit volume. The authors heldcconstant and allowed

the observable volume,v, to be random with a density ofpv. An expression for the probability of

observing a tumor at time t can therefore be found by substituting the expression

R1 0 1ÿe

ÿb2t

cvÿ1pvdv for 1ÿW2t. Similar randomizations can be applied to the other

observability thresholds.

Appendix A

The backward Kolmogorov approach can be used to deal with time-dependent rate parameters, as was made clear in [10]. The diculty is that in developing the backward equations with time-dependent rates, the ®nal, right-hand endpoint of the time interval must be kept constant, so that the equations are written in terms of the left hand endpoint, or initial time point. Thus in [11], Eq. (5) is written in terms ofs, the left-hand end point. The cost of using this method then, is that the entire system of dierential equations must be numerically solved backwards from each ®nal time point of interest to the initial time.

The above approach is applied to the problem of the text. If 06s6t, the functions of the text may be slightly rede®ned as follows:

W0(s,t)Pr(no tumor visible at tjno normal cells, no stage-one cell, no malignant cell at s). W1(s,t)Pr(no tumor is visible attjno normal cells, 1 stage-one cell, 1 malignant cell ats). W2(s,t)Pr(no tumor visible at tj1 normal cell, no stage-one cell, no malignant cell ats).

Then by the above de®nition,

W0sÿDs;t

Prno tumor is visible attj1 normal cell; no stage-one or malignant cells atsÿDs Prno tumor visible attj1 normal cell; no stage-one or malignant cells ats

Pr1 normal cell; no stage-one or malignant cells atsj1 normal cell; no stage-one or malignant cells atsÿDs

Prno tumor visible attj2 normal cells; no stage-one or malignant cells ats

Pr2 normal cells; no stage-one or malignant cells atsj1 normal cell; no stage-one or malignant cells atsÿDs

(13)

Prno normal cells; stage-one; or malignant cells atsj1 normal cell; no stage-one or

malignant cells atsÿ_Ds

Prno tumor visible attj1 normal cell; 1 stage-one cell; no malignant cells ats Pr1 normal; 1 stage-1; no malignant cells atsj1 normal; no stage-one or

malignant cells atsÿ_Ds

Prno tumor visible attj1 normal; no stage-one or malignant cells ats 1ÿb₀sÿDs Dsÿd0sÿDs Dsÿl1sÿDs Ds

Prno tumor visible attj2 normal; no stage-one or malignant cells ats b₀sÿ_Ds _Ds

Prno tumor visible attjno normal; stage-one; or malignant cells ats d0sÿDs Ds

Prno tumor visible attj1 normal cell; 1 stage-one cell; no malignant cell ats

l₁sÿDs Ds:

Using only the assumption of independence of cells, the right-hand side may be written as

W0s;t 1ÿb0sÿDs Dsÿd0sÿDs Dsÿl1sÿDs Ds W 2

0s;t b0sÿDs Ds

d0sÿDs DsW0s;t W1s;t l1sÿDs Ds:

Similar to the development in the text then, the terms of the equation can be re-arranged so that

W0s

ÿ_Ds;t ÿW0s;t=Ds ÿb0sÿDs ÿd0sÿDs ÿl1sÿDs W0s;t W20s;t

b₀sÿ_Ds d0sÿDs W0s;t W1s;t l1sÿDs

As usual,_Dsapproaches 0 from above on both sides of the equation, and with the assumption that

the rate parameters are left continuous, the following ordinary dierential equation is formed:

dW0s;t=ds ÿW2₀s;t b0s l1s 1ÿW1s;t b0s d0s W0s;t ÿd0s;t:

Note that the derivative is taken at the left-hand (that is the initial) time point. Clearly, there is now no a priori information at the point,s0, sinceW00;tis the solution. Instead, there is a

boundary condition at the right-hand endpoint, since from the de®nition, W0t;t; 1. The

following dierential equation forW1s;t is derived in the same way:

dW1s;t=ds ÿW12s;t b1s l2s 1ÿW2s;t b1s d1s W1s;t

ÿd1s;t withW1t;t 1:

The development of an expression for 1ÿW2s;t, the Pr(at least one visible tumor at tj1

ma-lignant cell ats) is completely similar to the development in the text, but with expressionsAandB that are now functions ofsas well ast. Adapting Bailey [14, p. 110] and further,As;tandBs;t

can be written in terms of the integral, Is;t, where

Is;t

Z t

s

b₂uexp

Z t

u

b₂sd2sds

(14)

Then

As;t 1ÿexpf

Rt

sb2u ÿd2udug

1Is;t

and Bs;t Is;t=1Is;t. As in the main text then, the Pr(at least one visible tumor attj1 malignant cell ats) is

X1

nN

Prtumor of n cells at tj1 malignant cell at 0 1ÿA1ÿBX

1

nN Bnÿ1:

Bs;tis necessarily less than 1 ift>s, so that the geometric series formula may again be applied. Finally for the case of time-dependent rate parameters,

Pra visible tumor attj1 malignant cell ats expf

Rt

sb2u ÿd2udug

1Is;t

Is;t

1Is;t

Nÿ1

1ÿW2s;t:

The two equations derived above must again be solved numerically. Most software used to solve dierential equations allows stepping backwards from the right-hand endpoint (that has the boundary condition) to the left-hand endpoint. Frequently this is done by labeling the right-hand endpoint the `initial point' and the left-hand endpoint, the `®nal point'. In other words, for the software, the `initial' point is larger than the `®nal' point.

One ®nal numerical problem remains, and that is evaluating the integralIs;t. Although there are numerical quadrature programs and subroutines, they are not always easily used in con-junction with numerical solving routines for dierential equations. Hence this approach is only recommended in the case that the integralIs;t admits of an analytical solution. Note that the integral does have such an analytical solution whenever the following two conditions hold: 1. The malignant cell birth rate is analytically integrable and

2. the ratio of the malignant cell death rate to the malignant cell birth rate is piecewise constant. In the above special case, the two dierential equations may be written explicitly and solved using only a numerical ordinary dierential equation solver.

Note that the special case can be quite useful. The malignant cell birth rate can be any kind of polynomial or simple trigonometric function, for example. The ratio of malignant birth rate to death rate needs to be only piecewise constant, so that the ratio itself may change at intervals. Also there are no conditions beyond left continuity on any rate parameters other than those governing malignant growth. In fact, these conditions include most actual applications of time-dependent rate parameters found in the literature (e.g., [7]).

References

[1] S. Moolgavkar, D. Venzon, Two-event models for carcinogenesis: incidence curves for childhood and adult tumors, Math. Biosci. 47 (1979) 55.

(15)

[3] A. Kopp-Schneider, C. Portier, Carcinoma formation in MNRI mouse skin painting studies is a process suggesting greater than two stages, Carcinogenesis 16 (1995) 53.

[4] E.G. Luebeck, S.H. Moolgavkar, Simulating the process of malignant transformation, Math. Biosci. 123 (1994) 127.

[5] A.D. Tsodikov, B. Asselain, A.Y. Yakovlev, A distribution of tumor size at detection: an application to breast cancer data, Biometrics 53 (1997) 1495.

[6] A. Dewanji, S.H. Moolgavkar, E.G. Luebeck, Two-mutation model for carcinogenesis: joint analysis of premalignant and malignant lesions, Math. Biosci. 104 (1991) 97.

[7] S.H. Moolgavkar, E.G. Luebeck, Two-event model for carcinogenesis, mathematical and statistical considerations, Risk Anal. 10 (1990) 323.

[8] C.J. Portier, A. Kopp-Schneider, C.D. Sherman, Calculating tumor incidence rates in stochastic models of carcinogenesis, Math. Biosci. 135 (1996) 129.

[9] H.M. Taylor, S. Karlin, An Introduction to Stochastic Modeling, Academic Press, Orlando, FL, 1984, p. 79. [10] W. Feller, An Introduction to Probability Theory and Its Applications Volume I, Wiley, New York, 1968, p. 468. [11] M.P. Little, Are two mutations sucient to cause cancer? Some generalizations of the two-mutation model of carcinogenesis of Moolgavkar, Venzon, and Knudson, and of the multistage model of Armitage and Doll, Biometrics 51 (1995) 1278.

[12] L.G. Hanin, A.Y. Yakovlev, A nonidenti®ability aspect of the two-stage model of carcinogenesis, Risk Anal. 16 (1996) 711.

[13] R.B. Conolly, J.S. Kimball, Computer simulation of cell growth governed by stochastic processes, Toxicol. Appl. Pharmacol. 124 (1994) 284.

[14] N.T.J. Bailey, The Elements of Stochastic Processes with Applications to the Natural Sciences, Wiley, New York, 1964, p. 94.

[15] P. Carthew, R.R. Maronpot, J.F. Foley, R.E. Edwards, B.M. Nolan, Method for determining whether the number of heapatocytes in rat liver is increased after treatment with the peroxisome proliferator gem®brozil, J. Appl. Toxicol. 17 (1997) 47.

[16] C.J. Portier, Estimating the tumour onset distribution in animal carcinogenesis experiments, Biometrika 73 (1986) 371.

[17] Fortran powerstation 4.0 and IMSL Libraries for MS Windows 95, Microsoft. [18] Matlab version 5.3.0, MathWorks, 1999.

[19] T.L. Saaty, Some stochastic processes with absorbing barriers, J. R. Statist. Soc., Ser. B 23 (1961) 319.