The data set “FirstOrder.csv” contains observations of microbial concentrations (log N) measured at different times (t) at a given environmental condition. Lets fit a first-order growth kinetics model \(log N = log N_0 + k \times t\) to the experimental data.
Let’s import the “FirstOrder.csv” dataset, and observe the first five lines.
dat <- read.csv("FirstOrder.csv", sep=";", header=TRUE)
dat
## Time N
## 1 0 37.298
## 2 1 56.149
## 3 2 81.580
## 4 3 108.758
## 5 4 147.276
## 6 5 197.160
## 7 6 273.014
## 8 7 368.254
## 9 8 490.822
## 10 9 673.304
## 11 10 909.398
## 12 11 1204.429
## 13 12 1640.532
## 14 13 2218.299
## 15 14 2983.926
## 16 15 4033.458
Plot N against Time, and observe that the relationship between the two variables is non-linear.
plot(dat$Time, dat$N, xlab="Time (hours)", ylab="N (cells/g)")
Take the logarithm of the variable N and add the new variable logN in the data set.
dat$logN <- log10(dat$N)
dat[1:5,]
## Time N logN
## 1 0 37.298 1.571686
## 2 1 56.149 1.749342
## 3 2 81.580 1.911584
## 4 3 108.758 2.036461
## 5 4 147.276 2.168132
Now plot logN against Time, and observe that they have now a linear relationship.
plot(dat$Time, dat$logN, xlab="Time (hours)", ylab="log N (cels/g)")
The first-order kinetics model is defined as Nt = N0*exp(k*t)
, then the log-linear model becomes ln(Nt) = ln(N0) + k*t
. Fit a linear model using the lm() (linear models) function.
lm1 <- lm(logN ~ Time, data=dat)
The results can be summarised using the summary() fucntion.
summary(lm1)
##
## Call:
## lm(formula = logN ~ Time, data = dat)
##
## Residuals:
## Min 1Q Median 3Q Max
## -0.052876 -0.006170 0.004664 0.011795 0.021331
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 1.6245612 0.0084968 191.2 <2e-16 ***
## Time 0.1328460 0.0009652 137.6 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.0178 on 14 degrees of freedom
## Multiple R-squared: 0.9993, Adjusted R-squared: 0.9992
## F-statistic: 1.894e+04 on 1 and 14 DF, p-value: < 2.2e-16
You can add fitted values to the original data set.
dat$fitted <- fitted(lm1)
dat[1:5,]
## Time N logN fitted
## 1 0 37.298 1.571686 1.624561
## 2 1 56.149 1.749342 1.757407
## 3 2 81.580 1.911584 1.890253
## 4 3 108.758 2.036461 2.023099
## 5 4 147.276 2.168132 2.155945
Plot the observations against time and superimpose a fitted line. First, extract the model’s parameters and use the model to draw a fitted line.
coefs <- coef(lm1)
coefs
## (Intercept) Time
## 1.624561 0.132846
logN0 <- coefs[1]
logN0
## (Intercept)
## 1.624561
k <- coefs[2]
k
## Time
## 0.132846
Plot observed versus fitted values.
plot(dat$logN ~ dat$Time, data=dat,
xlab="Time (hours)", ylab="log N (cells/g)")
time.levels <- seq(0, 20, by=1)
lines(time.levels, logN0 + k*time.levels, lty = 2, col = "blue")