Generating observations and log-normally distributed random
errors
We generate 10000 Observations of a sum of 100 random variables with
mean 10 and multiplicative standard deviation of 1.7.
if (!requireNamespace("mvtnorm")) {
warning("Remainder of the vignette required mvtnorm installed.")
knitr::opts_chunk$set(error = TRUE)
}
nObs <- 100; nRep <- 10000
#nObs <- 1000; nRep <- 100
xTrue <- rep(10, nObs)
sigmaStar <- rep(1.7, nObs) # multiplicative stddev
theta <- getParmsLognormForExpval(xTrue, sigmaStar)
# generate observations with correlated errors
acf1 <- c(0.4,0.1)
corrM <- setMatrixOffDiagonals(
diag(nrow = nObs), value = acf1, isSymmetric = TRUE)
xObsN <- exp(mvtnorm::rmvnorm(
nRep, mean = theta[,1]
, sigma = diag(theta[,2]) %*% corrM %*% diag(theta[,2])))
A single draw of the autocorrelated 100 variables looks like the
following.

Estimating the correlation matrix and effective number of
parameters
We can estimate the autocorrelation matrix by assuming that it
depends only on the distance in time, and estimate the autocorrelation
matrix.
The original autocorrelation function used to generate the sample
was:
## [1] 1.0 0.4 0.1
The effective autocorrelation function estimated from the sample
is:
(effAcf <- computeEffectiveAutoCorr(ds$xErr))
## [1] 1.00000000 0.27920219 0.04710105
(nEff <- computeEffectiveNumObs(ds$xErr))
## [1] 60.78516
Due to autocorrelation, the effective number of parameters is less
than nObs = 100.
Computing the mean and its standard deviation
First we compute the distribution parameter of the sum of the 100
variables. The multiplicative uncertainty has decreased from 1.7.
#coefSum <- estimateSumLognormal( theta[,1], theta[,2], effAcf = effAcf )
coefSum <- estimateSumLognormal( theta[,1], theta[,2], effAcf = c(1,acf1) )
setNames(exp(coefSum["sigma"]), "sigmaStar")
## sigmaStar
## 1.077687
Its expected value corresponds to the sum of expected values
(100*10).
(sumExp <- getLognormMoments( coefSum[1], coefSum[2])[1,"mean"])
## mean
## 1000
The lognormal approximation of the distribution of the sum, is close
to the distribution of the 10000 repetitions.

The mean is the sum divided by the number of observations, n. While the multiplicative standard
deviation does not change by this operation, the location parameter is
obtained by dividing by n at
original scale, hence, subtracting log(n) at
log-scale.
(coefMean <- setNames(c(coefSum["mu"] - log(nObs), coefSum["sigma"]), c("mu","sigma")))
## mu sigma
## 2.2997863 0.0748167
And we can plot the estimated distribution of the mean.
