-
Notifications
You must be signed in to change notification settings - Fork 1
2.7 SAM
SAM (State-space assessment model)
Current Entry Version: 0.1.0 (Date: 2017-02-20)
First Entry Version: 0.1.0 (Date: 2017-02-20)
First ICCAT Reference
Catalogue Committee
Proposer: George Tserpes
Seconder: Abdelouahed Ben Mhamed
Reviewers:
ICCAT Secretariat
Anders Nielsen DTU-Aqua an@aqua.dtu.dk
All sources available on github. Runs on all common platforms (Linux, Mac, windows)
R, C++, via the R-package TMB.
Depends on the TMB R-package, but all tools are free and open source.
Bootstrap sampling, subsampling, and the jackknife all rely on estimating the variance of a statistic by using the variability between resamples rather than using statistical distributions. This vignette will cover the jackknife and how it is applied to the resampling distribution to generate variance estimates for random forest predictions.
The ordinary jackknife is a resampling method useful for estimating the variance or bias of a statistic. The jackknife estimate of a statistic can be found be by repeatedly calculating the statistic, each time leaving one observation from the sample out and averaging all estimates. The variance of the estimate can be found by calculating the variance of the jackknifed estimates:
\begin{equation} \hat{V}{J} = \frac{n-1}{n}\sum\limits{i=1}^{n} \left(\hat{\theta}{(-i)} - \hat{\theta}{(\cdot)}\right)^2 \end{equation}
To fit the state-space assessment model to data and make it easy to produce common outputs from the model.
The basic state-space assessment model (SAM) is described in Nielsen & Berg (2014). The model has been continuously developed and adapted for different stocks (e.g.~to include tagging data and biomass indices). The current implementation (https://github.com/fishfollower/SAM) is now an R-package based on Template Model Builder (TMB) (Kristensen et al. 2016).
\subsubsection*{Model}
The model is a state--space model.The states
The transition function
\begin{align*}
\log N_{1,y}&=SR(\log(N_{,y-1}))&\
\log N_{a,y}&=\log N_{a-1,y-1} - F_{a-1,y-1} - M_{a-1,y-1}\ , \quad &2\leq a < A \
\log N_{A,y}&=\log( N_{A-1,y-1}\exp^{- F_{A-1,y-1} - M_{A-1},y-1} + N_{A,y-1}\exp^{- F_{A,y-1} - M_{A,y-1}})&\
\log F_{a,y}&=\log F_{a,y-1}\ , \quad &1\leq a \leq A
\end{align*}
Here
The prediction noise
The observation part of the state--space model describes the distribution of the observations for a given state
Here $Z$ is the total mortality rate $Z{a,y}=M_{a,y}+F_{a,y}$,
Observation uncertainty is important e.g.~to get the relative weighting of the different information sources correct, so a lot of effort has been invested in getting the optimal options into SAM. In Berg and Nielsen (2016) different covariance structures are compared for four ICES stocks. It was found that irregular lattice AR(1) observation correlation structure was optimal for surveys. The covariance structures tested were inspired by a previous study (Berg et al.~2014) of the structures obtained from survey calculations. In the paper Albertsen et al. (2016) 13 different observational likelihood formulations were evaluated for four ICES stocks. It was found that the multivariate log-normal representation was among the optimal in all four cases.
To describe the options available consider a yearly vector
where $\Sigma$ is the covariance matrix, and $\hat{C}y$ is the vector of the usual model predictions. The covariance matrix is specified from a vector of standard deviations $\sigma=(\sigma_1\ldots\sigma_A)$ and a correlation matrix $\rho$ (by $\Sigma{a\tilde{a}}=\sigma_a\sigma{\tilde{a}}\rho_{a\tilde{a}}$). Four options are available for the correlation
\subsubsection*{Likelihood and approximation}
The likelihood function for this is set up by first defining the joint likelihood of both random effects (here collected in the
\begin{align*}
\ell_M(\theta,x) = \ell(\theta,\hat{u}\theta,x)-\frac{1}{2}\log(\det(-\ell_{uu}
''(\theta,u,x)|{u=\hat{u}\theta}))+\frac{n}{2}\log(2\pi)
\end{align*}
\subsection*{Required input}
The model use catches at age
\subsection*{Program outputs}
The SAM package produce tables and figures of all standard assessment output (N, F, SSB, TSB, agerage F, and forecasts).
The model use catches at age
The SAM package produce tables and figures of all standard assessment output (N, F, SSB, TSB, agerage F, and forecasts).
The residual calculation procedure in state-space assessment models can be difficult, but is extremely important when evaluating the assumed covariance structure. The standard practice of calculating the residuals (as observed' minus predicted' divided by an estimate of the standard deviation) is strictly only valid for models with purely independent observations. It is not valid for state-space models, where an underlying unobserved process is introducing a correlation structure in the (marginal) distribution of the observations. It is also not valid if the observations are directly assumed to be correlated (e.g. multivariate normal, or multinomial for age compositions). The problem is that the resulting residuals will not become independent.
To get independent residuals the so-called `one-observation-ahead' residuals are computed. The residual for the
Also standard diagnosis like leave-fleet-out and retrospective runs are produced.
Many convenient features for additional plotting, Jitter analysis (for validating convergence), and build-in simulation from model function.
The SAM model has been validated by the ICES method working group. It is used as primary assessment model for more than 25 ICES stocks, and as such validated by each expert working group. The model participated and was simulation tested by the big model comparison initiative in Boston in 2012. The paper about the model is peer reviewed.
Simulation studies are built into the model, so routinely used to validate. The Laplace approximation is validated by parallel implementations as Unscented Kalman Filter and via MCMC.
Validation by comparing to other standard models (XSA, ADAPT, SCAA) done in ICES expert groups and in method groups.
Anders Nielsen
DTU-Aqua an@aqua.dtu.dk
Albertsen, C.M., A. Nielsen, U.H. Thygesen 2016. Choosing the observational likelihood in state-space stock assessment models. Canadian Journal of Fisheries and Aquatic Sciences.
Berg, C.W., A. Nielsen 2016. Accounting for correlated observations in an age-based state-space stock assessment model. ICES Journal of Marine Science: Journal du Conseil 73 (7), 1788-1797
Berg C. W., A. Nielsen, K. Kristensen 2014. Evaluation of alternative age-based methods for estimating relative abundance from survey data in relation to assessment models. Fisheries Research , 151: 91-99.
Kristensen, K, A. Nielsen, C.W. Berg, H.J. Skaug, B. Bell. 2016. TMB: Automatic differentiation and laplace approximation. Journal of Statistical Software 70 (5), 1-21
Nielsen, A. and C.W. Berg 2014. Estimation of time-varying selectivity in stock assessments using state-space models. Fisheries Research 158, 96-101
Thygesen, U.H., C.M. Albertsen, C.W. Berg, K. Kristensen, and A. Nielsen 2017. Validation of state space models fitted as mixed effects models. (Accepted. EES).