Estimation of dynamical systems (in particular, identification of their parameters) is fundamental in computational biology, e.g., pharmacology, virology, or epidemiology, to reconcile model runs with available measurements. Unfortunately, the mean and variance priors of the parameters must be chosen very appropriately to balance our distrust of the measurements when the data are sparse or corrupted by noise. Otherwise, the identification procedure fails. One option is to use repeated measurements collected in configurations with common priors (for example, with multiple subjects in a clinical trial or clusters in an epidemiological investigation). This shared information is beneficial and is typically modeled in statistics using nonlinear mixed-effects models. In this paper, we present a data assimilation method that is compatible with such a mixed-effects strategy without being compromised by the potential curse of dimensionality. We define population-based estimators through maximum likelihood estimation. We then develop an equivalent robust sequential estimator for large populations based on filtering theory that sequentially integrates data. Finally, we limit the computational complexity by defining a reduced-order version of this population-based Kalman filter that clusters subpopulations with common observational backgrounds. The performance of the resulting algorithm is evaluated against classical pharmacokinetics benchmarks. Finally, the versatility of the proposed method is tested in an epidemiological study using real data on the hospitalisation of COVID-19 patients in the regions and departments of France.
Mots-clés : Data Assimilation, Non linear mixed-effect models, Kalman Filters, Epidemiology, COVID-19, Pharmacokinetics
@article{MSIA_2022__11_1_213_0, author = {Annabelle Collin and M\'elanie Prague and Philippe Moireau}, title = {Estimation for dynamical systems using a population-based {Kalman} filter {\textendash} {Applications} in computational biology}, journal = {MathematicS In Action}, pages = {213--242}, publisher = {Soci\'et\'e de Math\'ematiques Appliqu\'ees et Industrielles}, volume = {11}, number = {1}, year = {2022}, doi = {10.5802/msia.25}, language = {en}, url = {https://msia.centre-mersenne.org/articles/10.5802/msia.25/} }
TY - JOUR AU - Annabelle Collin AU - Mélanie Prague AU - Philippe Moireau TI - Estimation for dynamical systems using a population-based Kalman filter – Applications in computational biology JO - MathematicS In Action PY - 2022 SP - 213 EP - 242 VL - 11 IS - 1 PB - Société de Mathématiques Appliquées et Industrielles UR - https://msia.centre-mersenne.org/articles/10.5802/msia.25/ DO - 10.5802/msia.25 LA - en ID - MSIA_2022__11_1_213_0 ER -
%0 Journal Article %A Annabelle Collin %A Mélanie Prague %A Philippe Moireau %T Estimation for dynamical systems using a population-based Kalman filter – Applications in computational biology %J MathematicS In Action %D 2022 %P 213-242 %V 11 %N 1 %I Société de Mathématiques Appliquées et Industrielles %U https://msia.centre-mersenne.org/articles/10.5802/msia.25/ %R 10.5802/msia.25 %G en %F MSIA_2022__11_1_213_0
Annabelle Collin; Mélanie Prague; Philippe Moireau. Estimation for dynamical systems using a population-based Kalman filter – Applications in computational biology. MathematicS In Action, Special issue Maths and Industry, Volume 11 (2022) no. 1, pp. 213-242. doi : 10.5802/msia.25. https://msia.centre-mersenne.org/articles/10.5802/msia.25/
[1] Estimation of US SARS-CoV-2 Infections, Symptomatic Infections, Hospitalizations, and Deaths Using Seroprevalence Surveys, JAMA Network Open, Volume 4 (2021) no. 1, p. e2033706 | DOI
[2] Data assimilation: methods, algorithms, and applications, Fundamentals of Algorithms, Society for Industrial and Applied Mathematics, 2016, xviii+306 pages | DOI | HAL
[3] The invariant extended Kalman filter as a stable observer, IEEE Trans. Autom. Control, Volume 62 (2017) no. 4, pp. 1797-1812 | DOI | MR | Zbl
[4] Estimation and Control of Dynamical Systems, Interdisciplinary Applied Mathematics, Springer, 2018 | DOI
[6] Assessment of reduced-order unscented Kalman filter for parameter identification in 1-dimensional blood flow models using experimental data., Int. J. Numer. Methods Biomed. Eng., Volume 33 (2017) no. 8, p. e2843 | DOI | MR
[7] Stan: A probabilistic programming language, J. Stat. Softw., Volume 76 (2017) no. 1 | DOI
[8] SARS-CoV-2, SARS-CoV, and MERS-CoV viral load dynamics, duration of viral shedding, and infectiousness: a systematic review and meta-analysis, The Lancet Microbe (2020)
[9] Fundamental principles of data assimilation underlying the Verdandi library: applications to biophysical model personalization within euHeart, Med. Biol. Eng. Comput., Volume 51 (2013), pp. 1221-1233 | DOI
[10] A Galerkin strategy with Proper Orthogonal Decomposition for parameter-dependent problems – Analysis, assessments and applications to parameter estimation, ESAIM, Math. Model. Numer. Anal., Volume 47 (2013) no. 6, pp. 1821-1843 | DOI | Numdam | MR | Zbl
[11] Parameter estimation in nonlinear mixed effect models using saemix, an R implementation of the SAEM algorithm, J. Stat. Softw., Volume 80 (2017), pp. 1-41 | DOI
[12] Coupling the SAEM algorithm and the extended Kalman filter for maximum likelihood estimation in mixed-effects diffusion models, Stat. Interface, Volume 6 (2013) no. 4, pp. 519-532 | DOI | MR | Zbl
[13] Une deuxième vague entrainant une situation sanitaire critique, 2020 (Note du Conseil scientifique COVID-19)
[14] runjags: An R package providing interface utilities, model templates, parallel computing methods and additional distributions for MCMC models in JAGS, J. Stat. Softw., Volume 71 (2016) no. 9, pp. 1-25 | DOI
[15] Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies, BMC Medicine, Volume 18 (2020) no. 1, pp. 1-13 | DOI
[16] Analysis of population pharmacokinetic data using NONMEM and WinBUGS, J. Biopharm. Stat., Volume 15 (2004) no. 1, pp. 53-73 | DOI | MR
[17] An introduction to stochastic differential equations, American Mathematical Society, 2012
[18] Data Assimilation: The Ensemble Kalman Filter, Springer, 2009 | DOI
[19] Applied Optimal Estimation, MIT Press, 1974
[20] Proportion of asymptomatic coronavirus disease 2019: A systematic review and meta-analysis, J. Med. Virol., Volume 93 (2021) no. 2, pp. 820-830 | DOI
[21] Perspectives on the basic reproductive ratio, J. R. Soc. Interface, Volume 2 (2005) no. 4, pp. 281-293 | DOI
[22] A new extension of the Kalman filter to nonlinear systems, Proc. of AeroSense: The 11th Int. Symp. on Aerospace/Defence Sensing, Simulation and Controls (1997)
[23] New results in linear filtering and prediction theory, Trans. ASME J. Basic. Eng., Volume 83 (1961), p. 95--108 | DOI | MR
[24] Population stochastic modelling (PSM)—an R package for mixed-effects models based on stochastic differential equations, Comput. Methods Programs Biomed., Volume 94 (2009) no. 3, pp. 279-289 | DOI
[25] Maximum likelihood estimation in nonlinear mixed effects models, Comput. Stat. Data Anal., Volume 49 (2005) no. 4, pp. 1020-1038 | DOI | MR | Zbl
[26] Random-effects models for longitudinal data, Biometrics, Volume 38 (1982) no. 4, pp. 963-974 | DOI | Zbl
[27] The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application, Ann. Intern. Med., Volume 172 (2020) no. 9, pp. 577-582 | DOI
[28] Mixed effects models for the population approach: models, tasks, methods and tools, CRC Press, 2014 | DOI
[29] Extension of a SIR model for modelling the propagation of Covid-19 in several countries. (2020) (https://www.medrxiv.org/content/early/2020/05/21/2020.05.17.20104885) | DOI
[30] Data assimilation: A mathematical introduction, Texts in Applied Mathematics, 62, Springer, 2015 | DOI
[31] Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2), Science, Volume 368 (2020) no. 6490, pp. 489-493 | DOI
[32] Comparing the performance of [FOCE] and different expectation-maximization methods in handling complex population physiologically-based pharmacokinetic models, J. Pharmacokinet. Pharmacodyn., Volume 43 (2016) no. 4, pp. 359-370 | DOI
[33] Reduced-order Unscented Kalman Filtering with application to parameter identification in large-dimensional systems, ESAIM, Control Optim. Calc. Var., Volume 17 (2011) no. 2, pp. 380-405 | DOI | Numdam | MR | Zbl
[34] Reduced-order Unscented Kalman Filtering with application to parameter identification in large-dimensional systems, ESAIM, Control Optim. Calc. Var., Volume 17 (2011) no. 2, pp. 380-405 | DOI | Numdam | MR | Zbl
[35] Estimating Data Transformations in Nonlinear Mixed Effects Models, Biometrics, Volume 56 (2000) no. 1, pp. 65-72 | DOI | Zbl
[36] Non-linear mixed-effects models with stochastic differential equations: implementation of an estimation algorithm, J. Pharmacokinet. Pharmacodyn., Volume 32 (2005) no. 1, pp. 85-107 | DOI
[37] et al. Association of public health interventions with the epidemiology of the COVID-19 outbreak in Wuhan, China, J. Am. Med. Soc., Volume 323 (2020) no. 19, pp. 1915-1923
[38] An introduction to the basic reproduction number in mathematical epidemiology, ESAIM, Proc. Surv., Volume 62 (2018), pp. 123-138 | DOI | MR | Zbl
[39] HIV-1 dynamics in vivo: virion clearance rate, infected cell life-span, and viral generation time, Science, Volume 271 (1996) no. 5255, pp. 1582-1586 | DOI
[40] Stochastic methods for sequential data assimilation in strongly nonlinear systems, Monthly Weather Review, Volume 129 (2001) no. 5, pp. 1194-1207 | DOI
[41] Filtres de Kalman singuliers évolutifs pour l’assimilation de données en océanographie, C. R. Acad. Sci., Sér. IIA Earth Planet. Sci., Volume 326 (1998) no. 4, pp. 255-260
[42] A singular evolutive extended Kalman filter for data assimilation in oceanography, J. Marine Syst., Volume 16 (1998) no. 3-4, pp. 323-340 | DOI
[43] Approximations to the log-likelihood function in the nonlinear mixed-effects model, J. Comput. Graph. Stat., Volume 4 (1995) no. 1, pp. 12-35
[44] Performance comparison of various maximum likelihood nonlinear mixed-effects estimation methods for dose–response models, AAPS J., Volume 14 (2012) no. 3, pp. 420-432 | DOI
[45] Use of dynamical models for treatment optimization in HIV infected patients: a sequential Bayesian analysis approach., J. Soc. Fr. Stat., Volume 157 (2016) no. 2, p. 20 | Numdam | MR | Zbl
[46] NIMROD : A program for inference via a normal approximation of the posterior in models with random effects based on ordinary differential equations, Comput. Methods Programs Biomed., Volume 111 (2013) no. 2, pp. 447-458 | DOI
[47] et al. Estimating the burden of SARS-CoV-2 in France, Science, Volume 369 (2020) no. 6500, pp. 208-211 | DOI
[48] A robust nonlinear mixed-effects model for COVID-19 deaths data, Int. J. Numer. Methods Biomed. Eng., Volume 14 (2021) no. 1, pp. 39-57 | MR
[49] Optimal State Estimation: Kalman, , and Nonlinear Approaches, Wiley-Interscience, 2006 | DOI
[50] Stochastic differential equations in NONMEM®: implementation, application, and comparison with ordinary differential equations, Pharm. Res., Volume 22 (2005) no. 8, pp. 1247-1258 | DOI
[51] Pharmacokinetic interactions between theophylline and other medication (Part I), Clin. Pharmacokin., Volume 20 (1991) no. 1, pp. 66-80 | DOI
[52] Linear mixed models for longitudinal data, Linear mixed models in practice, Springer, 1997, pp. 63-153 | DOI
[53] An application of Bayesian population pharmacokinetic/pharmacodynamic models to dose recommendation, Stat. Med., Volume 14 (1995) no. 9, pp. 971-986 | DOI
[54] Statistical methods for HIV dynamic studies in AIDS clinical trials, Stat. Methods Med. Res., Volume 14 (2005) no. 2, pp. 171-192 | DOI | MR | Zbl
Cited by Sources: