dc.contributor.author
Nueske, Feliks
dc.contributor.author
Wu, Hao
dc.contributor.author
Prinz, Jan-Hendrik
dc.contributor.author
Wehmeyer, Christoph
dc.contributor.author
Clementi, Cecilia
dc.contributor.author
Noe, Frank
dc.date.accessioned
2018-06-08T10:51:37Z
dc.date.available
2017-05-18T08:30:10.757Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/21217
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-24513
dc.description.abstract
Many state-of-the-art methods for the thermodynamic and kinetic
characterization of large and complex biomolecular systems by simulation rely
on ensemble approaches, where data from large numbers of relatively short
trajectories are integrated. In this context, Markov state models (MSMs) are
extremely popular because they can be used to compute stationary quantities
and long-time kinetics from ensembles of short simulations, provided that
these short simulations are in “local equilibrium” within the MSM states.
However, over the last 15 years since the inception of MSMs, it has been
controversially discussed and not yet been answered how deviations from local
equilibrium can be detected, whether these deviations induce a practical bias
in MSM estimation, and how to correct for them. In this paper, we address
these issues: We systematically analyze the estimation of MSMs from short non-
equilibrium simulations, and we provide an expression for the error between
unbiased transition probabilities and the expected estimate from many short
simulations. We show that the unbiased MSM estimate can be obtained even from
relatively short non-equilibrium simulations in the limit of long lag times
and good discretization. Further, we exploit observable operator model (OOM)
theory to derive an unbiased estimator for the MSM transition matrix that
corrects for the effect of starting out of equilibrium, even when short lag
times are used. Finally, we show how the OOM framework can be used to estimate
the exact eigenvalues or relaxation time scales of the system without
estimating an MSM transition matrix, which allows us to practically assess the
discretization quality of the MSM. Applications to model systems and molecular
dynamics simulation data of alanine dipeptide are included for illustration.
The improved MSM estimator is implemented in PyEMMA of version 2.3.
en
dc.rights.uri
http://publishing.aip.org/authors/web-posting-guidelines
dc.subject
Markov processes
dc.subject
Singular values
dc.subject
Relaxation times
dc.subject
Trajectory models
dc.subject.ddc
500 Naturwissenschaften und Mathematik::510 Mathematik
dc.title
Markov state models from short non-equilibrium simulations—Analysis and
correction of estimation bias
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation
Journal of Chemical Physics. - 146 (2017), 9,
dcterms.bibliographicCitation.doi
10.1063/1.4976518
dcterms.bibliographicCitation.url
http://dx.doi.org/10.1063/1.4976518
refubium.affiliation
Mathematik und Informatik
de
refubium.mycore.fudocsId
FUDOCS_document_000000027021
refubium.resourceType.isindependentpub
no
refubium.mycore.derivateId
FUDOCS_derivate_000000008204
dcterms.accessRights.openaire
open access