Tag: #complexity #compression #entropy

Elements and Relations: Aspects of a Scientific Metaphysics

This textbook is built around the central proposition that systems theory is an attempt to construct an “exact and scientific metaphysics” (an ESM).

Read the full article at: link.springer.com

A Review of Methods for Estimating Algorithmic Complexity: Options, Challenges, and New Directions

Hector Zenil

Entropy 2020, 22(6), 612

 

Some established and also novel techniques in the field of applications of algorithmic (Kolmogorov) complexity currently co-exist for the first time and are here reviewed, ranging from dominant ones such as statistical lossless compression to newer approaches that advance, complement and also pose new challenges and may exhibit their own limitations. Evidence suggesting that these different methods complement each other for different regimes is presented and despite their many challenges, some of these methods can be better motivated by and better grounded in the principles of algorithmic information theory. It will be explained how different approaches to algorithmic complexity can explore the relaxation of different necessary and sufficient conditions in their pursuit of numerical applicability, with some of these approaches entailing greater risks than others in exchange for greater relevance. We conclude with a discussion of possible directions that may or should be taken into consideration to advance the field and encourage methodological innovation, but more importantly, to contribute to scientific discovery. This paper also serves as a rebuttal of claims made in a previously published minireview by another author, and offers an alternative account.

Source: www.mdpi.com

Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences

We introduce and study a set of training-free methods of an information-theoretic and algorithmic complexity nature that we apply to DNA sequences to identify their potential to identify nucleosomal binding sites. We test the measures on well-studied genomic sequences of different sizes drawn from different sources. The measures reveal the known in vivo versus in vitro predictive discrepancies and uncover their potential to pinpoint high and low nucleosome occupancy. We explore different possible signals within and beyond the nucleosome length and find that the complexity indices are informative of nucleosome occupancy. We found that, while it is clear that the gold standard Kaplan model is driven by GC content (by design) and by k-mer training; for high occupancy, entropy and complexity-based scores are also informative and can complement the Kaplan model.

 

Training-free measures based on algorithmic probability identify high nucleosome occupancy in DNA sequences
Hector Zenil, Peter Minary
Nucleic Acids Research, gkz750,

Source: academic.oup.com