Measure representation and multifractal analysis of complete genomes

Yu, Zu-Guo, Anh, Vo V., & Lau, Ka-Sing (2001) Measure representation and multifractal analysis of complete genomes. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 64(3), 031903.

View at publisher


This paper introduces the notion of measure representation of DNA sequences. Spectral analysis and multifractal analysis are then performed on the measure representations of a large number of complete genomes. The main aim of this paper is to discuss the multifractal property of the measure representation and the classification of bacteria. From the measure representations and the values of the Dq spectra and related Cq curves, it is concluded that these complete genomes are not random sequences. In fact, spectral analyses performed indicate that these measure representations, considered as time series, exhibit strong long-range correlation. Here the long-range correlation is for the K-strings with dictionary ordering, and it is different from the base pair correlations introduced by other people. For substrings with length K=8, the Dq spectra of all organisms studied are multifractal-like and sufficiently smooth for the Cq curves to be meaningful. With the decreasing value of K, the multifractality lessens. The Cq curves of all bacteria resemble a classical phase transition at a critical point. But the ‘‘analogous’’ phase transitions of chromosomes of nonbacteria organisms are different. Apart from chromosome 1 of C. elegans, they exhibit the shape of double-peaked specific heat function. A classification of genomes of bacteria by assigning to each sequence a point in two-dimensional space (D_{-1} ,D1) and in three-dimensional space (D_{-1} ,D1 ,D_{-2}) was given. Bacteria that are close phylogenetically are almost close in the spaces (D_{-1} ,D1) and (D_{-1} ,D1 ,D_{-2}).

Impact and interest:

78 citations in Scopus
66 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

221 since deposited on 06 Aug 2007
3 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 7912
Item Type: Journal Article
Refereed: Yes
Keywords: measure prepresentation, DNA, multifractal analysis
DOI: 10.1103/PhysRevE.64.031903
ISSN: 1550-2376
Subjects: Australian and New Zealand Standard Research Classification > PHYSICAL SCIENCES (020000) > OTHER PHYSICAL SCIENCES (029900) > Biological Physics (029901)
Australian and New Zealand Standard Research Classification > MATHEMATICAL SCIENCES (010000) > PURE MATHEMATICS (010100) > Ordinary Differential Equations Difference Equations and Dynamical Systems (010109)
Divisions: Past > QUT Faculties & Divisions > Faculty of Science and Technology
Current > Research Centres > Science Research Centre
Copyright Owner: Copyright 2001 The American Physical Society
Copyright Statement: Reproduced in accordance with the copyright policy of the publisher.
Deposited On: 06 Aug 2007 00:00
Last Modified: 10 Aug 2011 15:27

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page