Improved Prediction of Bacterial Transcription Start Sites

, , , , & (2006) Improved Prediction of Bacterial Transcription Start Sites. Bioinformatics, 22(2), pp. 142-148.

[img]
Preview
PDF (205kB)
7549.pdf.

View at publisher

Description

Motivation: Identifying bacterial promoters is an important step toward understanding gene regulation. In this paper, we address the problem of predicting the location of promoters and their transcription start sites (TSSs) in Escherichia coli. The accepted method for this problem is to use position weight matrices (PWMs), which define conserved motifs at the sigma-factor binding site. However this method is known to result in a large numbers of false positive predictions. Results: Our approaches to TSS prediction are based upon an ensemble of support vector machines (SVMs) employing a variant of the mismatch string kernel. This classifier is sub-sequently combined with a PWM and a model based on distribution of distances from TSS to gene start. We investi-gate the effect of different scoring techniques and quantify performance using area under a detection-error tradeoff curve. When tested on a biologically realistic task, our method provides performance comparable or superior to the best reported for this task. False positives are significantly reduced, an improvement of great significance to biologists.

Impact and interest:

57 citations in Scopus
49 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

291 since deposited on 14 May 2007
15 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 7549
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Towsey, Michaelorcid.org/0000-0002-8246-7151
Hogan, Jamesorcid.org/0000-0001-7695-3703
Measurements or Duration: 7 pages
Keywords: Bioinformatics, Chlamydia, Transcription Start Sites
DOI: 10.1093/bioinformatics/bti771
ISSN: 1367-4803
Pure ID: 33842271
Divisions: Past > QUT Faculties & Divisions > Faculty of Health
?? 16 ??
Past > Schools > School of Software Engineering & Data Communications
Past > QUT Faculties & Divisions > Faculty of Science and Technology
Past > Schools > School of Life Sciences
Past > Institutes > Institute of Health and Biomedical Innovation
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > Australian Research Centre for Aerospace Automation
Copyright Owner: Copyright 2006 (The authors): Licensed to Oxford University Press
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 14 May 2007 00:00
Last Modified: 30 Jul 2024 00:11