SVM Based Prediction of Bacterial Transcription Start Sites

& (2005) SVM Based Prediction of Bacterial Transcription Start Sites. Lecture Notes in Computer Science, 3578, Article number: IDEAL 448-453.

[img]
Preview
PDF (97kB)
7600.pdf.

View at publisher

Description

Identifying bacterial promoters is the key to understanding gene expression. Promoters lie in tightly constrained positions relative to the transcription start site (TSS). Knowing the TSS position, one can predict promoter positions to within a few base pairs, and vice versa. As a route to promoter identification, we formally address the problem of TSS prediction, drawing on the RegulonDB database of known (mapped) Escherichia coli TSS locations. The accepted method of finding promoters (and therefore TSSs) is to use position weight matrices (PWMs). We use an alternative approach based on sup-port vector machines (SVMs). In particular, we quantify performance of several SVM models versus a PWM approach, using area under the detection-error tradeoff (DET) curve as a performance metric. SVM models are shown to out-perform the PWM at TSS prediction, and to substantially reduce numbers of false positives, which are the bane of this problem.

Impact and interest:

4 citations in Scopus
4 citations in Web of Science®
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

480 since deposited on 11 May 2007
15 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 7600
Item Type: Contribution to Journal (Journal Article)
Refereed: Yes
ORCID iD:
Towsey, Michaelorcid.org/0000-0002-8246-7151
Measurements or Duration: 6 pages
Keywords: Bacterial Promoter, Bioinformatics, Machine Learning
DOI: 10.1007/11508069_58
ISSN: 0302-9743
Pure ID: 34291650
Divisions: ?? 16 ??
Past > Schools > School of Software Engineering & Data Communications
Past > QUT Faculties & Divisions > Science & Engineering Faculty
Current > Research Centres > Australian Research Centre for Aerospace Automation
Copyright Owner: Consult author(s) regarding copyright matters
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 11 May 2007 00:00
Last Modified: 03 Mar 2024 06:38