Title:
|
A DATABASE SCHEMA FOR HIGH-THROUGHPUT SEQUENCING TRANSCRIPTOME PIPELINES |
Author(s):
|
Ruben Cruz Huacarpuma, Maristela T. Holanda, Sérgio Lifschitz, Maria Emilia M. T. Walter |
ISBN:
|
978-989-8533-06-7 |
Editors:
|
Hans Weghorn, Leonardo Azevedo and Pedro Isaías |
Year:
|
2011 |
Edition:
|
Single |
Keywords:
|
Conceptual data model, relational database schema, high-throughput sequencing, transcriptome, pipeline, bioinformatics |
Type:
|
Full Paper |
First Page:
|
187 |
Last Page:
|
194 |
Language:
|
English |
Paper Abstract:
|
Rapid advances in high-throughput sequencing (HTS) of RNA fragments create computational challenges in bioinformatics. One of these challenges is managing the enormous volume of data generated by automatic sequencers, in particular storing and analyzing these large-scale processed data. In a transcriptome project, the objective is to identify the RNA transcripts of an organism of interest. In this article, we propose both a conceptual model for a transcriptome pipeline and a logical database schema implemented in a relational database system. To validate our model, we present two case studies, both aimed at identifying differentially expressed genes from HTS transcriptome data. |