A DATABASE SCHEMA FOR HIGH-THROUGHPUT SEQUENCING TRANSCRIPTOME PIPELINES

Document Info

Title:	A DATABASE SCHEMA FOR HIGH-THROUGHPUT SEQUENCING TRANSCRIPTOME PIPELINES
Author(s):	Ruben Cruz Huacarpuma, Maristela T. Holanda, Sérgio Lifschitz, Maria Emilia M. T. Walter
ISBN:	978-989-8533-06-7
Editors:	Hans Weghorn, Leonardo Azevedo and Pedro Isaías
Year:	2011
Edition:	Single
Keywords:	Conceptual data model, relational database schema, high-throughput sequencing, transcriptome, pipeline, bioinformatics
Type:	Full Paper
First Page:	187
Last Page:	194
Language:	English
Paper Abstract:	Rapid advances in high-throughput sequencing (HTS) of RNA fragments create interesting computational challenges in bioinformatics. One of these challenges is managing the enormous amount of data generated by automatic sequencers, particularly storing and analyzing the large-scale processed data. In a transcriptome project, the objective is to identify the RNA transcripts of an organism of interest. In this article, we propose both a conceptual model for a transcriptome pipeline and a logical database schema implemented in a relational database system. To validate our model, we present two case studies, both aimed at identifying differentially expressed genes from HTS transcriptome data.
