Title:
|
AN ONTOLOGY-DRIVEN SIMILARITY PROVIDING RELIABLE PROTEIN FAMILY RECOGNITION |
Author(s):
|
Fernando Garcia , Francisco J. Lopez , Carlos Cano , Armando Blanco |
ISBN:
|
972-8924-09-7 |
Editors:
|
Nuno Guimarães, Pedro Isaías and Ambrosio Goikoetxea |
Year:
|
2006 |
Edition:
|
Single |
Keywords:
|
Fuzzy Semantic Simililarity, Gene Ontology, Possibilistic. |
Type:
|
Short Paper |
First Page:
|
649 |
Last Page:
|
654 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Last years mapping of the genomes opens the door to understanding the root and causes of many genes behaviour, but the interaction between genes is highly complex. One of the main tasks is to group the genes according to their function. This paper presents a new method for measuring the semantic similarity of genes using the annotations derived from the Gene Ontology (GO). Following the idea of Keller [12], our similarity measure is based in a λ-fuzzy measure [3]. For each gene, it takes into account its whole set of annotations (terms), and for each pair of terms, their nearest common antecessor. The evidence codes of the annotations modify their importance. We computed the weights of the evidence by means of a genetic algorithm (GA). For validating the quality of the measure we took protein families and obtained the similarity for each pair of protein in a similarity matrix. Finally, we applied a possibilistic clustering algorithm and observed how well the families were distinguished. |
|
|
|
|