دورية أكاديمية

Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles

التفاصيل البيبلوغرافية
العنوان: Methods for simultaneously identifying coherent local clusters with smooth global patterns in gene expression profiles
المؤلفون: Lee Yun-Shien, Tien Yin-Jing, Wu Han-Ming, Chen Chun-Houh
المصدر: BMC Bioinformatics, Vol 9, Iss 1, p 155 (2008)
بيانات النشر: BMC
سنة النشر: 2008
المجموعة: Directory of Open Access Journals: DOAJ Articles
مصطلحات موضوعية: Computer applications to medicine. Medical informatics, R858-859.7, Biology (General), QH301-705.5
الوصف: Background The hierarchical clustering tree (HCT) with a dendrogram 1 and the singular value decomposition (SVD) with a dimension-reduced representative map 2 are popular methods for two-way sorting the gene-by-array matrix map employed in gene expression profiling. While HCT dendrograms tend to optimize local coherent clustering patterns, SVD leading eigenvectors usually identify better global grouping and transitional structures. Results This study proposes a flipping mechanism for a conventional agglomerative HCT using a rank-two ellipse (R2E, an improved SVD algorithm for sorting purpose) seriation by Chen 3 as an external reference. While HCTs always produce permutations with good local behaviour, the rank-two ellipse seriation gives the best global grouping patterns and smooth transitional trends. The resulting algorithm automatically integrates the desirable properties of each method so that users have access to a clustering and visualization environment for gene expression profiles that preserves coherent local clusters and identifies global grouping trends. Conclusion We demonstrate, through four examples, that the proposed method not only possesses better numerical and statistical properties, it also provides more meaningful biomedical insights than other sorting algorithms. We suggest that sorted proximity matrices for genes and arrays, in addition to the gene-by-array expression matrix, can greatly aid in the search for comprehensive understanding of gene expression structures. Software for the proposed methods can be obtained at http://gap.stat.sinica.edu.tw/Software/GAPTest .
نوع الوثيقة: article in journal/newspaper
اللغة: English
تدمد: 1471-2105
العلاقة: http://www.biomedcentral.com/1471-2105/9/155Test; https://doaj.org/toc/1471-2105Test; https://doaj.org/article/26dc9e31d6924621b6228cb4bbb300abTest
DOI: 10.1186/1471-2105-9-155
الإتاحة: https://doi.org/10.1186/1471-2105-9-155Test
https://doaj.org/article/26dc9e31d6924621b6228cb4bbb300abTest
رقم الانضمام: edsbas.F23620F0
قاعدة البيانات: BASE
الوصف
تدمد:14712105
DOI:10.1186/1471-2105-9-155