Degree Type

Dissertation

Date of Award

2005

Degree Name

Doctor of Philosophy

Department

Electrical and Computer Engineering

Major

Bioinformatics and Computational Biology

First Advisor

Julie A. Dickerson

Second Advisor

Eve Syrkin Wurtele

Abstract

This work integrates multi-scale clustering and short-time correlation to estimate genetic networks with different time resolutions and detail levels. Gene expression data are noisy and large scale. Clustering is widely used to group genes with similar pattern. The cluster centers can be used to infer the genetic networks among these clusters. This work introduces the Multi-scale Fuzzy K-means clustering algorithm to uncover groups of coregulated genes and capture the networks in different levels of detail.;Time series expression profiles provide dynamic information for inferring gene regulatory relationships. Large scale network inference, identifying the transient interactions and feedback loops as well as differentiating direct and indirect interactions are among the major challenges of genetic network inference. Time correlation can estimate the time delay and edge direction. Partial correlation and directed-separation theory help differentiate direct and indirect interactions and identify feedback loops. This work introduces the constraint-based time-correlation (CBTC) network inference algorithm that combines these methods with time correlation estimation to more fully characterize genetic networks. Gene expression regulation can happen in specific time periods and conditions instead of across the whole expression profile. Short-time correlation can capture transient interactions.;The network discovery algorithm was mainly validated using yeast cell cycle data. The algorithm successfully identified the yeast cell cycle development stages, cell cycle and negative feedback loops, and indicated how the networks dynamically changes over time. The inferred networks reflect most interactions previously identified by genome-wide location analysis and match the extant literature. At detailed network level, the inferred networks provide more detailed information about genes (or clusters) and the interactions among them. Interesting genes, clusters and interactions were identified, which match the literature and the gene ontology information and provide hypotheses for further studies.

DOI

https://doi.org/10.31274/rtd-180813-1178

Publisher

Digital Repository @ Iowa State University, http://lib.dr.iastate.edu/

Copyright Owner

Pan Du

Language

en

Proquest ID

AAI3200413

File Format

application/pdf

File Size

117 pages

Share

COinS