Old projects from SNUBI

¡¡

Microarray Information System Construction

Xperanto is basically designed to collect all contents defined by the MIAME standard and meet the specific requirements of laboratories. The database is comprised of six components: experiment, hybridization, biomaterial, array, protocol, and gene expression data (experimentally measured and computationally processed). By experiment we understand a set of one or more hybridizations that are in some way related, often corresponding to a particular publication. Values for data items come from look-up table that manages controlled terms from the MGED Ontology.

¡¡

Pipelining Microarray Data Analysis

The goal of this project is to create a high-throughput platform for microarray data analysis. This work includes:
    • Develop algorithms for removing systematic effects
    • Develop algorithms for supervised analysis
    • Develop algorithms for unsupervised analysis
    • Develop algorithms for recognizing patterns
    • Develop algorithms for visualization
    • Create software package for the above purposes

¡¡

MITree

MITree is a clustering algorithm based on a geometric principle. Initially it was designed to be a binary hierarchical clustering algorithm for gene expression analysis. It is well described in Kim JH, Ohno-Machado L, Kohane IS. Unsupervised learning from complex data: the matrix incision tree algorithm. Pac Symp Biocomput 2001;:30-41. We also applied Evolution Strategy (i.e., a genetic algorithm) to the same problem. pdf Implementation of MITree is available here with a typical input data format example (Fisher's iris data set), where data starts from the 4th column and 2nd row and tab-delimited ASCII text file. Visualization and Evaluation of Clustering Structures for Gene Expression Data Analysis. Kim JH, Ohno-Machado L, Kohane IS. (2002) J Biomed Inform 2002;35(1):25-36

¡¡

BioEMR

ÀÓ»óÁ¤º¸¿Í ¹ÙÀÌ¿ÀÁ¤º¸ÀÇ ±¹Á¦Ç¥Áرâ¹Ý Á¾ÇÕ ¸ðµ¨¸µ ¹× ¹ÙÀÌ¿À EMR ¿¬µ¿±â¼ú °³¹ß

  • ÀÓ»ó-¹ÙÀÌ¿ÀÁ¤º¸ÀÇ ÅëÇÕÀ» À§ÇÑ XML ¹Ìµé¿þ¾î ¹× BioEMR ¿¬µ¿¿£Áø °³¹ß

  • ÁÖ¿ä ¾ÏȯÀÚÀÇ ½Ç½Ã°£ ÀÓ»ó½ÃÇèÀ» À§ÇÑ BioEMR ÆÄÀÏ·µ ½Ã½ºÅÛ ±¸Ãà ¹× ¿î¿µ

  • ±¹Á¦Ç¥ÁØ¿¡ ±â¹ÝÇÑ ¹ÙÀÌ¿ÀÁ¤º¸ ÅëÇո𵨠°³¹ß ¹× ´Ù ±â°ü ¹ÙÀÌ¿À ÀÓ»óÁ¤º¸ ½Ã½ºÅÛ °³¹ß

  • BioEMRÀÇ XML ±â¹Ý °í °¡¿ë¼º ÁúÀÇó¸®±â ¹× Áö½Ä±â¹Ý ÀÚ·áó¸® ¿£Áø °³¹ß

¡¡

Bioinformatics for Functional Annotation of Genome

Using automatic annotation pipeline,

  • High throughput finding noble genes, non-coding RNA, regulatory regions, repetitive sequence, sequence variation

  • Annotation at gene functional and protein level

  • Functional prediction and automatic annotation by data mining

  • Database building and informative integration of functional genomics and proteomics
    ¡¡

¡¡

Transcriptional Control and Microarrays

1) For this scheme, the microarray data used are mostly time series data. But k-means algorithm considers only the similarity in expression patterns and does not have a sense of Time. We want to develop a clustering method that better reflects the temporal patterns in the expression data. It will more reliably reconstruct the transcriptional control network.
2) Extension of this scheme to the microarray data of species other than the yeast. This will include the development of Gibbs Sampler alignment program optimized for the species.

¡¡

Clinical Document Management System (CDMS)

The goal of the our research; Communication between Hospital Information Systems (HIS) using the XML-based clinical document representation standards

¡¡