BNFO 653 – Pattern Recognition and Gene Finding (2017)
Projects: Computation to Solve Problems
The following projects each guide you through a problem whose solution requires computer programming. Choose whichever one (or more) strikes your fancy. It is better to do one well than to attempt too many.
An example of data mining that combines a search of text with a search of protein sequences.
Short tandem repeats (STRs) are commonly used in forensic applications.
Extracting useful insights from microarray data sliced by metabolic pathways.
How to find them by repetitive structure, and how to compare their locations in different genomes.
Using position-specific scoring matrices to extend experimental knowledge of the genomic targets of a certain DNA-binding protein and find previously unknown sites. Uses a well-studied cyanobacterial transcription factor as an example.
Certain genes are known or suspected to be co-regulated. Perhaps the genes contain a common upstream sequence that is the target of a transcription factor. Uses a motif-finding program (MEME) to investigate possible regulatory motifs in Streptococcus genes.
A family of proteins does not show great sequence similarity, except within certain amino acid motifs. Can these motifs be used to find additional family members? Uses a plant protein involved in floral symmetry as an example.
Sometimes computational simulation can take the place of statistical tests. Often, doing a simulation provides insight into what a statistical test is trying to do.
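To make concrete the idea that simulation can take the place of a statistical test, here is a minimal sketch of a permutation test in Python. It is not taken from the course materials: the function name `permutation_test`, its parameters, and the sample measurements are all hypothetical and chosen only for illustration. The sketch asks how often randomly reshuffling the group labels produces a difference in means at least as large as the one observed, which is roughly the question a two-sample test answers analytically.

```python
import random


def permutation_test(a, b, n_permutations=10_000, seed=0):
    """Estimate a two-sided p-value for a difference in means by simulation.

    Hypothetical helper for illustration; not part of the course code.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs agree
    observed = abs(sum(a) / len(a) - sum(b) / len(b))

    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_permutations):
        # Under the null hypothesis the group labels are arbitrary,
        # so shuffle the pooled values and split them again.
        rng.shuffle(pooled)
        sim_a = pooled[:len(a)]
        sim_b = pooled[len(a):]
        diff = abs(sum(sim_a) / len(sim_a) - sum(sim_b) / len(sim_b))
        if diff >= observed:
            hits += 1

    # Add one to numerator and denominator so the estimate is never exactly 0.
    return (hits + 1) / (n_permutations + 1)


if __name__ == "__main__":
    # Invented measurements for two conditions (assumption, not real data).
    control = [10.1, 9.8, 10.3, 10.0, 9.9, 10.2]
    treated = [12.0, 11.8, 12.3, 12.1, 11.9, 12.2]
    print("estimated p-value:", permutation_test(control, treated))
```

Because the null distribution is built directly from the data, the simulation shows what the p-value means: the fraction of label shufflings that look at least as extreme as the real experiment.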