Wangshu ZHANG

Work place: MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100084, China

E-mail:

Website:

Research Interests: Bioinformatics, Computer systems and computational processes, Computational Learning Theory, Data Mining, Data Compression, Data Structures and Algorithms

Biography

Wangshu Zhang received her B.E. degree in Control Science and Engineering in 2008 from Harbin Engineering University, Harbin, China. She is now a M.S. candidate in the Department of Automation, Tsinghua University, Beijing, China. Her research interests include, machine learning, data mining, bioinformatics, and systems biology.

Author Articles
Prioritization of Candidate Nonsynonymous Single Nucleotide Polymorphisms via Sequence Conservation Features

By Jiaxin WU Wangshu ZHANG Rui JIANG

DOI: https://doi.org/10.5815/ijem.2011.05.09, Pub. Date: 5 Oct. 2011

The Detection of rare variants responsible for human complex diseases has been receiving more and more attentions. However, most existing computational methods for this purpose require the selection of functional variants before statistical analysis. Based on the assumption that nonsynonymous single nucleotide polymor-phisms (nsSNPs) associated with specific diseases should be similar in their properties, we propose a method that utilize conservation scores of nsSNPs and the guilt-by-association principle to prioritize the candi-date nsSNPs for specific diseases. Systematic validation demonstrates that our approach is effective in recovering the relationship between nsSNPs and diseases, with the Manhattan distance measure achieving the most pre-cise prediction results.

[...] Read more.
Extraction of Sequence Conservation Features for the Prioritization of Candidate Single Amino Acid Polymorphisms

By Jiaxin WU Mingxin Gan Wangshu ZHANG Rui JIANG

DOI: https://doi.org/10.5815/ijieeb.2011.02.01, Pub. Date: 8 Mar. 2011

Although remarkable success has been achieved by genome-wide association (GWA) studies over the past few years, genetic variants discovered in GWA studies can typically account for only a small fraction of heritability of most common diseases. As such, the identification of multiple rare variants that are associated with complex diseases has been receiving more and more attentions. However, most of the recently developed statistical approaches for detecting association of rare variants with diseases require the selection of functional variants before the successive analysis, making an effective bioinformatics method for filtering out non-relevant rare variants indispensible. In this paper, we focus on a specific type of genetic variants called single amino acid polymorphisms (SAAPs). We propose to prioritize candidate SAAPs for a specific disease according to their association scores that are calculated using a guilt-by-association model with a set of features derived from protein sequences. We validate the proposed approach in a systematic way and demonstrate that the proposed model is powerful in distinguishing disease-associated SAAPs for the specific disease of interest.

[...] Read more.
Other Articles