Predicting polymorphic EST-SSRs in silico.

Overview

The public availability of large quantities of gene sequence data provides a valuable resource of the mining of Simple Sequence Repeat (SSR) molecular genetic markers for genetic analysis. These markers are inexpensive, require minimal labour to produce and can frequently be associated with functionally annotated genes. This study presents the characterization of barley EST-SSRs and the identification of putative polymorphic SSRs from EST data. Polymorphic SSRs are distinguished from monomorphic SSRs by the representation of varying motif lengths within an alignment of sequence reads. Two measures of confidence are calculated, redundancy of a polymorphism and co-segregation with accessions. The utility of this method is demonstrated through the discovery of 597 candidate polymorphic SSRs, from a total of 452 642 consensus expressed sequences. PCR amplification primers were designed for the identified SSRs. Ten primer pairs were validated for polymorphism in barley and for transferability across species. Analysis of the polymorphisms in relation to SSR motif, length, position and annotation is discussed.