Identification and Application of Biological Repetitive Sequences
Repetitive sequences are abundant in both genomes and proteins. Identification and application of such biological repetitive sequences are important in Biology for at least two reasons. First, repetitive sequences play important roles in evolution. Second, they present difficulties to genome/protein analyses such as genome assembly and sequence alignment. The exciting related research projects that we have done or have been working on include
-
identifying low-complexity regions in protein sequences;
-
application of low-complexity regions to sequence similarity search in large biological databases;
-
identifying repetitive sequences in sequenced genomes;
-
identifying repetitive sequences in genomic fragments.
Software
GBA: A
software for identifying low complexity regions in protein
sequences.
People
-
Xuehui Li
-
Tamer Kahveci
-
A. Mark Settles
Publications
-
Xuehui Li, Tamer Kahveci and Mark Settles,
A Novel
Genome-Scale Repeat Finder Geared towards Transposons,
accepted to Bioinformatics, 2007.
-
Xuehui Li, Tamer Kahveci,
Quality-based similarity search
for biological sequence databases,
BIOCOMP, 2007
-
Xuehui Li, Tamer Kahveci
A Novel Algorithm for
Identifying Low-complexity Regions in a Protein Sequence
Bioinformatics 22:24 (2006), 2980-2987 (PubMed)
(Bioinformatics)
Funding
-
UF Research Initiatives (Project #: 00072365)
Identification of Repeats in Genomes In the Prescence of
transposons.(05/01/2008-04/30/2010)
-
UFGI
seed grant (Project #: 00060495),
Sequence indexed
maize transposon insertion sites for cereal functional
genomics. (01/23/2006 - 6/30/2008)
Tamer Kahveci
Last modified: Mon Nov 24 18:20:35 EST 2008