Research Interests
My research focuses on indexing, storing, accessing and use of
bioinformatics data (DNA, protein sequences, protein structures,
pathways, microarray, ...). Bioinformatics data is different than
existing data for 1) it has redundancy, 2) it is inaccurate, and 3)
it contains cross-relationship among various classes of data.
I am also interested in core database problems that will assist
management of bioinformatics data. Below, you can find a short description of
some of the problems that I am interested in. Feel free to contact me
for collaboration on similar problems.
-
Data summarization: Usually the sizes of biological
databases are huge. This reduces the usability of the
information hidden in such data. I would like to explore
efficient and meaningful summarization methods.
-
Integration: Different types of biological data reveal
different information about life. (e.g. sequences, motifs,
structures, etc.) I would like to develop new methods that
combines these data to better understand how different
organisms work.
-
Indexing: For many problems, exhaustive searching of
bioinformatics data is unfeasible due to the size of the
data. A number of index structures have been proposed to avoid
exhaustive searches. However, these index structures are
either too large or inaccurate. I would like to find novel
indexing methods, that are small, efficient, dynamic, and
extendible to different types of data.
-
Computational methods for pharmacogenomics: In the near
future, we will be able to obtain the complete genetic code
for individuals. This will enable development of personalized
medicine. Understanding and managing data at this scale
requires efficient data management, statistical analysis, and
learning methods.
-
Graph model for bioinformatics: Most of the major
bioinformatics problems can be reduced to well known graph
theory problems. I would like to use this to find a more
common data model for different types of bioinformatics
data. This will make it easier to share information across
heterogeneous data.
-
Inferring function from raw data: The functional
properties of an important fraction of the existing proteins
are unknown. There is a need for computational methods that
helps to discover these properties from the raw data.
Funding
My current funding sources are start-up grant
from University of Florida, UFGI seed
grant, ORAU junior faculty award and NSF (DBI-0606607).
Students
- Xuehui Li
- Jun Liu (Supervisor: Dr. Ranka)
- Padmavati Sridhar
- Jayendra Venkateswaran
- Xu Zhang
Conferences
| CSB
| March 14, 2005
| August 8-11, 2005
| Stanford, CA
|
| PSB
| July 18, 2005
| January 3-7, 2006
| Maui
|
| RECOMB
| September 23, 2005
| April 2-5, 2006
| Venice, Italy
|
| SIGMOD
| December 1, 2005
| June 26-29, 2006
| Chicago
|
| ISMB
| March 1, 2006
| August 6-10, 2006
|
|
| VLDB
| March 16, 2005
| September 12-15, 2005
| Seoul, Korea
|
| ICDE
| July 5, 2006
| April 16-20, 2007
| Istanbul, Turkey
|
Tamer Kahveci
Last modified: Thu Nov 16 10:13:50 EST 2006