Assistant Professor
Computer and Information Science and Engineering
College of Engineering, University of Florida
Gainesville, FL 32611
Office: E456 CSE Building
Phone: (352) 562-8936; Fax: (352) 392-1220
Office Hours: Tuesday and Thursday 5-6pm or by appointment
Prospective Students ShortBio Berkeley Homepage
My research interest is in building systems and designing algorithms to support better data analysis, where better can mean: more efficient/scalable, more advanced (using statistical machine learning), more accurate, more interactive, self-improving, and easier to use. I pursue research topics such as probabilistic databases, large-scale advanced data analysis, and query-driven interactive machine learning. Currently, I am particularly interested in bridging data management systems with statistical and probabilistic models and tools. For more information please visit our lab website Data Science Research @ UF.
I am currently looking for new graduate students. For more information, please refer to Prospective Students.
I am a member of the UFL Database Group.
ShortCV Projects Talks Students Publications Teaching Other
- ProbKB: Large-scale Probabilistic Reasoning over Uncertain Knowledge Bases
- Archer: Query-Driven Statistical Text Analysis in DBMS and MPP frameworks
- CAMeL: Crowd Assisted Machine Learning
- Past Projects
- Christan Grant
- Yang Chen
- Kun Li
- Sean Goldberg
- Yang Peng
- CIS6930, Data Science: Large-scale Advanced Data Analysis, Spring 2013
- COP5725, Data Management Systems, Fall 2012
- CIS4301, Information and Data Management Systems, Spring 2012
- CIS6930, Data Science: Large-scale Advanced Data Analysis, Fall 2011
- “A Probabilistic Knowledge Base System”,
- Invited Talk @ Rochester Big Data Forum, Oct. 2012
- “Hybrid In-Database Inference for Declarative Information Extraction” sigmod11slides
- SIGMOD Conference, June 15, 2011
- “Selectivity Estimation for Extraction Operators over Text Data” icde11slides
- ICDE Conference, April 14, 2011
- “Querying Probabilistic Information Extraction”
- EMC/Greenplum Seminar, July 11, 2011
- CSAIL Seminar, MIT, November 17, 2010.
- Database Seminar, University of Toronto, January 5, 2010.
- EMC/Greenplum Seminar, July 11, 2011
- “Querying Probabilistic Information Extraction” pvldb10slides
- VLDB Conference, September, 2010
- “Probabilistic Declarative Information Extraction” icde10slides
- ICDE Conference, March, 2010
- “Declarative Information Extraction in a Probabilistic Database System”
- Info Lab Seminar, Stanford, May, 2009.
MADden: Query-Driven Statistical Text Analytics
To Appear in Proceedings of ACM CIKM, 2012
Christan Grant, Jordan Gumbs, Kun Li, Daisy Zhe Wang, George Chitouras
Automatic Knowledge Base Construction using Probabilistic Extraction, Deductive Reasoning, and Human Feedback
To Appear in Proceedings of NAACL-HLT, 2012, short paper
Daisy Zhe Wang, Yang Chen, Sean Goldberg, Christan Grant, and Kun Li
The MADlib Analytics Library or MAD Skills, the SQL
To Appear in Proceedings of VLDB, 2012
Joseph M. Hellerstein, Christoper Re, Florian Schoppmann, Daisy Zhe Wang, Eugene Fratkin,
Aleks Gorajek, Kee Siong Ng, Caleb Welton, Xixuan Feng, Kun Li, and Arun Kumar
Hybrid In-Database Inference for Declarative Information Extraction sigmod11 sigmod11slides
Proceedings of ACM SIGMOD International Conference on Management of Data, 2011
Daisy Zhe Wang, Michael J. Franklin, Minos Garofalakis, Joseph M. Hellerstein,
and Michael L. Wick
Selectivity Estimation for Extraction Operators over Text Data icde11 icde11slides
Proceedings of 27th IEEE ICDE International Conference on Data Engineering, 2011
Daisy Zhe Wang, Long Wei, Yunyao Li, Frederick Reiss, and Shivakumar Vaithyanathan
Querying Probabilistic Information Extraction pvldb10 pvldb10slides
Proceedings of 36th VLDB Very Large Data Base Endowment, 2010, PVLDB Vol.3
Daisy Zhe Wang, Michael J. Franklin, Minos Garofalakis, and Joseph M. Hellerstein
Probabilistic Declarative Information Extraction icde10 icde10slides TR-pdb-ie
Proceedings of 26th IEEE ICDE International Conference on Data Engineering, 2010, short paper
Daisy Zhe Wang, Eirinaios Michelakis, Michael J. Franklin, Minos Garofalakis,
and Joseph M. Hellerstein
BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models vldb08a vldb08slides
Proceedings of 34th VLDB Very Large Data Base Endowment, 2008
Daisy Zhe Wang, Eirinaios Michelakis, Minos Garofalakis, and Joseph M. Hellerstein
WebTables: Exploring the Power of Tables on the Web vldb08b
Proceedings of 34th VLDB Very Large Data Base Endowment, 2008
Michael Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, Yang Zhang
VLDB Endowment
ACM SIGMOD
LaTex Templates and Guides
A Parable of Modern Research
Bob has lost his keys in a room which is dark except for one brightly lit corner.
“Why are you looking under the light, you lost them in the dark!”
“I can only see here.”
