Meetings: TR 11:00 - 12:20 in GSB 859
Managing Big Text Data
Davood Rafiei
Course eclass page:
Text data has become pervasive and is found in many different shapes and forms.
As the data increases in size and variation, standard models of text and queries
are no longer effective or efficient. This departure has led to many interesting
models and algorithms for search and data exploration.
This course studies some of those models and algorithms with an in-depth
analysis of the underlying problems they address.
Topics to be covered (tentative)
- Models of text and queries
- Ranking and link analysis
- Indexing of keyword queries
- Similarity search
- Relevance feedback
- Beyond keyword queries
Students are expected to have taken an introductory course in data management (e.g. CMPUT 291 or equivalent) and to be proficient in Linux and programming.
- (35%) - Assignments: includes problem sets, programming exercises and research paper reviews
- (45%) - Term project (individual or groups of 2, depending on the class size)
- (15%) - Class presentation of a research paper
- (5%) - Participation in class discussions
Recommended books and resources