CSCI 780 - Information Organization & Retrieval

3 hours, 3 credits
Dr. Kwok

Concepts of information retrieval: keywords and Boolean retrieval; text processing, automatic indexing, term weighting, similarity measures. Retrieval models: vector model, probabilistic model. Extended Boolean systems: fuzzy set, p-norm models; linguistic model. Extensions and AI techniques: learning and relevance feedback; term dependence; document and term clustering; network approaches; linguistic analysis and knowledge representation. Implementation: inverted files; efficiency issues for large-scale systems; integrating database and information retrieval.