LECTURE NOTES ON INFORMATION RETRIEVAL SYSTEMS B.TECH CSE IV YEAR I SEMESTER (JNTUA-R13) Mr.D.Mukesh ASST.PROFESSOR DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING CHADALAWADA RAMANAMMA ENGINEERINGCOLLEGE CHADALAWADA NAGAR, RENIGUNTA ROAD, TIRUPATI (A.P) - 517506
JAWAHARLAL NEHRU TECHNOLOGICAL UNIVERSITY ANANTAPUR B.Tech. IV - I sem (C.S.E.) T 3 Tu 1 C 3 (13A05708) INFORMATION RETRIEVAL SYSTEMS (CBCC-III) Course Objective: To learn the different models for information storage and retrieval To learn about the various retrieval utilities To understand indexing and querying in information retrieval systems To expose the students to the notions of structured and semi structured data To learn about web search Learning Outcome: At the end of the course students will be assessed to determine whether they are able to store and retrieve textual documents using appropriate models use the various retrieval utilities for improving search do indexing and compressing documents to improve space and time efficiency formulate SQL like queries for unstructured data UNIT I Introduction to Information Retrieval Retrieval Strategies: Vector space model, Probabilistic retrieval strategies: Simple term weights, Non binary independence model, Language Models UNIT II Retrieval Utilities: Relevance feedback, Clustering, N-grams, Regression analysis, Thesauri. UNIT III Retrieval Utilities: Semantic networks, Parsing. Cross-Language Information Retrieval: Introduction, Crossing the language barrier.
UNIT IV Efficiency: Inverted index, Query processing, Signature files, Duplicate document detection UNIT V Integrating Structured Data and Text: A Historical progression, Information retrieval as a relational application, Semi-structured search using a relational schema. Distributed Information Retrieval: A Theoretical model of distributed retrieval, Web search. Text Books : 1. Information Retrieval – Algorithms and Heuristics, David A. Grossman, Ophir Frieder, 2nd Edition, 2012, Springer, (Distributed by Universities Press) Reference Books : 1. Modern Information Retrieval Systems, Yates, Pearson Education 2. Information Storage and Retrieval Systems, Gerald J Kowalski, Mark T Maybury, Springer, 2000 3 . Mining the Web : Discovering Knowledge from Hypertext Data, Soumen Chakrabarti Morgan-Kaufmann Publishers, 2002 4. An Introduction to Information Retrieval, Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze, , Cambridge University Press, Cambridge, England, 2009