OwlTail

Cover image of INLS490-154W: Information Retrieval Systems Design and Implementation

INLS490-154W: Information Retrieval Systems Design and Implementation

Tools for organizing and accessing information have become indispensable. It is critical, therefore, to understand their design and operational foundations. In this course students will have an opportunity to learn about search engines, web crawling, and some Web 2.0 technologies based on hands-on experience and with a focus on techniques that can be used to access, retrieve, organize, and present information. Students will work with practical developmental tools and learn relevant concepts through experimentation. For instance, students will employ an open source search engine and learn about indexing, retrieving, and ranking techniques.

Popular episodes

All episodes

Warning: This podcast data isn't working.

This means that the episode rankings aren't working properly. Please revisit us at a later time to get the best episodes of this podcast!

Podcast cover

Information organization

This is the podcast for the thirteenth class, in which we look at a couple of ways to organize and present information to the user. We see how a term-cloud interface can be created, allowing the user to get a quick glance at the underlying collection. We also talk about a number of clustering algorithms and see how they can be implemented with Lemur.

49mins

17 Nov 2009

Rank #1

Podcast cover

IR on Web 2.0

This is the podcast for the twelfth class, in which we see how REST requests can be made through the web, and the responses in XML can be parsed. This allows us to start connecting with Web 2.0 sources that provide the functionality of meshing different sources by open data exchange.

46mins

10 Nov 2009

Rank #2

Similar Podcasts

Podcast cover

Web crawling

This is the podcast for the eleventh class, in which we see traditional and non-traditional methods of collecting data off the web. Traditional way is demonstrated using web crawling using wget, and non-traditional way is instantiated with YouTube harvesting.

58mins

3 Nov 2009

Rank #3

Podcast cover

User interface for search

This is the podcast for the tenth class, in which we connect the back-end for search that we have been working with to a web-based front-end. This is done using Indri, a new search engine component for Lemur. We also explore some details of AJAX and see how we could use it to enhance our user interface for search.

1hr 25mins

27 Oct 2009

Rank #4

Most Popular Podcasts

Podcast cover

Evaluation-2

This is the podcast for the ninth class, in which we continue looking at evaluation. We talk about more measures to evaluate a query and a system. We also look at comparing two rank lists.

51mins

13 Oct 2009

Rank #5

Podcast cover

Evaluation-1

This is the podcast for the eighth class, in which we start looking at one of the core components of IR - evaluation. We begin our discussion by revisiting recall and precision, and then continue exploring R-precision, AP, and MAP. We see how these can be measured manually and then using TREC supplied tools.

41mins

6 Oct 2009

Rank #6

Podcast cover

Structured query processing

This is the podcast for the seventh class, in which we look at how structured queries with term weights can be executed using Lemur. We use this to provide terminologic feedback to the user, and incorporate the relevance feedback that the user provides into retrieval process.

51mins

6 Oct 2009

Rank #7

Podcast cover

Retrieval models-2

This is the podcast for the sixth class, in which we continue exploring models for retrieving information. So far we have seen vector space, boolean, and language models. Now we talk about probabilistic and relevance models.

1hr 7mins

29 Sep 2009

Rank #8

Podcast cover

Retrieval models-1

This is the podcast for the fifth class, in which we dive into models for retrieving information. We had already looked at the vector space model. Now we talk about boolean retrieval and language models for retrieval.

55mins

22 Sep 2009

Rank #9

Podcast cover

Query processing and retrieval

This is the podcast for the fourth class, in which we look at how queries can be processed using Lemur Toolkit. We then match the processed queries with collection index to retrieve a rank list.

58mins

15 Sep 2009

Rank #10