[Information Retrieval]: Guest lecture by Wessel Kraaij (TNO-ICT) Wednesday 4 October, 13.45 h. in LA-1812

Title: The evaluation of information retrieval systems

This lecture provides the tools and methodology for comparing the effectiveness of two or more information retrieval systems in a meaningful way. Several aspects of information retrieval systems can be evaluated without consulting the potential users or customers of the system, for instance the query processing time (measured in milliseconds per query) or the query throughput (measured as the number of queries per second). This lecture, however, focuses on aspects of the system that influence the quality of the retrieved results. In order to measure the quality of search results, one must at some point consult the potential user of the system. After all, what are the correct results for the query “black jaguar”? Cars, or cats? Ultimately, the user has to decide…

[Advanced Databases]: The GIS Project has started

The GIS project assignment can be found on TeleTOP under 'Archive'. Please form groups of 2 to 3 persons and email me the following information before 4 October 2006:

  • the persons in the group
  • which assignment the group will do: subproject A or subproject B

Deadline of the project: 24 October 2006. Final project presentations are scheduled on Friday 27 October 2006, 5th/6th hour.

Question Answering for Dutch: Simple does it

by Arjen Hoekstra, Djoerd Hiemstra, Paul van der Vet and Theo Huibers

When people pose questions in natural language to search for information on the web, the role of question answering (QA) systems becomes important. This paper describes the QA system simpleQA, which is capable of answering Dutch questions to which the answer is a person or a location. The system’s algorithm does not use many complex NLP techniques, but instead uses the size and redundancy of the World Wide Web to its advantage. The system has been evaluated on the DISEQuA corpus and performed quite well: a mean reciprocal rank (MRR) near 0.5. For further improvements, it can easily be extended by adding more rewrite rules and applying more sophisticated filtering and tiling.

[download pdf]
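
For readers unfamiliar with the MRR figure quoted in the abstract: mean reciprocal rank averages, over all questions, the reciprocal of the rank at which the first correct answer appears (0 if no correct answer is returned). The following is a minimal sketch of the standard metric, not the evaluation code used in the paper; the function names and the example data are hypothetical.

```python
from typing import Iterable, Sequence, Set, Tuple


def reciprocal_rank(ranked_answers: Sequence[str], correct: Set[str]) -> float:
    """Return 1/rank of the first correct answer, or 0.0 if none is found."""
    for rank, answer in enumerate(ranked_answers, start=1):
        if answer in correct:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(runs: Iterable[Tuple[Sequence[str], Set[str]]]) -> float:
    """Average the reciprocal rank over all (ranked answers, correct answers) pairs."""
    scores = [reciprocal_rank(ranked, correct) for ranked, correct in runs]
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical example data: three questions with ranked system answers.
example_runs = [
    (["Amsterdam", "Rotterdam"], {"Amsterdam"}),       # correct at rank 1
    (["Paris", "Berlin", "Brussels"], {"Brussels"}),   # correct at rank 3
    (["Rembrandt"], {"Vermeer"}),                      # no correct answer
]
print(mean_reciprocal_rank(example_runs))
```

Under this definition, an MRR near 0.5 is what a system would score if, on average, the first correct answer appeared at about rank 2.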

[Advanced Databases]: Minipracticum on indexing and query optimization

On Friday 22 September, 5th/6th hour, there will be a mini practicum in ZI-C4. The objective of the mini practicum assignment is to get some hands-on experience with physical database design and to apply the theory from Chapters 9, 10 and 11 to actual query execution plans. It is possible to complete the assignment in approximately two or three hours. After the practicum you should hand in your assignments. Deadline: 27 September 2006.

New IR colloquium

We are going to start a new colloquium on information retrieval related topics. It should bring together people working in this field (from different floors of this building) to discuss their newest research as well as new developments in IR in general.

For the first session, which will be held on Tuesday 26 September at 11:00 in our meeting room No. 3126, Claudia has volunteered to give a report on the CLEF conference in Alicante, just 2 days after she returns from Spain, so you will get the newest information possible. We will further discuss how we are going to continue these colloquium meetings.

So if you are interested in IR topics, put this date in your agenda…

[Information Retrieval]: Welcome to the course Information Retrieval

Next Wednesday, 6 September, we start our full semester course Information Retrieval. I think we have put together an interesting course with a number of national and international guest speakers, such as Iadh Ounis from the University of Glasgow, who is one of the architects of the Terrier system.

In November, we will switch from lectures to small research projects in which students participate actively in research done at the University of Twente. In this second part, the meetings will be used to set up a discussion forum in which all participants actively exchange their ideas, progress and problems encountered.

If you have not already done so, please buy the reader Information Retrieval.
We wish you a fruitful course!

[Advanced Databases]: TeleTOP site ready

The complete TeleTOP roster is ready. Please note the two extra meetings on Friday 22 September and Friday 27 October. The course is organised so that there is something for everyone: a little bit of homework, a mini practicum, a small project, a written exam, and quite some talking from my side. It will be hard work and hopefully it is going to be fun. Hope to see you at the lectures.

PF/Tijah: text search in an XML database system

by Djoerd Hiemstra, Henning Rode, Roel van Os and Jan Flokstra

This paper introduces the PF/Tijah system, a text search system that is integrated with an XML/XQuery database management system. We present examples of its use, we explain some of the system internals, and discuss plans for future work. PF/Tijah is part of the open source release of MonetDB/XQuery.

[download pdf] [more info]