Authors: Jungermann, Felix
Morik, Katharina
Title: Enhanced services for targeted information retrieval by event extraction and data mining
Language (ISO): en
Abstract: Where Information Retrieval (IR) and Text Categorization delivers a set of (ranked) documents according to a query, users of large document collections would rather like to receive answers. Questionanswering from text has already been the goal of the Message Understanding Conferences. Since then, the task of text understanding has been reduced to several more tractable tasks, most prominently Named Entity Recognition (NER) and Relation Extraction. Now, pieces can be put together to form enhanced services added on an IR system. In this paper, we present a framework which combines standard IR with machine learning and (pre-)processing for NER in order to extract events from a large document collection. Some questions can already be answered by particular events. Other questions require an analysis of a set of events. Hence, the extracted events become input to another machine learning process which delivers the final output to the user’s question. Our case study is the public collection of minutes of plenary sessions of the German parliament and of petitions to the German parliament.
Subject Headings: Data mining
Entity recognition
Information retrieval
Relation extraction
URI: http://hdl.handle.net/2003/25864
http://dx.doi.org/10.17877/DE290R-14441
Issue Date: 2008-11-26T14:23:57Z
Appears in Collections:Sonderforschungsbereich (SFB) 475

Files in This Item:
File Description SizeFormat 
tr04-08-Jungermann.pdfDNB265.84 kBAdobe PDFView/Open


This item is protected by original copyright



This item is protected by original copyright rightsstatements.org