Enhanced services for targeted information retrieval by event extraction and data mining

dc.contributor.authorJungermann, Felix
dc.contributor.authorMorik, Katharina
dc.date.accessioned2008-11-26T14:23:57Z
dc.date.available2008-11-26T14:23:57Z
dc.date.issued2008-11-26T14:23:57Z
dc.description.abstractWhere Information Retrieval (IR) and Text Categorization delivers a set of (ranked) documents according to a query, users of large document collections would rather like to receive answers. Questionanswering from text has already been the goal of the Message Understanding Conferences. Since then, the task of text understanding has been reduced to several more tractable tasks, most prominently Named Entity Recognition (NER) and Relation Extraction. Now, pieces can be put together to form enhanced services added on an IR system. In this paper, we present a framework which combines standard IR with machine learning and (pre-)processing for NER in order to extract events from a large document collection. Some questions can already be answered by particular events. Other questions require an analysis of a set of events. Hence, the extracted events become input to another machine learning process which delivers the final output to the user’s question. Our case study is the public collection of minutes of plenary sessions of the German parliament and of petitions to the German parliament.en
dc.identifier.urihttp://hdl.handle.net/2003/25864
dc.identifier.urihttp://dx.doi.org/10.17877/DE290R-14441
dc.language.isoende
dc.subjectData miningen
dc.subjectEntity recognitionen
dc.subjectInformation retrievalen
dc.subjectRelation extractionen
dc.subject.ddc004
dc.titleEnhanced services for targeted information retrieval by event extraction and data miningen
dc.typeTextde
dc.type.publicationtypereporten
dcterms.accessRightsopen access

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
tr04-08-Jungermann.pdf
Size:
265.84 KB
Format:
Adobe Portable Document Format
Description:
DNB
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.92 KB
Format:
Item-specific license agreed upon to submission
Description: