Authors: Joachims, Thorsten
Title: Estimating the generalization performance of a SVM efficiently
Language (ISO): en
Abstract: This paper proposes and analyzes an approach to estimating the generalization performance of a support vector machine (SVM) for text classification. Without any computation intensive resampling, the new estimators are computationally much more efficient than cross-validation or bootstrap, since they can be computed immediately from the form of the hypothesis returned by the SVM. Moreover, the estimators delevoped here address the special performance measures needed for text classification. While they can be used to estimate error rate, one can also estimate the recall, the precision, and the F1. A theoretical analysis and experiments on three text classification collections show that the new method can effectively estimate the performance of SVM text classifiers in a very efficient way. The paper is written in English.
URI: http://hdl.handle.net/2003/2601
http://dx.doi.org/10.17877/DE290R-5102
Issue Date: 2000-01-12
Provenance: Universität Dortmund
Appears in Collections:LS 08 Künstliche Intelligenz

Files in This Item:
File Description SizeFormat 
report25.pdfDNB372.88 kBAdobe PDFView/Open
report25.ps963.7 kBPostscriptView/Open


This item is protected by original copyright



This item is protected by original copyright rightsstatements.org