|Title:||Estimating the generalization performance of a SVM efficiently|
|Abstract:||This paper proposes and analyzes an approach to estimating the generalization performance of a support vector machine (SVM) for text classification. Without any computation intensive resampling, the new estimators are computationally much more efficient than cross-validation or bootstrap, since they can be computed immediately from the form of the hypothesis returned by the SVM. Moreover, the estimators delevoped here address the special performance measures needed for text classification. While they can be used to estimate error rate, one can also estimate the recall, the precision, and the F1. A theoretical analysis and experiments on three text classification collections show that the new method can effectively estimate the performance of SVM text classifiers in a very efficient way. The paper is written in English.|
|Appears in Collections:||LS 08 Künstliche Intelligenz|
Files in This Item:
|report25.pdf||DNB||372.88 kB||Adobe PDF||View/Open|
This item is protected by original copyright
All resources in the repository are protected by copyright.