On Accuracy of PDF Divergence Estimators and Their Applicability to Representative Data Sampling

Katarzyna Musial; Bogdan Gabrys; Marcin Budka

doi:10.3390/e13071229

2011

Abstract

Generalisation error estimation is an important issue in machine learning. Cross-validation, traditionally used for this purpose, requires building multiple models and repeating the whole procedure many times in order to produce reliable error estimates. It is, however, possible to accurately estimate the error using only a single model, if the training and test data are chosen appropriately. This paper investigates the possibility of using various probability density function (PDF) divergence measures for the purpose of representative data sampling. As it turned out, the first difficulty one needs to deal with is estimation of the divergence itself. In contrast to other publications on this subject, the experimental results provided in this study show that in many cases such estimation is not possible unless samples consisting of thousands of instances are used. Exhaustive experiments on divergence-guided representative data sampling have been performed using 26 publicly available benchmark datasets and 70 PDF divergence estimators, and their results have been analysed and discussed.
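
To make the idea of estimating a PDF divergence from two samples concrete, the following is a minimal sketch of one well-known approach, a k-nearest-neighbour estimator of the Kullback-Leibler divergence. It is an illustration only: the abstract does not say which of the 70 estimators take this form, and the function name `knn_kl_divergence` and its parameters are hypothetical. The sketch assumes continuous data without duplicate points, since a zero neighbour distance would make the logarithm undefined.

```python
import numpy as np


def knn_kl_divergence(x, y, k=1):
    """Estimate D_KL(P || Q) from samples x ~ P and y ~ Q (hypothetical helper).

    Uses the k-nearest-neighbour form
        (d / n) * sum_i log(nu_k(i) / rho_k(i)) + log(m / (n - 1)),
    where rho_k(i) is the distance from x_i to its k-th neighbour among the
    other points of x, and nu_k(i) is the distance from x_i to its k-th
    neighbour in y. Assumes no duplicate points (all distances > 0).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim == 1:
        x = x[:, None]  # treat a 1-D array as n points in one dimension
    if y.ndim == 1:
        y = y[:, None]
    n, d = x.shape
    m = y.shape[0]
    if not 1 <= k <= min(n - 1, m):
        raise ValueError("k must satisfy 1 <= k <= min(n - 1, m)")

    # Brute-force pairwise Euclidean distances (O(n^2) memory; fine for a sketch).
    dist_xx = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=2)
    dist_xy = np.linalg.norm(x[:, None, :] - y[None, :, :], axis=2)

    # In each sorted row of dist_xx, index 0 is the point itself (distance 0),
    # so index k is the k-th neighbour excluding the point itself.
    rho = np.sort(dist_xx, axis=1)[:, k]
    nu = np.sort(dist_xy, axis=1)[:, k - 1]

    return d / n * np.sum(np.log(nu / rho)) + np.log(m / (n - 1))


# Small demonstration on synthetic data (not data from the paper).
rng = np.random.default_rng(42)
train = rng.normal(0.0, 1.0, size=500)
test_similar = rng.normal(0.0, 1.0, size=500)
test_shifted = rng.normal(1.0, 1.0, size=500)
print("similar split:", knn_kl_divergence(train, test_similar))
print("shifted split:", knn_kl_divergence(train, test_shifted))
```

In a sampling context, a lower estimated divergence between a candidate training set and test set would suggest a more representative split. Estimates of this kind are noisy for small samples, which is consistent with the abstract's remark that thousands of instances may be needed.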

Reference Key	musial2011entropyon
Authors	Katarzyna Musial; Bogdan Gabrys; Marcin Budka
Journal	Entropy
Year	2011
DOI	10.3390/e13071229
URL	http://www.mdpi.com/1099-4300/13/7/1229/ https://doi.org/10.3390/e13071229
Keywords	cross-validation; Kullback-Leibler divergence