Sampling with Confidence: Using k-NN Confidence Measures in Active Learning

Rong Hu

    Research output: Contribution to conferencePaperpeer-review

    Abstract

    Active learning is a process through which classifiers can be built from collections of unlabelled examples through the cooperation of a human oracle who can label a small number of examples selected as most informative. Typically the most informative examples are selected through uncertainty sampling based on classification scores. However, previous work has shown that, contrary to expectations, there is not a direct relationship between classification scores and classification confidence. Fortunately, there exists a collection of particularly effective techniques for building measures of classification confidence from the similarity information generated by k-NN classifiers. This paper investigates using these confidence measures in a new active learning sampling selection strategy, and shows how the performance of this strategy is better than one based on uncertainty sampling using classification scores.
    Original languageEnglish
    DOIs
    Publication statusPublished - 2009
    Event8th International Conference on Case-based Reasoning -
    Duration: 1 Jan 2009 → …

    Conference

    Conference8th International Conference on Case-based Reasoning
    Period1/01/09 → …
    OtherUKDS Workshop

    Keywords

    • Active learning
    • classifiers
    • unlabelled examples
    • human oracle
    • uncertainty sampling
    • classification scores
    • classification confidence
    • k-NN classifiers
    • sampling selection strategy

    Fingerprint

    Dive into the research topics of 'Sampling with Confidence: Using k-NN Confidence Measures in Active Learning'. Together they form a unique fingerprint.

    Cite this