DIT - Frequency based incremental attribute selection for GRE.

John D. Kelleher

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The DIT system uses an incremental greedy search to generate descriptions, similar to the incremental algorithm described in (Dale and Reiter, 1995). The selection of the next attribute to be tested for inclusion in the description is ordered by the absolute frequency of each attribute in the training corpus. Attributes are selected in descending order of frequency (i.e. the attribute that occurred most frequently in the training corpus is selected first). Where two or more attributes have the same frequency of occurrence the first attribute found with that frequency is selected. The type attribute is always included in the description. Other attributes are included in the description if they exclude at least 1 distractor from the set of distractors that fulfil the description generated prior that attribute’s selection.The algorithm terminates when a distinguishing description has been generated (i.e., all the distractors have been excluded) or when all the target’s attributes have been tested for inclusion in the description.
Original languageEnglish
Title of host publicationProceedings of the MT Summit XI Workshop Using Corpora for Natural Language Generation: Language Generation and Machine Translation (UCNLG+MT)
EditorsAnja Belz, Sebastian Varges
Pages90-92
DOIs
Publication statusPublished - 2007
Externally publishedYes
EventMT Summit XI Workshop -
Duration: 1 Jan 2007 → …

Publication series

NameACL Anthology

Conference

ConferenceMT Summit XI Workshop
Period1/01/07 → …
OtherUsing Corpora for Natural Language Generation: Language Generation and Machine Translation (UCNLG+MT)

Keywords

  • incremental greedy search
  • descriptions
  • training corpus
  • attributes
  • frequency
  • distractors
  • distinguishing description

Cite this