Skip to main navigation Skip to search Skip to main content

Rapid Unsupervised Keyphrase Extraction from Single Document

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Keyphrases offer a concise representation of a document's content. They are valuable for improving web search results and enhancing tasks such as document tagging, text classification, or summarization. This makes keyphrase extraction is an essential component of text mining. Among the widely used constraints and features in existing keyphrase extraction methods, we identified several effective techniques that have not yet been used together: Part-of-Speech (PoS) restrictions, extended stop-word lists, and position-based features. To address this gap, we propose an approach that leverages automatically extracted extended stop word lists combined with PoS restrictions in keyphrases, and incorporates positional criteria. The main goal of the work was to develop a fast keyphrase extraction algorithm, which was built upon the three mentioned features. Experimental results on the INSPEC and SemEval 2010 datasets demonstrate the effectiveness of the proposed method.

Original languageEnglish
Title of host publicationProceedings of the 36th Conference of Open Innovations Association FRUCT, FRUCT 2024
EditorsYurii Khlaponin, Sergey Balandin
PublisherIEEE Computer Society
Pages609-616
Number of pages8
ISBN (Electronic)9789526524627
DOIs
Publication statusPublished - 2024
Event36th Conference of Open Innovations Association FRUCT, FRUCT 2024 - Helsinki, Finland
Duration: 30 Oct 20241 Nov 2024

Publication series

NameConference of Open Innovation Association, FRUCT
ISSN (Print)2305-7254

Conference

Conference36th Conference of Open Innovations Association FRUCT, FRUCT 2024
Country/TerritoryFinland
CityHelsinki
Period30/10/241/11/24

Fingerprint

Dive into the research topics of 'Rapid Unsupervised Keyphrase Extraction from Single Document'. Together they form a unique fingerprint.

Cite this