Analysis of Attention Mechanisms in Box-Embedding Systems

Jeffrey Sardina, Callie Sardina, John D. Kelleher, Declan O’Sullivan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Large-scale Knowledge Graphs (KGs) have recently gained considerable research attention for their ability to model the inter- and intra- relationships of data. However, the huge scale of KGs has necessitated the use of querying methods to facilitate human use. Question Answering (QA) systems have shown much promise in breaking down this human-machine barrier. A recent QA model that achieved state-of-the-art performance, Query2box, modelled queries on a KG using box embeddings with an attention mechanism backend to compute the intersections of boxes for query resolution. In this paper, we introduce a new model, Query2Geom, which replaces the Query2box attention mechanism with a novel, exact geometric calculation. Our findings show that Query2Geom generally matches the performance of Query2box while having many fewer parameters. Our analysis of the two models leads us to formally describe the interaction between knowledge graph data and box embeddings with the concepts of semantic-geometric alignment and mismatch. We create the Attention Deviation Metric as a measure of how well the geometry of box embeddings captures the semantics of a knowledge graph, and apply it to explain the difference in performance between Query2box and Query2Geom. We conclude that Query2box’s attention mechanism operates using “latent intersections” that attend to the semantic properties in embeddings not expressed in box geometry, acting as a limit on model interpretability. Finally, we generalise our results and propose that semantic-geometric mismatch is a more general property of attention mechanisms, and provide future directions on how to formally model the interaction between attention and latent semantics.

Original languageEnglish
Title of host publicationArtificial Intelligence and Cognitive Science - 30th Irish Conference, AICS 2022, Revised Selected Papers
EditorsLuca Longo, Ruairi O’Reilly
PublisherSpringer Science and Business Media Deutschland GmbH
Pages68-80
Number of pages13
ISBN (Print)9783031264375
DOIs
Publication statusPublished - 2023
Event30th Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2022 - Munster, Ireland
Duration: 8 Dec 20229 Dec 2022

Publication series

NameCommunications in Computer and Information Science
Volume1662 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference30th Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2022
Country/TerritoryIreland
CityMunster
Period8/12/229/12/22

Keywords

  • Attention
  • Box embeddings
  • Knowledge graph
  • Question answering

Fingerprint

Dive into the research topics of 'Analysis of Attention Mechanisms in Box-Embedding Systems'. Together they form a unique fingerprint.

Cite this