Skip to main navigation Skip to search Skip to main content

Fundamentals of Machine Learning for Neural Machine Translation

  • John D. Kelleher

Research output: Contribution to conferencePaperpeer-review

Abstract

This paper presents a short introduction to neural networks and how they are used for machine translation and concludes with some discussion on the current research challenges being addressed by neural machine translation (NMT) research. The primary goal of this paper is to give a no-tears introduction to NMT to readers that do not have a computer science or mathematical background. The secondary goal is to provide the reader with a deep enough understanding of NMT that they can appreciate the strengths of weaknesses of the technology. The paper starts with a brief introduction to standard feed-forward neural networks (what they are, how they work, and how they are trained), this is followed by an introduction to word-embeddings (vector representations of words) and then we introduce recurrent neural networks. Once these fundamentals have been introduced we then focus in on the components of a standard neural-machine translation architecture, namely: encoder networks, decoder language models, and the encoder-decoder architecture.
Original languageEnglish
DOIs
Publication statusPublished - 2016
Externally publishedYes
EventTranslating Europe Forum 2016: Focusing on Translation Technologies -
Duration: 1 Jan 2016 → …

Conference

ConferenceTranslating Europe Forum 2016: Focusing on Translation Technologies
Period1/01/16 → …
OtherOrganised by the European Commission Directorate-General for Translation

Keywords

  • neural networks
  • machine translation
  • neural machine translation
  • NMT
  • feed-forward neural networks
  • word-embeddings
  • recurrent neural networks
  • encoder networks
  • decoder language models
  • encoder-decoder architecture

Fingerprint

Dive into the research topics of 'Fundamentals of Machine Learning for Neural Machine Translation'. Together they form a unique fingerprint.

Cite this