Identifying Online Child Sexual Texts in Dark Web through Machine Learning and Deep Learning Algorithms

Vuong M. Ngo, Susan McKeever, Christina Thorpe

Research output: Contribution to journalConference articlepeer-review

Abstract

Predators often use the dark web to discuss and share Child Sexual Abuse Material (CSAM) because the dark web provides a degree of anonymity, making it more difficult for law enforcement to track the criminals involved. In most countries, CSAM is considered as forensic evidence of a crime in progress. Processing, identifying and investigating CSAM is often done manually. This is a time-consuming and emotionally challenging task. In this paper, we propose a novel model based on artificial intelligence algorithms to automatically detect CSA text messages in dark web forums. Our algorithms have achieved impressive results in detecting CSAM in dark web, with a recall rate of 89%, a precision rate of 92.3% and an accuracy rate of 87.6%. Moreover, the algorithms can predict the classification of a post in just 1 microsecond and 0.3 milliseconds on standard laptop capabilities. This makes it possible to integrate our model into social network sites or edge devices to for real-time CSAM detection.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume3631
DOIs
Publication statusPublished - 2023
Event2023 APWG.EU Technical Summit and Researchers Sync-Up, APWG.EU TECH 2023 - Dublin, Ireland
Duration: 21 Jun 202322 Jun 2023

Keywords

  • artificial intelligent
  • Child sexual exploitation material
  • CSAM
  • CSEM
  • forums
  • text content

Fingerprint

Dive into the research topics of 'Identifying Online Child Sexual Texts in Dark Web through Machine Learning and Deep Learning Algorithms'. Together they form a unique fingerprint.

Cite this