TY - JOUR
T1 - Identifying Online Child Sexual Texts in Dark Web through Machine Learning and Deep Learning Algorithms
AU - Ngo, Vuong M.
AU - McKeever, Susan
AU - Thorpe, Christina
N1 - Publisher Copyright:
© 2023 Copyright for this paper by its authors.
PY - 2023
Y1 - 2023
N2 - Predators often use the dark web to discuss and share Child Sexual Abuse Material (CSAM) because the dark web provides a degree of anonymity, making it more difficult for law enforcement to track the criminals involved. In most countries, CSAM is considered as forensic evidence of a crime in progress. Processing, identifying and investigating CSAM is often done manually. This is a time-consuming and emotionally challenging task. In this paper, we propose a novel model based on artificial intelligence algorithms to automatically detect CSA text messages in dark web forums. Our algorithms have achieved impressive results in detecting CSAM in dark web, with a recall rate of 89%, a precision rate of 92.3% and an accuracy rate of 87.6%. Moreover, the algorithms can predict the classification of a post in just 1 microsecond and 0.3 milliseconds on standard laptop capabilities. This makes it possible to integrate our model into social network sites or edge devices to for real-time CSAM detection.
AB - Predators often use the dark web to discuss and share Child Sexual Abuse Material (CSAM) because the dark web provides a degree of anonymity, making it more difficult for law enforcement to track the criminals involved. In most countries, CSAM is considered as forensic evidence of a crime in progress. Processing, identifying and investigating CSAM is often done manually. This is a time-consuming and emotionally challenging task. In this paper, we propose a novel model based on artificial intelligence algorithms to automatically detect CSA text messages in dark web forums. Our algorithms have achieved impressive results in detecting CSAM in dark web, with a recall rate of 89%, a precision rate of 92.3% and an accuracy rate of 87.6%. Moreover, the algorithms can predict the classification of a post in just 1 microsecond and 0.3 milliseconds on standard laptop capabilities. This makes it possible to integrate our model into social network sites or edge devices to for real-time CSAM detection.
KW - artificial intelligent
KW - Child sexual exploitation material
KW - CSAM
KW - CSEM
KW - forums
KW - text content
UR - http://www.scopus.com/inward/record.url?scp=85178662668&partnerID=8YFLogxK
U2 - 10.21427/wfn5-rt72
DO - 10.21427/wfn5-rt72
M3 - Conference article
AN - SCOPUS:85178662668
SN - 1613-0073
VL - 3631
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
T2 - 2023 APWG.EU Technical Summit and Researchers Sync-Up, APWG.EU TECH 2023
Y2 - 21 June 2023 through 22 June 2023
ER -