AI Content Detection

Authors

  • Prashjeev Rai Dept. of Computer Engineering, Kathmandu Engineering College
  • Sadikshya Gyawali Dept. of Computer Engineering, Kathmandu Engineering College
  • Shuvechhya Bajracharya Dept. of Computer Engineering, Kathmandu Engineering College
  • Sparsh Nidhi Dept. of Computer Engineering, Kathmandu Engineering College
  • Sudeep Shakya Assoc. Professor, Dept. of Computer Engineering, Kathmandu Engineering College

DOI:

https://doi.org/10.3126/kjse.v9i1.78343

Abstract

AI (Artificial Intelligence) content detection is the task of predicting if the given content is written by humans or AI. This project is a detection tool aimed at eliminating issues created by AI-generated text content such as fake academic reports and papers, articles, news, misinformation, and propaganda by combining multiple detection methods. Three models, LSTM (Long short-term memory), BERT (Bidirectional Encoder Representations from Transformers), and distilBERT (distilled Bidirectional Encoder Representations from Transformers) were fine-tuned on a small labelled dataset of 2492 rows. After comparing their performances, distilBERT was selected for further refinement. Then, a pre-trained distilBERT model was finetuned with 24034 rows of collected datasets to get results specific to the intended application. The language models in AI text generators (e.g. GPT-2) often plagiarize from the training datasets. So, to increase the accuracy BERT Classifier-based plagiarism detector was integrated into the system to determine the originality of input text and predict the likelihood of plagiarism or AI generation. The final model had an overall accuracy of 95% on the unseen data with it being able to detect 100% of all AI content in the unseen dataset and correctly classifying 90% of AI text in the unseen dataset.

Downloads

Download data is not yet available.
Abstract
282
PDF
116

Downloads

Published

2025-05-07

How to Cite

Prashjeev Rai, Sadikshya Gyawali, Shuvechhya Bajracharya, Sparsh Nidhi, & Sudeep Shakya. (2025). AI Content Detection. KEC Journal of Science and Engineering, 9(1), 34–44. https://doi.org/10.3126/kjse.v9i1.78343

Issue

Section

Articles