Short Updates- Machine Learning Based News Summarizer

Authors

  • Raksha Dangol Advanced College of Engineering and Management, Kathmandu, Nepal
  • Prashna Adhikari Advanced College of Engineering and Management, Kathmandu, Nepal
  • Pranjal Dahal Advanced College of Engineering and Management, Kathmandu, Nepal
  • Hrizu Sharma Advanced College of Engineering and Management, Kathmandu, Nepal

DOI:

https://doi.org/10.3126/jacem.v8i2.55939

Keywords:

Automated Text Summarization, Deep learning, Inverse Document Frequency, novel intra-attention mechanism, Pointer-generator, Seq2Seq, Term frequency, TF-IDF

Abstract

Automated Text Summarization is becoming important due to the vast amount of data being generated. Manual processing of documents is tedious, mostly due to the absence of standards. Therefore, there is a need for a mechanism to reduce text size, structure it, and make it readable for users. Natural Language Processing (NLP) is critical for analyzing large amounts of unstructured, text-heavy data. This project aims to address concerns with extractive and abstractive text summarization by introducing a new neural network model that deals with repetitive and incoherent phrases in longer documents. The model incorporates a novel Seq2Seq architecture that enhances the standard attentional model with an intra-attention mechanism. Additionally, a new training method that combines supervised word prediction and reinforcement learning is employed. The model utilizes a hybrid pointer-generator network, which distinguishes it from the standard encoder-decoder model. This approach produces higher quality summaries than existing models.

Downloads

Download data is not yet available.
Abstract
82
PDF
77

Author Biographies

Raksha Dangol, Advanced College of Engineering and Management, Kathmandu, Nepal

Assistant Professor

Prashna Adhikari, Advanced College of Engineering and Management, Kathmandu, Nepal

Undergraduate student

Pranjal Dahal, Advanced College of Engineering and Management, Kathmandu, Nepal

Undergraduate student

Hrizu Sharma, Advanced College of Engineering and Management, Kathmandu, Nepal

Undergraduate student

Downloads

Published

2023-06-23

How to Cite

Dangol, R., Adhikari, P., Dahal, P., & Sharma, H. (2023). Short Updates- Machine Learning Based News Summarizer. Journal of Advanced College of Engineering and Management, 8(2), 15–25. https://doi.org/10.3126/jacem.v8i2.55939

Issue

Section

Articles