Balancing Privacy and Accuracy in Nepali Sentiment Analysis: Fine-Tuning NepaliBERT with Differential Privacy
DOI: https://doi.org/10.3126/jacem.v12i01.93900

Keywords: Differential Privacy, DP-SGD, Low-Resource NLP, Membership Inference Attack, Nepali Sentiment Analysis, NepaliBERT, Privacy–Utility Trade-off, Privacy-Preserving Machine Learning

Abstract
The increasing volume of user-generated Nepali text has enabled the development of sentiment analysis systems, but training large language models on real data introduces significant privacy risks, including potential exposure through membership inference attacks. This study examines the balance between accuracy and privacy in Nepali sentiment analysis by fine-tuning NepaliBERT with and without Differential Privacy. A high-performing non-private baseline model was trained on approximately 7,000 labeled samples, achieving near-perfect classification performance (Accuracy up to 99.88–100% and Macro F1 up to 1.00), and was subsequently evaluated for vulnerability using membership inference and canary-based privacy assessments. To mitigate privacy risks, Differentially Private Stochastic Gradient Descent was applied under varying privacy budgets (ε), and the resulting models were systematically analyzed to measure performance degradation and resistance to privacy attacks. The findings establish an empirical benchmark for the privacy–utility trade-off in low-resource Nepali NLP and provide practical guidance for building sentiment analysis systems that are both accurate and privacy-preserving.
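The abstract does not include the authors' implementation, but the core DP-SGD mechanism it refers to — clipping each example's gradient to a fixed norm and adding Gaussian noise before the update — can be sketched in plain Python. This is a minimal illustration under assumed parameter names (`clip_norm`, `noise_multiplier`, `lr`); a real fine-tuning run on NepaliBERT would typically use a DP library such as Opacus, which also tracks the privacy budget ε.

```python
import math
import random


def dp_sgd_step(per_example_grads, clip_norm=1.0, noise_multiplier=1.0,
                lr=0.1, rng=None):
    """One illustrative DP-SGD update (not the paper's code):
    clip each per-example gradient to clip_norm, sum, add Gaussian
    noise with std = noise_multiplier * clip_norm, average over the
    batch, and return the resulting parameter update (delta)."""
    rng = rng or random.Random(0)

    # 1. Per-example clipping bounds each sample's influence.
    clipped = []
    for g in per_example_grads:
        norm = math.sqrt(sum(x * x for x in g))
        scale = min(1.0, clip_norm / (norm + 1e-12))
        clipped.append([x * scale for x in g])

    # 2. Sum clipped gradients and add calibrated Gaussian noise.
    dim = len(per_example_grads[0])
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    sigma = noise_multiplier * clip_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]

    # 3. Average over the batch and apply a plain SGD step.
    batch = len(per_example_grads)
    return [-lr * x / batch for x in noisy]
```

Larger `noise_multiplier` values correspond to smaller (stricter) privacy budgets ε, which is the privacy–utility trade-off the study measures: more noise degrades accuracy but weakens membership inference.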
License
JACEM reserves the copyright for the published papers. Authors retain the right to use the content of the published paper, in part or in full, in their own work.