Kidney CT Scan Image Classification Using Modified Vision Transformer
Keywords:CNN, Classification, CT, MLP, Vision Transformer
With the rising number of kidney-related health issues, early and precise diagnosis is crucial. The study aims to create a reliable method for categorizing kidney CT scan images into four groups: Cyst, Normal, Tumor, and stone. Traditional approaches usually rely on typical Machine Learning (ML) and Convolution Neural Networks (CNNs). However, in this research, the potential of a novel model called Vision Transformer (ViT) is explored. ViT was initially designed for Natural Language Processing (NLP) tasks but shows promise for medical image classification. ViT’s capabilities are enhanced by coupling it with Fully Connected Networks (FCN). This combination helps to merge the feature extraction capability of the ViT and the classification ability of the FCN, which ultimately helps to overcome the challenge of detecting kidney-related issues.
How to Cite
Copyright (c) 2023 Roshan Subedi, Suresh Timilsina, Smita Adhikari
This work is licensed under a Creative Commons Attribution 4.0 International License.
CC BY: This license allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.