The agricultural sector has been the most productive, contributing significantly to the national and international economies. Not only does it contribute to about 4.3% of the global gross domestic product (GDP) but it is also responsible for the employment of a substantial proportion of the workforce in the world (26.40%). Tomato (
Solanum lycopersicum L.) is one of the most extensively grown and produced crops as evidenced by the fact that around 9 in 10 farmers grow tomatoes in their farms
(Agarwal et al., 2020). According to the (
FAO, 2023), it is the sixth most-grown vegetable in the world and its annual production has been estimated to be over 180 million.
However, several diseases and pests are causing the loss of crops each year, leading to decreased productivity (
Savary and Willocquet, 2020). Thus, it is critical to recognize them as soon as possible to reduce severe losses and boost yields. The conventional techniques of diagnosing plant diseases are labor-intensive and costly in terms of time and effort. These diseases are caused by plant pathogens including bacteria, fungi, viruses and nematodes
(Blancard et al., 2012; Prasad et al., 2020). Additionally, certain insects, such as sucking insect pests, feed on plant components and deficiencies in micronutrients have a significant impact on plant growth (
Castañé et al., 2020;
Kusanur and Chakravarthi, 2021;
Koike et al., 2023). The focus of this study is the viral diseases (tomato yellow leaf curl disease and the tomato mosaic disease), Fungal disease (target spot also called early blight) and insect (Two-spotted spider mite). In recent times, artificial intelligence (AI) has proven to be of great value in diverse fields involving the classification processes along with making the process of disease identification faster and more accurate (
Alzubi, 2023;
Kumar et al., 2023; Cho, 2024;
Wasik and Pattinson, 2024;
Hai and Duong, 2024;
Untari and Satria, 2022;
Wihardjo et al., 2024).
Early identification of plant diseases is important to combat their negative consequences related to production loss and poor yields. Using machine learning techniques such as convolutional neural network (CNN) may help in overcoming the limitations of ongoing human observation and laborious laboratory techniques. CNN’s primary job is to the identification of images and their classification. Its unique feature is that can be taught from the input object itself without the need for manual feature extraction (
Taye, 2023). Here, instead of looking at each pixel of an image individually, CNN looks at small pieces of the picture at a time. The construction closely follows the basic structure of the visual cortex found in primates as various stages of its learning process are comparable to the ventral pathway of the visual cortex in primates
(Khan et al., 2020). Basha et al. (2024) introduced Tomato Guard, a predictive model for tomato plant diseases. The model uses machine learning with environmental and plant health data. It was validated through field trials and showed better accuracy than traditional methods. The model adapts to different climates and supports sustainable farming. Its interface allows growers to upload images for diagnosis and treatment recommendations. This helps improve agricultural productivity and resilience.
Sood et al., (2024) proposed an analytical approach for tomato leaf disease detection using CNNs, focusing on ResNet50 and VGG16 architectures. Using a labeled dataset of 10,388 images with 10 disease classes, the study achieved training accuracies of 99.63% and 94.48% for ResNet50 and VGG16, respectively, at 20 epochs. The method leverages transfer learning and data augmentation, outperforming existing techniques.
Several studies attempted to develop a CNN-based model.
Agarwal et al. (2020) employed a CNN-based method on the Plant Village dataset of tomato leaves which contained infected and diseased leaves images belonging to 9 disease classes. They obtained classification accuracies that varied from 76% to 100% for different classes of diseases (10 classes: 1 healthy and 9 diseased). Their model was better than the using pre-trained models such as VGG16, InceptionV3 and MobileNet. The overall accuracy obtained for the model was 91.2%.
Tian et al. (2023) used the dataset of PlantVillage and a curated in-house dataset to develop a model for the detection of diseases. They tested three pre-trained models (VGG16, InceptionV3 and ResNet50) which were then used to develop a smartphone application Tomato Guard for the convenient identification of tomato diseases. Their model reached an accuracy of 99%.
The present study was conducted to develop a CNN model for classifying tomato leaf conditions. The model focuses on five categories: HC, TSSM, TYCLVD, TMV and TS. It uses a preprocessed dataset to enhance classification accuracy and reliability. This work aims to enable early and automated disease detection for better disease management and improved tomato yields.