Options
Enhancing data quality in image pre-processing: a case study on plant disease classification
Journal
2023 International Conference for Technological Engineering and its Applications in Sustainable Development (ICTEASD)
Date Issued
2023
Author(s)
Ge Ye Ong
INTI International College Penang
Nurzulaikha Abdullah
Universiti Malaysia Kelantan
Fakhitah Ridzuan
Universiti Malaysia Kelantan
Universiti Malaysia Perlis
Laith H. Alzubaidi
The Islamic University, Najaf, Iraq
DOI
10.1109/ICTEASD57136.2023.10584873
Abstract
Data quality plays a vital role in image pre-processing for machine learning applications. The effectiveness of building accurate and reliable models lies in the high-quality of data. Prioritizing data quality in the initial stages of the image pre-processing pipeline lays a strong foundation for subsequent machine learning stages. Therefore, this research aims to identify the appropriate steps for image pre-processing and compare the performance of machine learning model based on different pre-processing approaches. In this study, we utilized a plant diseases dataset sourced from Kaggle, comprising approximately 87,000 RGB images of both healthy and diseased crop leaves, categorized into 38 distinct classes. To significantly enhance the image quality, we implemented a range of transformation techniques, including resizing, normalization, data augmentation, and cropping. The analysis clearly indicates that implementing comprehensive pre-processing techniques enhances data quality and improves the machine learning model's classification performance.