Now showing 1 - 4 of 4
  • Publication
    Chatbot Application Training Using Natural Language Processing Techniques: Case of Small-Scale Agriculture
    ( 2024-06-07)
    Ong R.J.
    ;
    ; ;
    Choong K.Y.
    Tacit knowledge, which is based on first-hand experience and is more difficult to articulate, has evolved alongside natural languages as they are passed down through the years. In computing, Natural Language Processing (or NLP) refers to a set of methods for studying and modelling human languages that may be studied and represented automatically. Extracting or searching through vast bodies of unregulated text for specific information can be a complex and time-consuming process. Knowledge comes in several shapes and sizes, but can usually be differentiated into two types: structured or unstructured. Using NLP techniques, unstructured text data can be translated into a structured and well-organized database and then used for question-answering purposes. This paper is about the implementation of NLP techniques to convert unstructured text data into a structured database for Chatbot application training.
  • Publication
    Dynamic web-based knowledge management system (KMS) in small scale agriculture
    ( 2024-02-08)
    Ong R.J.
    ;
    ; ;
    Choong K.Y.
    ;
    ;
    Yacob Y.
    ;
    Nasir S.N.B.M.
    The shift from paper-based to web-based practice is a significant trend, as paper-based practice consumes time, is prone to error, and is more prone to data loss, all of which lead to inefficiency. Smart farming or agriculture 4.0 defined the new era of agriculture that moving toward digitalization. However, the absence of a centralized information system, which results in information asymmetry among agricultural sectors. One of the keys to success is reaching out to farmers with pertinent agricultural information at the appropriate time and platform. The purpose of this study is to design and create a dynamic web-based knowledge management system for the dissemination of agricultural knowledge in Malaysia. A centralized information system capable of achieving information symmetry, in which all important information is accessible to all participants. Creating an advanced learning environment allowing for virtual engagement between external and internal stakeholders via a web-based knowledge management system.
  • Publication
    Analysis On the Effect of The Length of Training and Test Set to The Accuracy of SARIMA Forecasting
    ( 2024-06-07)
    Choong K.Y.
    ;
    ; ;
    Ong R.J.
    The Seasonal Autoregressive Integrated Moving Average (SARIMA) models which built upon the ARIMA models to support seasonality, are used when it encounter the periodic time series data. It has been widely used in infectious disease prediction and other fields where data indicate a seasonal pattern. However, there are lacks of research focuses on the impact of the length of training and test data on the forecasting accuracy. In this study, the data is split into two parts: Training and Test data. The selected SARIMA model will be fitted for different lengths of training data and forecast the observations of the length of test data. This study aims to investigate how the length of training and test data affect the forecasting accuracy. In order to check it, the Mean Absolute Percentage Error (MAPE) for different lengths of training data are calculated and compared to study their relationship.
  • Publication
    Hybrid approach for vegetable price forecasting in electronic commerce platform
    ( 2024-06-01)
    Choong K.Y.
    ;
    ; ;
    Ong R.J.
    The significance of the agriculture sector in Malaysia is often overlooked, and there is a notable deficiency in the advancement of digitalization within the country’s agricultural domain. The integration of a price forecasting model in the platform enables the relevant parties, including farmers, to make informed decisions and plan their crop selection based on projected future prices. In this research, the authors proposed the hybrid approach with the combination of a linear model and a non-linear model in doing the vegetable price forecasting model. The hybrid model combining seasonal autoregressive integrated moving average (SARIMA)-discrete wavelet transform (DWT)-genetic algorithm neural network (GANN), referred to SARIMA-DWT-GANN, was used to forecast monthly vegetable prices in Malaysia. The historical vegetable price data is collected from the federal agricultural marketing authority Malaysia and split into training/test sets for modeling. The performance of the models is evaluated on the accuracy metrics including mean absolute error (MAE), mean absolute percentage error, and root mean square error (RMSE). The forecasted results using the proposed hybrid model are compared to those using the single SARIMA model. In conclusion, the hybrid SARIMA-DWT-GANN model is superior to the individual model, which obtained the smaller MAE and RMSE, and got the forecast accuracy of at least 95%.