Home
  • English
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Latviešu
  • Magyar
  • Nederlands
  • Português
  • Português do Brasil
  • Suomi
  • Log In
    New user? Click here to register. Have you forgotten your password?
Home
  • Browse Our Collections
  • Publications
  • Researchers
  • Research Data
  • Institutions
  • Statistics
    • English
    • Čeština
    • Deutsch
    • Español
    • Français
    • Gàidhlig
    • Latviešu
    • Magyar
    • Nederlands
    • Português
    • Português do Brasil
    • Suomi
    • Log In
      New user? Click here to register. Have you forgotten your password?
  1. Home
  2. Resources
  3. UniMAP Index Publications
  4. Publications 2022
  5. Clustering selected Terengganu’s rainfall stations based on persistent homology
 
Options

Clustering selected Terengganu’s rainfall stations based on persistent homology

Journal
Thai Journal of Mathematics
ISSN
16860209
Date Issued
2022-01-01
Author(s)
Gobithaasan R.U.
Zabidi Abu Hasan
Universiti Malaysia Perlis
Selvarajh K.D.
Wong K.S.
Mamat S.
Ali M.Z.M.
Miura K.T.
Dotko P.
Handle (URI)
https://hdl.handle.net/20.500.14170/5220
Abstract
Topological Data Analysis (TDA) is an emerging technique rooted from Algebraic Topology that reveals the geometrical structure of high-dimensional data sets. The approach in TDA is twofold; i.e. Persistent homology (PH) which quantifies topological invariants of a given data set, and Mapper which represents the high-dimensional data set into a 1D graph with nodes and edges. In this work, we employ PH as a tool to quantify the first dimensional holes (H1) in the daily rainfall data set collected between 2012 to 2017 from six rainfall stations located in Terengganu, Malaysia. We divided the rainfall data based on one year (365 days) resulting in each station having five sets of rainfall point clouds. Since a rainfall point cloud consists of 1D data set, direct comparison of rainfalls between stations may not show a clear pattern. Thus, we first embed them into point clouds of 10D with time delay τ = 13, using Takens embedding, preserving its original dynamical state. Next, we employ PH to generate persistence diagram to quantify 1D holes (H1) in the rainfall point clouds and record its maximum persistence (H1 lifespan), as its topological feature to characterize the distribution and intensity of rainfall. The first result is; based on past flood events, flood occurred when the year’s average persistence score exceeds 13. The second part of this work involves clustering the stations using two approaches; the standard dynamic time warping (DTW) method which matches the rainfall frequency before computing its dissimilarity distance; and the PH approach using five years maximum H1 lifespan as its distance matrix. The dendrograms produced by both clustering approaches are different, in which DTW has three distinct clusters, but dissimilar to its rainfall distribution. However, PH neatly ranks based on its annual rainfall intensity and recurrence, hence outperforming DTW approach.
Funding(s)
Kementerian Pendidikan Malaysia
Subjects
  • Clustering

  • Dynamic Time Warping

  • Persistent Homology

  • Rainfall

  • Time Series

File(s)
Clustering selected Terengganu’s rainfall stations based on persistent homology.pdf (103.8 KB)
Downloads
8
Acquisition Date
Mar 5, 2026
View Details
Views
2
Acquisition Date
Mar 5, 2026
View Details
google-scholar
  • About Us
  • Contact Us
  • Policies