Home
  • English
  • ÄŒeÅ¡tina
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • LatvieÅ¡u
  • Magyar
  • Nederlands
  • Português
  • Português do Brasil
  • Suomi
  • Log In
    New user? Click here to register. Have you forgotten your password?
Home
  • Browse Our Collections
  • Publications
  • Researchers
  • Research Data
  • Institutions
  • Statistics
    • English
    • ÄŒeÅ¡tina
    • Deutsch
    • Español
    • Français
    • Gàidhlig
    • LatvieÅ¡u
    • Magyar
    • Nederlands
    • Português
    • Português do Brasil
    • Suomi
    • Log In
      New user? Click here to register. Have you forgotten your password?
  1. Home
  2. Resources
  3. UniMAP Index Publications
  4. Publications 2019
  5. Gender Prediction by Speech Analysis
 
Options

Gender Prediction by Speech Analysis

Journal
Journal of Physics: Conference Series
ISSN
17426588
Date Issued
2019-11-22
Author(s)
Nazifa N.
Fook C.
Chin L.
Vijean V.
Kheng E.
DOI
10.1088/1742-6596/1372/1/012011
Handle (URI)
https://hdl.handle.net/20.500.14170/10288
Abstract
Speech is one of the important methods of communication for humans. The speech signal itself contain linguistic information that can be used to identify the speaker information such as gender, emotions and many more. There are some problems that involve in detection gender of the speaker. In forensic analysis, the police need to detect criminal profile from any evidence such as voice from any calls and while in healthcare aspect, some vocal fold pathologies can be bias to a particular gender such as vocal fold cyst can be seen particularly in female patients and the patient will have problem with their voices. Three features are extract from the speech signal which are Mel Frequency Cepstrum Coefficient (MFCC), Linear Prediction Coding (LPC) and Linear Prediction Coding Coefficient (LPCC). While for the classification, two classifier are used which are Support Vector Machine (SVM) and k-Nearest Neighbour (KNN). The recognition rate is higher for the combination of MFCC and LPCC compared to other features. SVM classifier had outperformed KNN classifier and obtained highest recognition rate of 97.45%. Lastly a graphical user interface system is develop that will record the voice of the speakers, pre-process the signal, extract MFCC and LPCC and classify it using SVM.
Thumbnail Image
google-scholar
Views
Downloads
  • About Us
  • Contact Us
  • Policies