Ir al contenido

Documat


Imputation of Rainfall Data Using the Sine Cosine Function Fitting Neural Network

  • Po Chan Chiu [1] ; Ali Selamat [1] ; Ondrej Krejcar [2] ; King Kuok Kuok [3] ; Enrique Herrera-Viedma [4] ; Giuseppe Fenza [5]
    1. [1] University of Technology Malaysia

      University of Technology Malaysia

      Malasia

    2. [2] University of Hradec Králové

      University of Hradec Králové

      Chequia

    3. [3] Swinburne University of Technology Sarawak Campus

      Swinburne University of Technology Sarawak Campus

      Malasia

    4. [4] Universidad de Granada

      Universidad de Granada

      Granada, España

    5. [5] University of Salerno

      University of Salerno

      Fisciano, Italia

  • Localización: IJIMAI, ISSN-e 1989-1660, Vol. 6, Nº. 7, 2021, págs. 39-48
  • Idioma: español
  • DOI: 10.9781/ijimai.2021.08.013
  • Enlaces
  • Resumen
    • Missing rainfall data have reduced the quality of hydrological data analysis because they are the essential input for hydrological modeling. Much research has focused on rainfall data imputation. However, the compatibility of precipitation (rainfall) and non-precipitation (meteorology) as input data has received less attention. First, we propose a novel pre-processing mechanism for non-precipitation data by using principal component analysis (PCA). Before the imputation, PCA is used to extract the most relevant features from the meteorological data. The final output of the PCA is combined with the rainfall data from the nearest neighbor gauging stations and then used as the input to the neural network for missing data imputation. Second, a sine cosine algorithm is presented to optimize neural network for infilling the missing rainfall data. The proposed sine cosine function fitting neural network (SC-FITNET) was compared with the sine cosine feedforward neural network (SCFFNN), feedforward neural network (FFNN) and long short-term memory (LSTM) approaches. The results showed that the proposed SC-FITNET outperformed LSTM, SC-FFNN and FFNN imputation in terms of mean absolute error (MAE), root mean square error (RMSE) and correlation coefficient (R), with an average accuracy of 90.9%. This study revealed that as the percentage of missingness increased, the precision of the four imputation methods reduced. In addition, this study also revealed that PCA has potential in pre-processing meteorological data into an understandable format for the missing data imputation.

  • Referencias bibliográficas
    • P. Muñoz, J. Orellana-Alvear, P. Willems, and R. Célleri. “Flash-flood forecasting in an Andean mountain catchment—Development of a stepwise...
    • S. Szewrański, J. Chruściński, J. Kazak, M. Świąder, K. TokarczykDorociak, and R. Żmuda, “Pluvial flood risk assessment tool (PFRA) for rainwater...
    • K.K. Kuok, S. Harun, S.M. Shamsuddin, and P.C. Chiu, “Evaluation of daily rainfall-runoff model using multilayer perceptron and particle swarm...
    • N. Yang, B.H. Men, and C.K. Lin, “Impact analysis of climate change on water resources,” Procedia Engineering, vol. 24, 2011, pp. 643-648.
    • K.K. Kuok, S. Harun, and P.C. Chiu, “Hourly runoff forecast at different leadtime for a small watershed using artificial neural networks,”...
    • R.A. Mcdonald, P.W. Thurston, and M.R. Nelson, “A Monte Carlo study of missing item methods,” Organizational Research Methods, vol. 3, no....
    • P.E. McKnight, K.M. McKnight, S. Sidani, and A.J. Figueredo, “Missing data: A gentle introduction,” Guilford Press. 2007.
    • K.J. Lee and J.B. Carlin, “Multiple imputation for missing data: Fully conditional specification versus multivariate normal imputation,” American...
    • Y. Gao, C. Merz, G. Lischeid, and M. Schneider, “A review on missing hydrological data processing,” Environmental earth sciences, vol. 77,...
    • S. Londhe, P. Dixit, S. Shah, and S. Narkhede, “Infilling of missing daily rainfall records using artificial neural network,” ISH Journal...
    • T. Canchala-Nastar, Y. Carvajal-Escobar, W. Alfonso-Morales, W.L. Cerón and E. Caicedo, “Estimation of missing data of monthly rainfall in...
    • P.C. Chiu, A. Selamat, O. Krejcar, and K.K. Kuok, “Missing rainfall data estimation using artificial neural network and nearest neighbor imputation,”...
    • M.R. Mispan, N.F.A. Rahman, M.F. Ali, K. Khalid, M.H.A. Bakar and S.H. Haron, “Missing river discharge data imputation approach using artificial...
    • P.C. Chiu, A. Selamat and O. Krejcar, “Infilling missing rainfall and runoff data for Sarawak, Malaysia using gaussian mixture model based...
    • W.Y. Lai, and K.K. Kuok, “A study on bayesian principal component analysis for addressing missing rainfall data,” Water Resources Management,...
    • S. Mirjalili, “SCA: A sine cosine algorithm for solving optimization problems,” Knowledge-Based Systems, vol. 96, 2016, pp. 120-133.
    • C. Qu, Z. Zeng, J. Dai, Z. Yi, and W. He, “A modified sine-cosine algorithm based on neighborhood search and greedy levy mutation,” Computational...
    • S. Das, A. Bhattacharya and A.K. Chakraborty, “Solution of short-term hydrothermal scheduling using sine cosine algorithm,” Soft Computing,...
    • S. Li, H. Fang, and X. Liu, “Parameter optimization of support vector regression based on sine cosine algorithm,” Expert Systems with Applications,...
    • M.A. Tawhid, and P. Savsani, “Discrete sine-cosine algorithm (DSCA) with local search for solving traveling salesman problem,” Arabian Journal...
    • R.E. Chandler, V.S. Isham, N.A. Leith, P.J. Northrop, C.J. Onof, and H.S. Wheater, “Uncertainty in rainfall inputs,” World Scientific/Imperial...
    • O. Stoner, and T. Economou, “An advanced hidden markov model for hourly rainfall time series,” arXiv:1906.03846. 2019.
    • T. Kashiwao, K. Nakayama, S. Ando, K. Ikeda, M. Lee and A. Bahadori, “A neural network-based local rainfall prediction system using meteorological...
    • M.H. Yen, D.W. Liu, Y.C. Hsin, C.E. Lin, and C.C. Chen, “Application of the deep learning for the prediction of rainfall in Southern Taiwan,”...
    • M. Chhetri, S. Kumar, P.P. Roy, and B.G. Kim, “Deep BLSTM-GRU model for monthly rainfall prediction: A case study of Simtokha, Bhutan,” Remote...
    • S.K. Grange, and D.C. Carslaw, “Using meteorological normalisation to detect interventions in air quality time series,” Science of the Total...
    • L.I. Smith, “A tutorial on principal components analysis”, 2002. Accessed: Jan. 3, 2020. [online]. Available: http://www.cs.otago.ac.nz/cosc453/...
    • C. Skittides, and W.G. Früh, “Wind forecasting using principal component analysis,” Renewable Energy, vol. 69, 2014, pp. 365-374.
    • M. Hubert, P.J. Rousseeuw, and W. Van den Bossche, “MacroPCA: An allin-one PCA method allowing for missing values as well as cellwise and...
    • Z. Zuśka, J. Kopcińska, E. Dacewicz, B. Skowera, J. Wojkowski, and A. Ziernicka–Wojtaszek, “Application of the principal component analysis...
    • Y.Y Choi, H. Shon, Y.J. Byon, D.K. Kim, S. Kang, “Enhanced application of principal component analysis in machine learning for imputation...
    • B.S. Harish, and S.V.A Kumar. “Anomaly based intrusion detection using modified fuzzy clustering,” International Journal of Interactive Multimedia...
    • T. Kurita, “Principal component analysis (PCA),” In: Ikeuchi K. (eds) Computer Vision: A Reference Guide, Springer, 2014.
    • K. Pearson, “Principal components analysis,” The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, vol. 6, no.2,...
    • H. Hotelling, “Analysis of a complex of statistical variables into principal components,” Journal of Educational Psychology, vol. 24, no....
    • R. Khattree, and D.N. Naik “Multivariate data reduction and discrimination with SAS software,” SAS Institute, 2000.
    • G. Bebis, and M. Georgiopoulos, “Feedforward neural networks,” IEEE Potentials, vol. 13, no. 4, 1994, pp. 27-31.
    • S. Mirjalili, “How effective is the Grey Wolf optimizer in training multilayer perceptrons,” Applied Intelligence, vol. 43, no.1, 2015, pp.150-161.
    • S. Hochreiter, and J. Schmidhuber, “Long short-term memory,” Neural Computing, vol. 9, no. 8, 1997, pp. 1735-1780.
    • E. Mussumeci, and F.C. Coelho, “Large-scale multivariate forecasting models for Dengue-LSTM versus random forest regression,” Spatial and...
    • S. Maya, and U. Ken, “DADIL: Data augmentation for domain-invariant learning,” Data Science and Pattern Recognition, vol. 4, no. 2, 2020,...
    • J.C.W. Lin, Y. Shao, Y. Djenouri and U. Yun, “ASRNN: A recurrent neural network with an attention model for sequence labeling,” KnowledgeBased...
    • Hourly meteorological dataset for Kuching station: 2002 to 2003, Malaysian Meteorological Department, Selangor, Malaysia, October 2019.
    • Hourly rainfall datasets for Sungai Merang station and nearest neighbor stations: 2002 to 2003, Department of Irrigation and Drainage (DID),...
    • A.J. Henry, N.D. Hevelone, S. Lipsitz, and L.L. Nguyen, “Comparative methods for handling missing data in large databases,” Journal of Vascular...
    • J.R. Cheema, “Some general guidelines for choosing missing data handling methods in educational research,” Journal of Modern Applied Statistical...
    • P. Zhu, Q. Xu, Q. Hu, C. Zhang, and H. Zhao, “Multi-label feature selection with missing labels,” Pattern Recognition, vol. 74, 2018, pp....
    • H. Hassani, M. Kalantari, and Z. Ghodsi, “Evaluating the performance of multiple imputation methods for handling missing values in time series...
    • S. Oba, M.A. Sato, I. Takemasa, M. Monden, K.I. Matsubara, and S. Ishii, “A Bayesian missing value estimation method for gene expression profile...
    • R.J. Little, and D.B. Rubin, “Statistical analysis with missing data,” John Wiley & Sons, 2014.
    • M.K. Gill, T. Asefa, Y. Kaheil, and M. McKee, “Effect of missing data on performance of learning algorithms for hydrologic predictions: Implications...
    • J. H. Lee, and Jr.J. Huber, “Multiple imputation with large proportions of missing data: How much is too much?,” In United Kingdom Stata Users’...
    • Q. Shang, Z. Yang, S. Gao, and D. Tan, “An imputation method for missing traffic data based on FCM optimized by PSO-SVR,” Journal of Advanced...
    • T. Kim, W. Ko, and J. Kim, “Analysis and impact evaluation of missing data imputation in day-ahead PV generation forecasting,” Applied Sciences,...
    • O.F. Ayilara, L. Zhang, T.T. Sajobi, R. Sawatzky, E. Bohm, and L.M. Lix, “Impact of missing data on bias and precision when estimating change...
    • S. Bai, J.Z. Kolter, and V. Koltun, “An empirical evaluation of generic convolutional and recurrent networks for sequence modeling,” arXiv...
    • A. Cecaj, M. Lippi, M. Mamei, and F. Zambonelli, “Comparing deep learning and statistical methods in forecasting crowd distribution from Aggregated...

Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno