quality assurance; quality software; machine learning; noisy dataset; class imbalance problems; Random Over Sampling; Kernel Principal Component Analysis; Staked Generalization