Optimization of the random forest method using principal component analysis to predict house prices : A case study of house prices in Malang City

Elmuna, Emha Ahdan Fahmi, Chamidy, Totok and Nugroho, Fresy ORCID: https://orcid.org/0000-0001-9448-316X (2023) Optimization of the random forest method using principal component analysis to predict house prices : A case study of house prices in Malang City. International Journal of Advances in Data and Information Systems, 4 (2). pp. 155-166. ISSN 2721-3056

16362.pdf - Published Version
Available under License Creative Commons Attribution Share Alike.

Abstract

Property is an attractive form of investment, and developers must price it carefully. Property prices tend to rise from year to year, in both the short and the long term, and rarely fall. Prices are often set according to the features of a house, such as its concept, location, and number of bedrooms. Random forest performs well at predicting house prices from such features. However, when too many variables are used, training takes longer and feature selection tends to pick uninformative features. Principal Component Analysis (PCA) is one way to reduce the number of features without discarding any of the original features outright. This research therefore combines PCA with Random Forest. In the model evaluation, the model using PCA had a smaller and more consistent error, averaging 0.018, whereas Random Forest alone, without PCA, had a higher error, averaging 0.03125. Training was also faster with PCA, averaging 7918 milliseconds, compared with 8975 milliseconds for Random Forest without PCA.
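
The comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation. It assumes scikit-learn and uses synthetic data in place of the Malang City house price dataset. The helper names (`make_synthetic_houses`, `fit_and_score`), the number of trees, the 95% explained-variance threshold for PCA, and the train/test split are all assumptions chosen for the example. RMSE is used as the error measure because it appears in the keywords.

```python
import time

import numpy as np
from sklearn.decomposition import PCA
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def make_synthetic_houses(n_samples=500, n_features=20, seed=0):
    """Hypothetical stand-in data: many correlated features driven by 4 latent factors."""
    rng = np.random.default_rng(seed)
    latent = rng.normal(size=(n_samples, 4))
    mixing = rng.normal(size=(4, n_features))
    X = latent @ mixing + 0.1 * rng.normal(size=(n_samples, n_features))
    y = latent @ np.array([3.0, -2.0, 1.0, 0.5]) + 0.1 * rng.normal(size=n_samples)
    return X, y


def fit_and_score(model, X_train, X_test, y_train, y_test):
    """Fit the model, returning (test RMSE, training time in milliseconds)."""
    start = time.perf_counter()
    model.fit(X_train, y_train)
    train_ms = (time.perf_counter() - start) * 1000.0
    rmse = float(np.sqrt(mean_squared_error(y_test, model.predict(X_test))))
    return rmse, train_ms


X, y = make_synthetic_houses()
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# Baseline: random forest on all original features.
rf_only = RandomForestRegressor(n_estimators=100, random_state=0)

# Variant: standardize, project onto the principal components that explain
# 95% of the variance (an assumed threshold), then fit the same forest.
pca_rf = make_pipeline(
    StandardScaler(),
    PCA(n_components=0.95),
    RandomForestRegressor(n_estimators=100, random_state=0),
)

results = {
    "rf_only": fit_and_score(rf_only, X_train, X_test, y_train, y_test),
    "pca_rf": fit_and_score(pca_rf, X_train, X_test, y_train, y_test),
}

n_kept = pca_rf.named_steps["pca"].n_components_
print(f"PCA kept {n_kept} of {X.shape[1]} dimensions")
for name, (rmse, train_ms) in results.items():
    print(f"{name}: RMSE={rmse:.4f}, training time={train_ms:.0f} ms")
```

On synthetic data the printed numbers say nothing about the paper's results. The sketch only shows where PCA sits in the pipeline and how error and training time can be measured side by side.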

Item Type: Journal Article
Keywords: house price prediction; random forest method; principal component analysis; data mining; regression; RMSE
Subjects: 08 INFORMATION AND COMPUTING SCIENCES > 0899 Other Information and Computing Sciences > 089999 Information and Computing Sciences not elsewhere classified
Divisions: Faculty of Technology > Department of Informatics Engineering
Depositing User: Totok Chamidy
Date Deposited: 15 Nov 2023 09:07
