TY - JOUR
T1 - Housing price prediction incorporating spatio-temporal dependency into machine learning algorithms
AU - Soltani, Ali
AU - Heydari, Mohammad
AU - Aghaei, Fatemeh
AU - Pettit, Christopher James
PY - 2022/12
Y1 - 2022/12
N2 - Conventional housing price prediction methods rarely consider the spatiotemporal non-stationary problem in a large data volumes. In this study, four machine learning (ML) models are used to explore the impacts of various features – i.e., property attributes and neighborhood quality - on housing price variations at different geographical scales. Using a 32-year (1984–2016) housing price dataset of Metropolitan Adelaide, Australia, this research relies on 428,000 sale transaction records and 38 explanatory variables. It is shown that non-linear tree-based models, such as Decision Tree, have perform better than linear models. In addition, ensemble machine learning techniques, such as Gradient-Boosting and Random Forest, are better at predicting future housing prices. A spatiotemporal lag (ST-lag) variable was added to improve the prediction accuracy of the models. The study demonstrates that ST-lag (or similar spatio-temporal indicator) can be a useful moderator of spatio-temporal effects in ML applications. This paper will serve as a catalyst for future research into the dynamics of the Australian property market, utilizing the benefits of cutting-edge technologies to develop models for business and property valuation at various geographical levels.
AB - Conventional housing price prediction methods rarely consider the spatiotemporal non-stationary problem in a large data volumes. In this study, four machine learning (ML) models are used to explore the impacts of various features – i.e., property attributes and neighborhood quality - on housing price variations at different geographical scales. Using a 32-year (1984–2016) housing price dataset of Metropolitan Adelaide, Australia, this research relies on 428,000 sale transaction records and 38 explanatory variables. It is shown that non-linear tree-based models, such as Decision Tree, have perform better than linear models. In addition, ensemble machine learning techniques, such as Gradient-Boosting and Random Forest, are better at predicting future housing prices. A spatiotemporal lag (ST-lag) variable was added to improve the prediction accuracy of the models. The study demonstrates that ST-lag (or similar spatio-temporal indicator) can be a useful moderator of spatio-temporal effects in ML applications. This paper will serve as a catalyst for future research into the dynamics of the Australian property market, utilizing the benefits of cutting-edge technologies to develop models for business and property valuation at various geographical levels.
KW - Australia
KW - Housing price
KW - Machine learning
KW - Spatio-temporal modelling
KW - Value estimation
UR - http://www.scopus.com/inward/record.url?scp=85136480857&partnerID=8YFLogxK
U2 - 10.1016/j.cities.2022.103941
DO - 10.1016/j.cities.2022.103941
M3 - Article
AN - SCOPUS:85136480857
SN - 0264-2751
VL - 131
JO - Cities
JF - Cities
M1 - 103941
ER -