Prediction of cost contingency in construction projects by introducing machine learning algorithms

DOI: https://doi.org/10.3846/jcem.2025.24913

Abstract

Construction projects are bound by uncertainties and changes by its nature. Thus, cost contingency needs to be allocated to construction project budget to cope with any deviation of actual costs from planned ones. However, existing methods for predicting cost contingencies, as studied and practiced, still present limitations in reliability and accuracy. Machine learning (ML) has gained popularity for enhancing prediction power in various fields. The paper aims to examine various ML algorithms to implement a cost contingency prediction model, employing both continuous and categorical predictor variables. To develop the model, construction transportation project datasets, which were bid between 2013‒2017, were collected from the Florida Department of Transportation (FDOT) website. To address imbalanced regression dataset issues, the synthetic minority over-sampling technique for regression with Gaussian noise (SMOGN) algorithm is introduced. ML random forest (RF) regression associated with random search hyperparameter optimization, achieved remarkably accurate predictions compared to extreme gradient boosting (XGBoost) regression and artificial neural network (ANN) models. The results also demonstrate that four parameters are significant factors in predicting construction cost contingency: project amount, project duration, and latitude and longitude factors. These findings provide new insights for researchers in developing models and for practitioners seeking more advanced method.

Keywords:

construction cost contingency, machine learning, RF, XGBoost, hyperparameter optimization, SMOGN, cost prediction

How to Cite

Nindartin, A., Park, S.-J., Lee, K.-T., Kim, J.-H., & Rostiyanti, S. F. (2025). Prediction of cost contingency in construction projects by introducing machine learning algorithms. Journal of Civil Engineering and Management, 31(8), 860–880. https://doi.org/10.3846/jcem.2025.24913

Share

Published in Issue
November 13, 2025
Abstract Views
52

References

Abt, K. (1987). Descriptive data analysis: a concept between confirmatory and exploratory data analysis. Methods of Information in Medicine, 26(2), 77–88. https://doi.org/10.1055/s-0038-1635488

Agrawal, T. (2021). Hyperparameter optimization in machine learning: Make your machine learning and deep learning models more efficient. Apress. https://doi.org/10.1007/978-1-4842-6579-6

Alpaydin, E. (2020). Introduction to machine learning. The MIT Press.

Alshboul, O., Shehadeh, A., Almasabha, G., & Almuflih, A. S. (2022). Extreme gradient boosting-based machine learning approach for green building cost prediction. Sustainability, 14(11), Article 6651. https://doi.org/10.3390/su14116651

Ameh, O. J., Soyingbe, A. A., & Odusami, K. T. (2010). Significant factors causing cost overruns in telecommunication projects in Nigeria. Journal of Construction in Developing Countries, 15(2), 49–67. https://ir.unilag.edu.ng/handle/123456789/8924

Ammar, T., Abdel-Monem, M., & El-Dash, K. (2022). Risk factors causing cost overruns in road networks. Ain Shams Engineering Journal, 13(5), Article 101720. https://doi.org/10.1016/j.asej.2022.101720

Ammar, T., Abdel-Monem, M., & El-Dash, K. (2025). Regression-based model predicting cost contingencies for road network projects. International Journal of Construction Management, 25(11), 1273–1287. https://doi.org/10.1080/15623599.2024.2411082

Anitescu, C., Atroshchenko, E., Alajlan, N., & Rabczuk, T. (2019). Artificial neural network methods for the solution of second order boundary value problems. Computers, Materials & Continua, 59(1), 345–359. https://doi.org/10.32604/cmc.2019.06641

Anjum, S., Khalid, R., Khan, M., Khan, N., & Park, C. (2021). A pull-reporting approach for floor opening detection using deep-learning on embedded devices. In Proceedings of the of the 38th International Symposium on Automation and Robotics in Construction (ISARC 2021) (pp. 395–402), Dubai, UAE. https://doi.org/10.22260/ISARC2021/0055

Arifuzzaman, M., Gazder, U., Islam, M. S., & Skitmore, M. (2022). Budget and cost contingency CART models for power plant projects. Journal of Civil Engineering and Management, 28(8), 680–695. https://doi.org/10.3846/jcem.2022.16944

Artur, M. (2021). Review the performance of the Bernoulli Naïve Bayes classifier in intrusion detection systems using recursive feature elimination with cross-validated selection of the best number of features. Procedia Computer Science, 190, 564–570. https://doi.org/10.1016/j.procs.2021.06.066

Asamoah, Oduro R., Offei-Nyako, K., & Twumasi-Ampofo, K. (2023). Relative importance of triggers influencing cost contingency determination for building contracts-the perspective of quantity surveyors. International Journal of Construction Management, 23(5), 790–798. https://doi.org/10.1080/15623599.2021.1930638

Association for the Advancement of Cost Engineering International. (2008). Contingency estimating-general principles (AACE Recommended Practice No. 40R-08, TCM Framework). https://www.pathlms.com/aace/courses/2928/documents/3825#

Baccarini, D. (2004). Accuracy in estimating project cost construction contingency-a statistical analysis. In Cobra 2004: RICS International Construction Conference, Responding to Change, London, United Kingdom. http://hdl.handle.net/20.500.11937/29859

Baccarini, D., & Love, P. E. (2014). Statistical characteristics of cost contingency in water infrastructure projects. Journal of Construction Engineering and Management, 140(3), Article 04013063. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000820

Bakhshi, P., & Touran, A. (2014). An overview of budget contingency calculation methods in construction industry. Procedia Engineering, 85, 52–60. https://doi.org/10.1016/j.proeng.2014.10.528

Bekkerman, R. (2015). The present and the future of the KDD Cup Competition: An outsider’s perspective [Post]. LinkedIn. https://www.linkedin.com/pulse/present-future-kdd-cup-competition-outsiders-ron-bekkerman/

Bergstra, J., & Bengio, Y. (2012). Random search for hyper-parameter optimization. Journal of Machine Learning Research, 13(2), 281–305.

Bilal, M., & Oyedele, L. O. (2020). Guidelines for applied machine learning in construction industry – A case of profit margins estimation. Advanced Engineering Informatics, 43, Article 101013. https://doi.org/10.1016/j.aei.2019.101013

Bilal, M., Oyedele, L. O., Qadir, J., Munir, K., Ajayi, S. O., Akinade, O. O., Owolabi, H. A., Alaka, H. A., & Pasha, M. (2016). Big data in the construction industry: A review of present status, opportunities, and future trends. Advanced Engineering Informatics, 30(3), 500–521. https://doi.org/10.1016/j.aei.2016.07.001

Branco, P., Torgo, L., & Ribeiro, R. P. (2017). SMOGN: a pre-processing approach for imbalanced regression. In First International Workshop on Learning with Imbalanced Domains: Theory and Applications (Vol. 74, pp. 36–50), Skopje, Macedonia.

Branco, P., Torgo, L., & Ribeiro, R. P. (2019). Pre-processing approaches for imbalanced distributions in regression. Neurocomputing, 343, 76–99. https://doi.org/10.1016/j.neucom.2018.11.100

Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140. https://link.springer.com/article/10.1007/BF00058655

Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://link.springer.com/article/10.1023/a:1010933404324

Cantarelli, C. C., Flyvbjerg, B., & Buhl, S. L. (2012). Geographical variation in project cost performance: the Netherlands versus worldwide. Journal of Transport Geography, 24, 324–31. https://doi.org/10.1016/j.jtrangeo.2012.03.014

Cao, Y., Ashuri, B., & Baek, M. (2018). Prediction of unit price bids of resurfacing highway projects through ensemble machine learning. Journal of Computing in Civil Engineering, 32(5), Article 04018043. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000788

Catalão, F. P., Cruz, C. O., & Sarmento, J. M. (2019). The determinants of cost deviations and overruns in transport projects, an endogenous model approach. Transport Policy, 74, 224–238. https://doi.org/10.1016/j.tranpol.2018.12.008

Cha, G. W., Moon, H. J., & Kim, Y. C. (2021). Comparison of random forest and gradient boosting machine models for predicting demolition waste based on small datasets and categorical variables. International Journal of Environmental Research Public Health, 18(16), Article 8530. https://doi.org/10.3390/ijerph18168530

Chakraborty, D., & Elzarka, H. (2019). Advanced machine learning techniques for building performance simulation: A comparative analysis. Journal of Building Performance Simulation, 12(2), 193–207. https://doi.org/10.1080/19401493.2018.1498538

Chan, E. H., & Au, M. C. (2008). Relationship between organizational sizes and contractors’ risk pricing behaviors for weather risk under different project values and durations. Journal of Construction Engineering and Management, 134(9), 673–680. https://doi.org/10.1061/(ASCE)0733-9364(2008)134:9(673)

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’16) (pp. 785–794). Association for Computing Machinery. https://doi.org/10.1145/2939672.2939785

Chen, D., & Hartman, F. T. (2000). A neural network approach to risk assessment and contingency allocation. AACE International Transactions.

Clark, D. E. (2001). Monte Carlo analysis: Ten years of experience. Cost Engineering, 43(6), 40–45.

Daly, A., Dekker, T., & Hess, S. (2016). Dummy coding vs effects coding for categorical variables: Clarifications and extensions. Journal of Choice Modelling, 21, 36–41. https://doi.org/10.1016/j.jocm.2016.09.005

De Marco, A., Rafele, C., & Thaheem, M. J. (2016). Dynamic management of risk contingency in complex design-build projects. Journal of Construction Engineering and Management, 142(2), Article 04015080. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001052

Dey, P., Tabucanon, M. T., & Ogunlana, S. O. (1994). Planning for project control through risk analysis: A petroleum pipeline-laying project. International Journal of Project Management, 12(1), 23–33. https://doi.org/10.1016/0263-7863(94)90006-X

Dietterich, T. G. (2000). An experimental comparison of three methods for constructing ensembles of decision trees: Bagging, boosting, and randomization. Machine Learning, 40, 139–157. https://link.springer.com/article/10.1023/a:1007607513941

Dietterich, T. G. (2002). Ensemble learning. In M. A. Arbib (Ed.), The handbook of brain theory and neural networks (2nd ed.) (pp. 405–408). The MIT Press.

El-Kholy, A. M., Tahwia, A. M., & Elsayed, M. M. (2022). Prediction of simulated cost contingency for steel reinforcement in building projects: ANN versus regression-based models. International Journal of Construction Management, 22(9), 1675–1689. https://doi.org/10.1080/15623599.2020.1741492

Elmousalami, H. H. (2020). Comparison of artificial intelligence techniques for project conceptual cost prediction: a case study and comparative analysis. IEEE Transactions on Engineering Management, 68(1), 183–196. https://doi.org/10.1109/TEM.2020.2972078

El-Touny, A. S., Ibrahim, A. H., & Amer, M. I. (2014). Estimating cost contingency for highway construction projects using analytic hierarchy process. International Journal of Computer Science Issues, 11(6), Article 73.

Enshassi, A., & Ayyash, A. (2014). Factors affecting cost contingency in the construction industry–contractors’ perspective. International Journal of Construction Management, 14(3), 191–208. https://doi.org/10.1080/15623599.2014.922729

Espinoza, R. D. (2011). Contingency estimating using option pricing theory: Closing the gap between theory and practice. Construction Management and Economics, 29(9), 913–927. https://doi.org/10.1080/01446193.2011.610328

Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems?. Journal of Machine Learning Research, 15(1), 3133–3181.

Flyvbjerg, B., Holm, M. S., & Buhl, S. (2002). Underestimating costs in public works projects: Error or lie?. Journal of the American Planning Association, 68(3), 279–295. https://doi.org/10.1080/01944360208976273

Gharaibeh, H. M. (2014). Cost control in mega projects using the Delphi method. Journal of Management in Engineering, 30(5), Article 04014024. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000218

Ghimire, B., Rogan, J., Galiano, V. R., Panday, P., & Neeti, N. (2012). An evaluation of bagging, boosting, and random forests for land-cover classification in Cape Cod, Massachusetts, USA. GIScience & Remote Sensing, 49(5), 623–643. https://doi.org/10.2747/1548-1603.49.5.623

Gholamy, A., Kreinovich, V., & Kosheleva, O. (2018). Why 70/30 or 80/20 relation between training and testing sets: A pedagogical explanation (Technical report). University of Texas at El Paso Computer Science.

Günhan, S., & Arditi, D. (2007). Budgeting owner’s construction contingency. Journal of Construction Engineering and Management, 133(7), 492–497. https://doi.org/10.1061/(ASCE)0733-9364(2007)133:7(492)

Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3, 1157–1182.

Hall, P. (1994). Methodology and theory for the bootstrap. In R. F. Engle & D. L. McFadden (Eds.), Handbook of econometrics (Vol. 4, pp. 2341–2381). Elsevier Inc. https://doi.org/10.1016/S1573-4412(05)80008-X

Hamid, R. A., & Kehinde, F. J. (2017). Choosing an appropriate contingency sum estimating methods for highway construction projects in Nigeria: A literature review. Planning Malaysia Journal, 15(1). https://doi.org/10.21837/pm.v15i1.217

Hartman, F. T. (2000). Don’t park your brain outside: A practical guide to improving shareholder value with SMART management. Project Management Institute.

Hashemi, T. S., Ebadati, O. M., & Kaur, H. (2020). Cost estimation and prediction in construction projects: A systematic review on machine learning techniques. SN Applied Sciences, 2, Article 1703. https://doi.org/10.1007/s42452-020-03497-1

Hollmann, J. K. (2012). Estimate accuracy: Dealing with reality. Cost Engineering-Morgantown, 54(6), Article 17.

Hoseini, E., Bosch-Rekveldt, M., & Hertogh, M. (2020a). Cost contingency and cost evolvement of construction projects in the preconstruction phase. Journal of Construction Engineering and Management, 146(6), Article 05020006. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001842

Hoseini, E., Van Veen, P., Bosch-Rekveldt, M., & Hertogh, M. (2020b). Cost performance and cost contingency during project execution: Comparing client and contractor perspectives. Journal of Management in Engineering, 36(4), Article 05020006. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000772

Huang, C.-H., & Hsien, S.-H. (2020). Predicting BIM labor cost with random forest and simple linear regression. Automation in Construction, 118, Article 103280. https://doi.org/10.1016/j.autcon.2020.103280

Jiang, Q. (2020). Estimation of construction project building cost by back-propagation neural network. Journal of Engineering, Design and Technology, 18(3), 601–609. https://doi.org/10.1108/JEDT-08-2019-0195

Karlsen, J. K., & Lereim, J. (2005). Management of project contingency and allowance. Cost Engineering, 47(9), 24–29.

Kasimu, M. A. (2012). Significant factors that cause cost overruns in building construction project in Nigeria. Interdisciplinary Journal of Contemporary Research in Business, 3(11), 775–780.

Kaur, H., Pannu, H. S., & Malhi, A. K. (2019). A systematic review on imbalanced data challenges in machine learning: Applications and solutions. ACM Computing Surveys, 52(4), Article 79. https://doi.org/10.1145/3343440

Kim, G. H., An, S. H., & Kang, K. I. (2004). Comparison of construction cost estimating models based on regression analysis, neural networks, and case-based reasoning. Building and Environment, 39(10), 1235–1242. https://doi.org/10.1016/j.buildenv.2004.02.013

Kunz, N. (2020). SMOGN: Synthetic minority over-sampling technique for regression with Gaussian noise. https://pypi.org/project/smogn/

Kursa, M. B., & Rudnicki, W. R. (2010). Feature selection with the Boruta package. Journal of Statistical Software, 36, 1–13. https://doi.org/10.18637/jss.v036.i11

Larsen, J. K., Shen, G. Q., Lindhard, S. M., & Brunoe, T. D. (2016). Factors affecting schedule delay, cost overrun, and quality level in public construction projects. Journal of Management in Engineering, 32(1), Article 04015032. https://doi.org/10.1061/(ASCE)ME.1943-5479.0000391

Laryea, S., & Hughes, W. (2009). How contractors in Ghana include risk in their bid prices. In Proceedings of 25th Annual ARCOM Conference (pp. 1295–1304), Nottingham, UK. Association of Researchers in Construction Management.

Lathong, K., & Wisaeng, K. (2024). An innovative hybrid machine learning techniques for predicting construction cost estimates. International Journal for Computational Civil and Structural Engineering, 20(3), 69–83.

Lhee, S. C. (2014). Finding significant factors to affect cost contingency on construction projects using ANOVA statistical method-focused on transportation construction projects in the US. Architectural Research, 16(2), 75–80. https://doi.org/10.5659/AIKAR.2014.16.2.75

Lhee, S. C., Issa, R. R., & Flood, I. (2012). Prediction of financial contingency for asphalt resurfacing projects using artificial neural networks. Journal of Construction Engineering and Management, 138(1), 22–30. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000408

Lhee, S. C., Flood, I., & Issa, R. R. (2014). Development of a two-step neural network-based model to predict construction cost contingency. Journal of Information Technology in Construction (ITcon), 19(24), 399–411.

Lhee, S. C., Issa, R. R., & Flood, I. (2016). Using particle swarm optimization to predict cost contingency on transportation construction projects. Journal of Information Technology in Construction (ITcon), 21(30), 504–516.

Li, F., Laili, Y., Chen, X., Lou, Y., Wang, C., Yang, H., Gao, X., & Han, H. (2023). Towards big data driven construction industry. Journal of Industrial Information Integration, 35, Article 100483. https://doi.org/10.1016/j.jii.2023.100483

Love, P. E. D., Sing, C. P., Wang, X., Irani, Z., & Thwala, D. W. (2014). Overruns in transportation infrastructure projects. Structure and Infrastructure Engineering, 10, 141–159. https://doi.org/10.1080/15732479.2012.715173

Mahamid, I. (2013). Effects of project’s physical characteristics on cost deviation in road construction. Journal of King Saud University-Engineering Sciences, 25(1), 81–88. https://doi.org/10.1016/j.jksues.2012.04.001

Mahmoodzadeh, A., Nejati, H. R., & Mohammadi, M. (2022a). Optimized machine learning modelling for predicting the construction cost and duration of tunnelling projects. Automation in Construction, 139, Article 104305. https://doi.org/10.1016/j.autcon.2022.104305

Mahmoodzadeh, A., Nejati, H. R., Mohammadi, M., Ibrahim, H. H., Khishe, M., Rashidi, S., & Mohammed, A. H. (2022b). Developing six hybrid machine learning models based on gaussian process regression and meta-heuristic optimization algorithms for prediction of duration and cost of road tunnels construction. Tunnelling and Underground Space Technology, 130, Article 104759. https://doi.org/10.1016/j.tust.2022.104759

Mak, S., Wong, J., & Picken, D. (1998). The effect on contingency allowances of using risk analysis in capital cost estimating: A Hong Kong case study. Construction Management and Economics, 16(6), 615–619. https://doi.org/10.1080/014461998371917

Manu, P., Ankrah, N., Proverbs, D., & Suresh, S. (2010). An approach for determining the extent of contribution of construction project features to accident causation. Safety Science, 48(6), 687–692. https://doi.org/10.1016/j.ssci.2010.03.001

Meharie, M. G., & Shaik, N. (2020). Predicting highway construction costs: Comparison of the performance of random forest, neural network and support vector machine models. Journal of Soft Computing in Civil Engineering, 4(2), 103–112. https://doi.org/10.22115/scce.2020.226883.1205

Moselhi, O. (1997). Risk assessment and contingency estimating. In AACE International Transactions, Dallas, USA.

Myers, J. L., Well, A., & Lorch, R. F. (2010). Research design and statistical analysis. Routledge.

Nawar, S., Hosny, O., & Nassar, K. (2018). Owner time and cost contingency estimation for building construction projects in Egypt. In Construction Research Congress 2018 (pp. 367–377), New Orleans, Louisiana, USA. https://doi.org/10.1061/9780784481271.036

Nitithamyong, P., & Skibniewski, M. J. (2004). Web-based construction project management systems: how to make them successful?. Automation in Construction, 13(4), 491–506. https://doi.org/10.1016/j.autcon.2004.02.003

Opitz, D., & Maclin, R. (1999). Popular ensemble methods: An empirical study. Journal of Artificial Intelligence Research, 11, 169–198. https://doi.org/10.1613/jair.614

Peško, I., Mučenski, V., Šešlija, M., Radović, N., Vujkov, A., Bibić, D., & Krklješ, M. (2017). Estimation of costs and durations of construction of urban roads using ANN and SVM. Complexity, 2017, Article 2450370. https://doi.org/10.1155/2017/2450370

Petrusheva, S., Car-Pušić, D., & Zileska-Pancovska, V. (2019). Support vector machine based hybrid model for prediction of road structures construction costs. IOP Conference Series: Earth and Environmental Science, 222(1), Article 012010. https://doi.org/10.1088/1755-1315/222/1/012010

Polikar, R. (2006). Ensemble based systems in decision making. IEEE Circuits Systems Magazine, 6(3), 21–45. https://doi.org/10.1109/MCAS.2006.1688199

Querns, W. R. (1989). What is contingency, anyway?. In AACE International Transactions.

Rafiei, M. H., & Adeli, H. (2018). Novel machine-learning model for estimating construction costs considering economic variables and indexes. Journal of Construction Engineering and Management, 144(12), Article 04018106. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001570

Sagi, O., & Rokach, L. (2018). Ensemble learning: A survey. WIREs Data Mining and Knowledge Discovery, 8(4), Article e1249. https://doi.org/10.1002/widm.1249

Salah, A., & Moselhi, O. (2015). Contingency modelling for construction projects using fuzzy-set theory. Engineering, Construction and Architectural Management, 22(2), 214–241. https://doi.org/10.1108/ECAM-03-2014-0039

Samaniego, E., Anitescu, C., Goswami, S., Nguyen-Thanh, V. M., Guo, H., Hamdia, K., Zhuang, X., & Rabczuk, T. (2020). An energy approach to the solution of partial differential equations in computational mechanics via machine learning: Concepts, implementation and applications. Computer Methods in Applied Mechanics and Engineering, 362, Article 112790. https://doi.org/10.1016/j.cma.2019.112790

Shardlow, M. (2016). An analysis of feature selection techniques. The University of Manchester, United Kingdom.

Shoar, S., Chileshe, N., & Edwards, J. D. (2022). Machine learning-aided engineering services’ cost overruns prediction in high-rise residential building projects: Application of random forest regression. Journal of Building Engineering, 50, Article 104102. https://doi.org/10.1016/j.jobe.2022.104102

Shrestha, K. K., & Shrestha, P. P. (2016). A cost contingency estimation system for road maintenance contracts. Procedia Engineering, 145, 128–135. https://doi.org/10.1016/j.proeng.2016.04.030

Smith, G. R., & Bohn, C. M. (1999). Small to medium contractor contingency and assumption of risk. Journal of Construction Engineering and Management, 125(2), 101–108. https://doi.org/10.1061/(ASCE)0733-9364(1999)125:2(101)

Snoek, J., Larochelle, H., & Adams, R. P. (2012). Practical Bayesian optimization of machine learning algorithms. arXiv. https://doi.org/10.48550/arXiv.1206.2944

Sonmez, R., Ergin, A., & Birgonul, M. T. (2007). Quantitative methodology for determination of cost contingency in international projects. Journal of Management in Engineering, 23(1), 35–39. https://doi.org/10.1061/(ASCE)0742-597X(2007)23:1(35)

Thal, J. A. E., Cook, J. J., & White III, E. D. (2010). Estimation of cost contingency for air force construction projects. Journal of Construction Engineering and Management, 136(11), 1181–1188. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000227

Thompson, P. A., & Perry, J. G. (1992). Engineering construction risks: A guide to project risk analysis and assessment implications for project clients and project managers. Thomas Telford.

Torgo, L., Ribeiro, R. P., Pfahringer, B., & Branco, P. (2013). Smote for regression. In Proceedings of the 16th Portuguese Conference on Artificial Intelligence Progress in Artificial Intelligence (EPIA 2013) (pp. 378–389), Angra do Heroísmo, Azores, Portugal. https://doi.org/10.1007/978-3-642-40669-0_33

Touran, A., & Lopez, R. (2006). Modeling cost escalation in large infrastructure projects. Journal of Construction Engineering and Management, 132(8), 853–860. https://doi.org/10.1061/(ASCE)0733-9364(2006)132:8(853)

Verrelst, J., Muñoz, J., Alonso, L., Delegido, J., Rivera, J. P., Camps-Valls, G., & Moreno, J. (2012). Machine learning regression algorithms for biophysical parameter retrieval: Opportunities for sentinel-2 and-3. Remote Sensing of Environment, 118, 127–139. https://doi.org/10.1016/j.rse.2011.11.002

Wang, M. T., & Chou, H. Y. (2003). Risk allocation and risk handling of highway projects in Taiwan. Journal of Management in Engineering, 19(2), 60–68. https://doi.org/10.1061/(ASCE)0742-597X(2003)19:2(60)

Wang, Z., & Srinivasan, R. S. (2017). A review of artificial intelligence-based building energy use prediction: Contrasting the capabilities of single and ensemble prediction models. Renewable and Sustainable Energy Reviews, 75, 796–808. https://doi.org/10.1016/j.rser.2016.10.079

Wang, X., Zhu, J., Ma, F., Li, C., Cai, Y., & Yang, Z. (2016). Bayesian network-based risk assessment for hazmat transportation on the middle route of the South-to-North water transfer project in China. Stochastic Environmental Research and Risk Assessment, 30, 841–857. https://doi.org/10.1007/s00477-015-1113-6

Wang, C. C., Kuo, P. H., & Chen, G. Y. (2022). Machine learning prediction of turning precision using optimized XGBoost model. Applied Sciences, 12(15), Article 7739. https://doi.org/10.3390/app12157739

Won, D., Park, M. W., & Chi, S. (2018). Construction resource localization based on UAV-RFID platform using machine learning algorithm. In 2018 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM) (pp. 1086–1090). IEEE. https://doi.org/10.1109/IEEM.2018.8607668

Wu, W., Kunz, N., & Branco, P. (2022). ImbalancedLearningRegression – A Python package to tackle the imbalanced regression problem. In M. R. Amini, S. Canu, A. Fischer, T. Guns, P. Kralj Novak, & G. Tsoumakas (Eds.), Lecture notes in computer science. Vol. 13718: Machine learning and knowledge discovery in databases (ECML PKDD 2022) (pp. 645–648). Springer, Cham. https://doi.org/10.1007/978-3-031-26422-1_48

Yan, H., He, Z., Gao, C., Xie, M., Sheng, H., & Chen, H. (2022). Investment estimation of prefabricated concrete buildings based on XGBoost machine learning algorithm. Advanced Engineering Informatics, 54, Article 101789. https://doi.org/10.1016/j.aei.2022.101789

Yeo, K. T. (1990). Risks, classification of estimates, and contingency management. Journal of Management in Engineering, 6(4), 458–470. https://doi.org/10.1061/(ASCE)9742-597X(1990)6:4(458)

Zekić-Sušac, M., Has, A., & Knežević, M. (2021). Predicting energy cost of public buildings by artificial neural networks, CART, and random forest. Neurocomputing, 439, 223–233. https://doi.org/10.1016/j.neucom.2020.01.124

Zheng, Z., Zhou, L., Wu, H., & Zhou, L. (2023). Construction cost prediction system based on Random Forest optimized by the Bird Swarm Algorithm. Mathematical Biosciences and Engineering, 20(8), 15044–15074. https://doi.org/10.3934/mbe.2023674

Zhou, J., Li, E., Wei, H., Li, C., Qiao, Q., & Armaghani, D. J. (2019). Random forests and cubist algorithms for predicting shear strengths of rockfill materials. Applied Sciences, 9(8), Article 1621. https://doi.org/10.3390/app9081621

Zhu, X., Chu, J., Wang, K., Wu, S., Yan, W., & Chiam, K. (2021). Prediction of rockhead using a hybrid N-XGBoost machine learning framework. Journal of Rock Mechanics and Geotechnical Engineering, 13(6), 1231–1245. https://doi.org/10.1016/j.jrmge.2021.06.012

View article in other formats

CrossMark check

CrossMark logo

Published

2025-11-13

Issue

Section

Articles

How to Cite

Nindartin, A., Park, S.-J., Lee, K.-T., Kim, J.-H., & Rostiyanti, S. F. (2025). Prediction of cost contingency in construction projects by introducing machine learning algorithms. Journal of Civil Engineering and Management, 31(8), 860–880. https://doi.org/10.3846/jcem.2025.24913

Share