FORMATION OF BIM MODEL DATASET FOR MACHINE LEARNING TASKS

Authors

DOI:

https://doi.org/10.31649/2311-1429-2025-1-87-94

Keywords:

BIM, Revit, optimization of constructions, Dynamo, Python, Machine Learning, FEA.

Abstract

This paper presents a reproducible approach to deriving machine-learning-ready datasets from Autodesk Revit Building
Information Models. We unify four complementary export routes‒Schedules→CSV, Dynamo→CSV/XLSX/JSON,
pyRevit/Revit API→CSV/JSON/XLSX, and IFC→Autodesk Platform Services/Speckle→JSON/CSV‒under a single data
contract: a target schema/ontology with mandatory fields, SI units, a stable key policy (IfcGUID/composite key), and formal

data-quality validation. The methodology covers ingestion, normalization of types and units, construction of geometric
descriptors, checks of completeness and referential integrity, and feature preparation for reliable train/validation/test splits.
Provided a practical protocol and a final Elements.csv dataset used in a baseline task that predicts element mass as a
proxy for cost and logistics; templates for quality checks accompany the flow. Across applied scenarios‒production-rate
estimation, cost approximation, and optimization of layout and load-bearing systems‒we show that disciplined data
management (schema, keys, units, rules) reduces information leakage and improves reproducibility.
The contribution aligns tool-specific export routes with a uniform representation, enabling integration of BIM data with
analytics without dependence on a software stack and establishing a foundation for data-centric design and subsequent
optimization. We also document reproducibility artefacts (JSON/Table Schema definitions, DQI reports, and processing logs)
to support audit and repetition.

Author Biographies

Valeriy Andrukhov, Vinnytsia National Technical University

PhD, Associate Professor

Andriy Potіekha, Vinnytsia National Technical University

student, Department of Civil and Environmental Engineering

References

Kyshkan, A., & Karkhut, I. (2024). Potential for Automation and Acceleration of Construction Processes Using

Machine Learning and Artificial Intelligence. Conference Proceedings of the International Scientific Center of

Development (Uzhhorod, Ukraine, 17 May 2024), 354–358. URL:

https://archive.mcnd.org.ua/index.php/conference-proceeding/article/view/1231

Syomko, P., & Levus, Ye. (2024). Using Machine Learning to Determine the Architectural Style of Buildings.

Conference Proceedings of MNL (Kyiv, 17 May 2024), 346–347. URL:

https://archive.liga.science/index.php/conference-proceedings/article/view/964

Adamenko, V. (2022). Experience of Implementing BIM Technologies in the Educational Process at the

Department of Metal and Timber Structures of KNUCA. Building Constructions. Theory and Practice. (Online

publication). URL: https://bctp.knuba.edu.ua/article/view/260026

Yasnii, V. P., & Meshcheriakova, O. M. (2022). BIM: An Effective Tool for the Reconstruction of Buildings and

Structures. Modern Technologies and Methods of Calculations in Construction, 18, 61–70.

https://doi.org/10.36910/6775-2410-6208-2022-8(18)-08

Lialiuk, O. H., Osypenko, R. S., & Melnyk, D. O. (2024). Decision Support Systems in Construction Projects

Based on Bayesian Networks. Modern Technologies, Materials and Structures in Construction, 36(1), 96–102.

DOI: https://doi.org/10.31649/2311-1429-2024-1-96-102

Lialiuk, O. H., & Osypenko, R. S. (2023). Features of the Implementation of Artificial Intelligence in Construction.

Modern Technologies, Materials and Structures in Construction, 35(2), 172–176. DOI:

https://doi.org/10.31649/2311-1429-2023-2-172-176

Nazirov, E. K., Nazirova, T. O., & Karpenko, M. Yu. (2018). Methods of Data Collection and Classification Using

a Soundlet Bayesian Neural Network. Bulletin of the Kherson National Technical University, 3(1), 332–337.

URL:http://nbuv.gov.ua/UJRN/Vkhdtu_2018_3(1)__49

buildingSMART International. (2024). IFC 4.3 Formally Approved and Published as an ISO Standard (ISO

. URL: https://www.buildingsmart.org/ifc-4-3-formally-approved-and-published-as-an-iso-standard/

Boiko, O. A., & Kovalchuk, A. S. (2023). The Potential of Automation and Acceleration of Construction Processes

through Machine Learning and Artificial Intelligence. Modern Construction and Architecture, 3, 85–92. URL:

https://archive.mcnd.org.ua/index.php/conference-proceeding/article/view/1231

Autodesk Help. (2024). Export a Schedule (Revit 2024). URL:

https://help.autodesk.com/view/RVTLT/2024/ENU/?guid=GUID-B2CCAC4F-1D38-4D5D-B4D1-

D1B7EBE

Kravchenko, O. V., & Tymoshenko, P. I. (2023). Concepts of Genetic Algorithms and Their Application to

Optimization Problems. Computer Technologies of Data Processing, URL:

https://jktod.donnu.edu.ua/article/view/16228

Zheng, H., Moosavi, V., & Akbarzadeh, M. (2020). Machine Learning Assisted Evaluations in Structural Design

and Construction. Automation in Construction, 114, 103346. URL: https://doi.org/10.1016/j.autcon.2020.103346

scikit-learn. (2024–2025). User Guide (stable): Model Selection & Evaluation. URL: https://scikitlearn.

org/stable/user_guide.html

Boiko, N. I., & Blazhevskyi, S. H. (2022). Method of Determining the Structure of the Model of Optimal

Complexity. Herald of Khmelnytskyi National University, 2(307), 7–13. DOI: https://doi.org/10.31891/2307-

-2022-307-2-7-13

Pyrih, Y., Klymash, M., Pyrih, Y., & Lavriv, O. (2023). Genetic Algorithm as a Tool for Solving Optimisation

Problems. Information and Communication Technologies, Electronic Engineering, 3(2), 95–107. DOI:

https://doi.org/10.23939/ictee2023.02.095

Stetsyuk, P. I., Vakulenko, D. V., & Stetsyuk, M. H. (2023). r-Algorithm for Learning Linear Regression Models.

Problems of Applied Mathematics and Mathematical Modeling, 23. DOI: https://doi.org/10.15421/322324

Rochefort-Beaudoin, T., Vadean, A., Aage, N., & Achiche, S. (2024). Structural Design Through Reinforcement

Learning. arXiv, arXiv:2407.07288. DOI: https://doi.org/10.48550/arXiv.2407.07288

Kumar, A., & Singh, R. (2023). Application of Artificial Intelligence in Structural Engineering. In Proceedings of

the 2023 ASCE International Conference on Computing in Civil Engineering (pp. 350–359). DOI:

https://doi.org/10.1061/9780784485231.044

Sadeghi Eshkevari, S., Sadeghi Eshkevari, S., Sen, D., & Pakzad, S. N. (2021). RL-Controller: A Reinforcement

Learning Framework for Active Structural Control. arXiv, arXiv:2103.07616.

DOI:https://doi.org/10.48550/arXiv.2103.07616

Zhang, Y., Wang, L., Wang, Y., Liu, Y., & Zhang, J. (2022). A Review of Artificial Intelligence Applications in

Structural Engineering. Computers in Industry, 142, 103750. DOI: https://doi.org/10.1016/j.compind.2022.103750

Downloads

Abstract views: 0

Published

2025-09-15

How to Cite

[1]
V. Andrukhov and Potіekha A., “FORMATION OF BIM MODEL DATASET FOR MACHINE LEARNING TASKS”, СучТехнБудів, vol. 38, no. 1, pp. 87–94, Sep. 2025.

Issue

Section

MODELING OF BUILDING PRODUCTION

Metrics

Downloads

Download data is not yet available.