TY - JOUR
T1 - Filtered dataset of Irish energy performance certificates
T2 - A data-driven approach for enhanced building stock modelling
AU - Raushan, Kumar
AU - Uidhir, Tomás Mac
AU - Salvador, Marisa Llorens
AU - Norton, Brian
AU - Ahern, Ciara
N1 - Publisher Copyright:
© 2025 The Authors
PY - 2025/4
Y1 - 2025/4
N2 - The data presented in this article supports the research publication “A data-driven standardised generalisable methodology to validate a large energy performance Certification dataset: A case of the application in Ireland” by Raushan et al. [1]. It provides the filtered Energy Performance Certificate (EPC) database for residential buildings in Ireland after applying rigorous data validation methods to remove erroneous entries, and outliers. EPCs contain valuable information about building energy efficiency and characteristics. The raw EPC database for Ireland is publicly accessible but contains over 1 million unfiltered entries with inconsistent and erroneous values that can skew analysis. This processed dataset enhances the quality and robustness of the EPC data for use in building stock modelling and research. The data is openly available in .CSV format along with the methodology used for processing the raw database, published in full Python scripts. Supporting notes and metadata explain the filtering process, experimental design, and content of 211 variables across four categories: Informational, form, envelope, and system. By publishing this standardised data-driven filtered EPC dataset, this research enables stakeholders, non-expert and expert alike, to leverage this higher quality input for characterising the Irish housing stock.
AB - The data presented in this article supports the research publication “A data-driven standardised generalisable methodology to validate a large energy performance Certification dataset: A case of the application in Ireland” by Raushan et al. [1]. It provides the filtered Energy Performance Certificate (EPC) database for residential buildings in Ireland after applying rigorous data validation methods to remove erroneous entries, and outliers. EPCs contain valuable information about building energy efficiency and characteristics. The raw EPC database for Ireland is publicly accessible but contains over 1 million unfiltered entries with inconsistent and erroneous values that can skew analysis. This processed dataset enhances the quality and robustness of the EPC data for use in building stock modelling and research. The data is openly available in .CSV format along with the methodology used for processing the raw database, published in full Python scripts. Supporting notes and metadata explain the filtering process, experimental design, and content of 211 variables across four categories: Informational, form, envelope, and system. By publishing this standardised data-driven filtered EPC dataset, this research enables stakeholders, non-expert and expert alike, to leverage this higher quality input for characterising the Irish housing stock.
KW - Building energy rating
KW - Data validation
KW - Data-driven statistical methods
KW - Dwelling energy assessment procedure
KW - Energy performance certificates
KW - EPC database
KW - Irish housing stock
UR - https://www.scopus.com/pages/publications/85215844731
U2 - 10.1016/j.dib.2025.111281
DO - 10.1016/j.dib.2025.111281
M3 - Article
AN - SCOPUS:85215844731
SN - 2352-3409
VL - 59
JO - Data in Brief
JF - Data in Brief
M1 - 111281
ER -