Building a dataset of Wrocław’s historic tenements. Image annotation for machine learning applications

Aleksandra Marcinów, Małgorzata Biegańska, Bianka Kowalska, Hubert Baran, Daniil Hardzetski, Halina Kwaśnicka

doi:10.37190/arc240306

Abstract

In recent years, the development of artificial intelligence (AI) has introduced new possibilities in the field of architecture. In the realm of compositional analysis and recognition of architectural details, AI can have a significant impact, supporting historical-architectural research, the valorisation of historic buildings, and design in accordance with historical context. However, the successful use of AI in analysing architectural objects requires large datasets to train and test the models. The article aims to demonstrate the creation of a new dataset containing annotated images. The NeoFaçade collection serves as a historical dataset, containing façades of the 19th and 20th century townhouses from Wrocław and, in due course, other cities with similar architectural styles (for example, Szczecin or Berlin). Gathering high-quality photographic material and marking architectural elements accurately, enables to use the dataset for various AI tasks: semantic segmentation, image classification, and generation of pictures of tenement house façades. This way, the NeoFaçade dataset can potentially be applied in architectural practice or historic preservation. The methodology for creating the dataset developed by the authors consists of three stages: preparation of the data acquisition procedure, data processing: creation of a dataset that meets the requirements and a summary of the dataset. All stages are discussed in detail in the paper, including an example annotation of one of the townhouses. In the future, the research team will focus on expanding the collection with new photographs, while also striving to demonstrate NeoFaçade value as a tool supporting innovative research projects and practical applications

Full article view is only available on bigger screens.