
2024 | OriginalPaper | Book chapter

The Possibilities of Text-to-Image Tools for the Generation of Floor Plans

Written by: Angélica Fernández-Morales

Published in: Graphic Horizons

Publisher: Springer Nature Switzerland


Abstract

This study builds on previous research to assess whether text-to-image technology can correctly generate images of residential floor plans. Three tools are tested: Midjourney, Stable Diffusion and Dall-E. The process involved: (1) using reference images to generate text descriptions, (2) crafting prompts from these descriptions and testing them on the three AI systems, (3) merging text requests with reference images, and (4) using hand-drawn sketches to create technical architectural drawings.
In general, the tools showed potential but were deemed not yet suitable for producing architectural designs due to a lack of syntactic and functional logic. Midjourney emerged as the most effective, consistently generating 2D planimetric images and producing quality results when combining textual descriptions with reference images. On the other hand, Dall-E underperformed in responding to text requests and deviated significantly from delivering the desired images, although it excelled at describing images via ChatGPT, a task at which Midjourney faltered. Stable Diffusion was noted for striking a balance, offering quality close to Midjourney and better text descriptions through Artbot. It also showed promise with its unique ability to create images from hand-drawn sketches, a feature not available in the other tools.
The improvements these tools have shown within a short time suggest that they will continue to advance and might soon generate accurate architectural drawings from text descriptions and rough sketches, becoming an important aid for architects.


Footnotes
1
CLIP Interrogator is a prompt engineering tool that combines CLIP, from OpenAI, and BLIP, from Salesforce, to generate optimized texts that match a given image. Its author is pharmapsychotic and it is available on GitHub. It can be tested online from various websites. For this work, Hugging Face (https://huggingface.co/spaces/pharmapsychotic/CLIP-Interrogator) and Replicate (https://replicate.com/pharmapsychotic/clip-interrogator) were used.
 
2
Interrogate is an application within Artbot, a Stable Horde web client created by Dave Schumaker. It can be used online at https://tinybots.net/artbot/interrogate. Stable Horde is an open-source platform that makes idle GPU power, volunteered by its user community, freely available for AI art generation.
 
Metadata
Title
The Possibilities of Text-to-Image Tools for the Generation of Floor Plans
Written by
Angélica Fernández-Morales
Copyright year
2024
DOI
https://doi.org/10.1007/978-3-031-57575-4_36