Text-Guided Artistic Image Synthesis Using Diffusion Model

Authors

  • Shivani Patil 5Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
  • Snehal Patil Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
  • Sanskruti Sitapure Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
  • Madhavi Patil 5Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
  • Dr. M.V. Shelke Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author

DOI:

https://doi.org/10.47392/IRJASH.2024.024

Keywords:

Artistic Image Synthesis, Diffusion Model, PyTorch, Generative Models, Latent Diffusion Model, Stable Diffusion

Abstract

Use of Artificial Intelligence (AI) has been integrated into numerous fields for the purpose of promoting innovativeness and efficiency. In the domain of image generation, AI offers a chance to improve creativity and accuracy by bridging the language-art gap. Our approach proposes utilization of the latent Diffusion for creating art images from user given textual descriptions. The Stable Diffusion is a powerful foundation upon which the rest of the image production module is built. It transforms input text descriptions into latent vector representations and then decodes them into visually appealing masterpieces. In terms of user access, our system consists of an easily comprehensible user interface module, which allows users to comfortably write text-based descriptions and view generated graphics without any difficulties. Our approach not only streamlines the image creation process but also outperforms current systems in terms of cost-effectiveness and efficiency. The implementation of the Stable Diffusion empowers our system for producing precise and realistic art images based on textual descriptions. Resulting capability finds applications in diverse fields such as design, content creation, marketing, and gaming. By providing an innovative and accessible solution for aesthetic image generation, our proposed approach contributes to the evolving landscape of AI-driven technologies.

Downloads

Published

2024-06-06