Text-Guided Artistic Image Synthesis Using Diffusion Model

Shivani Patil; Snehal Patil; Sanskruti Sitapure; Madhavi Patil; Dr. M.V. Shelke

doi:10.47392/IRJASH.2024.024

Authors

Shivani Patil 5Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
Snehal Patil Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
Sanskruti Sitapure Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
Madhavi Patil 5Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author
Dr. M.V. Shelke Department of Artificial Intelligence and Data Science, AISSMS Institute of Information Technology, Maharashtra, India Author

DOI:

https://doi.org/10.47392/IRJASH.2024.024

Keywords:

Artistic Image Synthesis, Diffusion Model, PyTorch, Generative Models, Latent Diffusion Model, Stable Diffusion

Abstract

Use of Artificial Intelligence (AI) has been integrated into numerous fields for the purpose of promoting innovativeness and efficiency. In the domain of image generation, AI offers a chance to improve creativity and accuracy by bridging the language-art gap. Our approach proposes utilization of the latent Diffusion for creating art images from user given textual descriptions. The Stable Diffusion is a powerful foundation upon which the rest of the image production module is built. It transforms input text descriptions into latent vector representations and then decodes them into visually appealing masterpieces. In terms of user access, our system consists of an easily comprehensible user interface module, which allows users to comfortably write text-based descriptions and view generated graphics without any difficulties. Our approach not only streamlines the image creation process but also outperforms current systems in terms of cost-effectiveness and efficiency. The implementation of the Stable Diffusion empowers our system for producing precise and realistic art images based on textual descriptions. Resulting capability finds applications in diverse fields such as design, content creation, marketing, and gaming. By providing an innovative and accessible solution for aesthetic image generation, our proposed approach contributes to the evolving landscape of AI-driven technologies.

Text-Guided Artistic Image Synthesis Using Diffusion Model

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

License

Information

Latest publications