Work place: Department of Computer Engineering, School of Technology Management and Engineering, NMIMS University, Mumbai, India
E-mail: mohit.jain25@nmims.edu.in
Research Interests: Machine Learning, Deep Learning
Biography
Mohit Jain is currently pursuing a Bachelor of Technology in Computer Engineering and, concurrently, a Master of Business Administration at Mukesh Patel School of Technology Management and Engineering, NMIMS University, Mumbai, India. He developed this project during the final year of his bachelor's degree in Computer Engineering. His research focuses on the practical applications of machine learning and deep learning, reflecting his interest in exploring innovative technologies and their implications.
By Nimesh Yadav, Aryan Sinha, Mohit Jain, Aman Agrawal, Sofia Francis
DOI: https://doi.org/10.5815/ijem.2024.01.03, Pub. Date: 8 Feb. 2024
Text alone can be confusing, and it may be hard to picture what is being described. In some circumstances, words can be misunderstood. A description is much easier to grasp when it is presented as an image. Visuals have also been shown to increase viewership and retention.
Synthesizing realistic images automatically is a challenging undertaking, and even the most advanced artificial intelligence and machine learning algorithms have trouble meeting this standard. GANs (Generative Adversarial Networks) are one example of a powerful neural network architecture that has shown promising results in recent years. Existing text-to-image methods can generate samples that broadly reflect the meaning of the provided descriptions, but they often lack the necessary details and vivid object elements.
The primary objective of our research was to explore diverse architectural approaches for generating images from textual descriptions. We examined approaches that can produce visuals accurately depicting the content and context of written narratives. Our aim was to open new possibilities in visual storytelling by establishing a strong connection between language and imagery.