News Report Technology
December 06, 2023

Google Research and Tel Aviv University Develop AI Framework for Precise Image Generation

In Brief

Google Research and Tel Aviv University have developed AI that combines a text-to-image diffusion with lens geometry for image rendering.

Google Research and Tel Aviv University Unveil AI Framework for Precision Image Generation

Google Research in collaboration with Tel Aviv University, has introduced a new artificial intelligence (AI) framework that combines a text-to-image diffusion model with specialized lens geometry for image rendering.

This integration allows for precise control over rendering geometry, making it easier to generate diverse visual effects such as fish-eye, panoramic views, and spherical texturing using a single diffusion model.

In a latest research paper, scientists tackled the task of incorporating diverse optical controls into text-to-image diffusion models. This approach involved making the model consider the local lens geometry, enhancing its ability to replicate intricate optical effects and create realistic-looking images.

Instead of merely altering the standard shape of images, this method allows virtually any grid warps through per-pixel coordinate conditioning. This innovative approach supports diverse applications, such as panoramic scene generation that impart a sense of presence and sphere texturing. 

Additionally, the framework introduces a manifold geometry-aware image generation framework with metric tensor conditioning. This provides additional possibilities for controlling and modifying the way images are generated, unveiling numerous possibilities for creating and refining pictures.

Precise Image Generation through Text-to-Image Diffusion Integration

The framework integrates text-to-image diffusion models with specific lens geometry through per-pixel coordinate conditioning. The method entails refining a pre-trained latent diffusion model by utilizing data generated through the distortion of images with random warping fields.

Token reweighting was implemented in self-attention layers, allowing for the manipulation of curvature properties and yielding various effects, such as fish-eye and panoramic views. This approach goes beyond fixed resolution in image generation and includes metric tensor conditioning for improved control.

Revolutionizing Image Manipulation

The framework expands the capabilities of image manipulation, addressing challenges such as large image generation and adjusting self-attention scales in diffusion models.

Effectively, the framework integrates a text-to-image diffusion model with specific lens geometry, allowing for a range of visual effects like fish-eye, panoramic views, and spherical texturing using a single model. It provides meticulous control over curvature properties and rendering geometry, leading to the creation of realistic and nuanced images.

Trained on a substantial textually annotated dataset and per-pixel warping fields, the method produces arbitrary warped images with finely undistorted results closely aligned with the target geometry. Additionally, it facilitates the development of spherical panoramas characterized by realistic proportions and minimal artifacts.

Google Research and Tel Aviv University Unveil AI Framework for Precision Image Generation

The recently introduced framework, which integrates diverse lens geometries into image rendering, offers improved control over curvature properties and visual effects.

The researchers suggest extending this approach to achieve outcomes comparable to specialized lenses capturing distinct scenes. By considering the potential utilization of more advanced conditioning techniques, the framework envisions enhanced image generation and expanded capabilities.

Disclaimer

In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.

About The Author

Alisa is a reporter for the Metaverse Post. She focuses on investments, AI, metaverse, and everything related to Web3. Alisa has a degree in Business of Art and expertise in Art & Tech. She has developed her passion for journalism through writing for VCs, notable crypto projects, and scientific writing. You can contact her at alisa@mpost.io

More articles
Alisa Davidson
Alisa Davidson

Alisa is a reporter for the Metaverse Post. She focuses on investments, AI, metaverse, and everything related to Web3. Alisa has a degree in Business of Art and expertise in Art & Tech. She has developed her passion for journalism through writing for VCs, notable crypto projects, and scientific writing. You can contact her at alisa@mpost.io

Hot Stories

Top Investment Projects of the Week 25-29.03

by Viktoriia Palchik
March 29, 2024
Join Our Newsletter.
Latest News

Top Investment Projects of the Week 25-29.03

by Viktoriia Palchik
March 29, 2024

Supply and Demand Zones

Cryptocurrency, like any other currency, is a financial instrument based on the fundamental economic principles of supply ...

Know More

Top 10 Crypto Wallets in 2024

With the current fast-growing crypto market, the significance of reliable and secure wallet solutions cannot be emphasized ...

Know More
Join Our Innovative Tech Community
Read More
Read more
Modular Blockchain Sophon Raises $10M Funding from Paper Ventures and Maven11 Amid Veil of Mystery
Business News Report
Modular Blockchain Sophon Raises $10M Funding from Paper Ventures and Maven11 Amid Veil of Mystery
March 29, 2024
Arbitrum Foundation Announces Third Phase Of Grants Program, Opens Applications From April 15th
News Report Technology
Arbitrum Foundation Announces Third Phase Of Grants Program, Opens Applications From April 15th
March 29, 2024
Top Investment Projects of the Week 25-29.03
Digest Technology
Top Investment Projects of the Week 25-29.03
March 29, 2024
Vitalik Buterin Advocates For Memecoins’ Potential In Crypto Sector, Favors ‘Good Memecoins’
News Report Technology
Vitalik Buterin Advocates For Memecoins’ Potential In Crypto Sector, Favors ‘Good Memecoins’
March 29, 2024