NFTs are unique digital assets that represent items such as collectibles, metaverse assets, skins, artwork. NFTs are unique digital assets…
Author: Bal Marsius
TikTok expands its horizons by expanding its music streaming service to three new countries—Australia, Mexico, and Singapore. This move comes…
Karat Financial, the trailblazing startup catering to content creators’ financial needs Karat Financial, the trailblazing startup catering to content creators’…
Centralised exchanges have long been the primary avenue for trading cryptocurrencies. However, with the rise of decentralised finance (DeFi) Centralised…
At the Ethereum Community Conference (EthCC) in Paris, Vitalik Buterinpresented the groundbreaking concept of “account abstraction. At the Ethereum Community…
Google Play product Manager Joseph Mills recently announced that Google has updated its mobile software market policy to allow game…
Polychain Capital recently strengthened its crypto position with $200M funding and co-led $25M for Manta Network Developer. Polychain Capital recently…
Intelligent agents in AI are autonomous progrations that receive information from their environment using sensors and actuators to achieve their…
Young workers in the United States are facing a career crisis, Businessinsider reports. With the development of artificial intelligence (AI)…
OpenAI’s ChatGPT technology has gone viral in just less than a year and is already having an impact on work…
University of Montana’s Research Shows AI’s Potential Impact on Business Innovation University of Montana’s Research Shows AI’s Potential Impact on…
Interest in generative AI models has surged, driven by advancements in natural language processing and image generation.Interest in generative AI models has surged, driven by advancements in naturallanguage processing and image generation. META, a prominent player in the AI researchdomain, has introduced CM3leon, a cutting-edge multimodal model. Multimodal meansthe AI is capable of both text-to-image and image-to-text generation.CM3leon’s unique approach combines a recipe derived from text-only language models.Meta’s model will employ large-scale retrieval-augmented pre-training and multitasksupervised fine-tuning stages. Better Performance in Image GenerationDespite being trained with five times fewer computational resources than previoustransformer-based methods, CM3leon achieves state-of-the-art performance in text-to-image generation. Notably, it exhibits the versatility of autoregressive models whilemaintaining low training costs and efficient inference. This tokenization-based model goes beyond conventional text-to-image approaches. Itcan generate complex sequences of text and images conditioned on arbitrary content.Unlike other specialized image generation models, CM3leon’s large-scale multitaskinstruction tuning significantly enhances performance across various vision-languagetasks, such as image caption generation and visual question answering. Ethical Image Data SourcingMeta announced that it takes an ethical approach to image data sourcing, using onlylicensed images from Shutterstock to avoid issues related to ownership and attribution.This socially responsible methodology sets CM3leon apart from its competitors.In a comparison with widely-used benchmarks, CM3leon achieves an impressive FIDscore of 4.88, outperforming Google’s Parti model and setting a new standard for text-to-image generation. A Frechet Inception Distance (FID) score of 0.0 indicates a perfectscore.CM3leon exhibits an ability to generate intricate compositional objects, evident inexamples like a potted cactus donning sunglasses and a hat.…