"When I first reviewed the #RayBan #Meta #SmartGlasses, I wrote that some of the most intriguing features were the ones I couldn’t try out yet. Of these, the most interesting is what Meta calls '#MultiModalAI,' the ability for the glasses to respond to queries based on what you’re looking at." #GenerativeAI

The Ray-Ban Meta smart glasses’ new #AI powers are impressive, and worrying
https://www.engadget.com/the-ray-ban-meta-smart-glasses-new-ai-powers-are-impressive-and-worrying-181036772.html?ncid=txtlnkusaolp00000618

Meta's Ray-Ban smart glasses resting on a shelf

**IT News** @itnewsbot@schleuss.online · Sep 25, 2023

ChatGPT update enables its AI to “see, hear, and speak,” according to OpenAI (image credit: Getty Images)

On Monday, OpenAI announced a s... - https://arstechnica.com/?p=1970737 #largelanguagemodels #speechrecognition #machinelearning #speechsynthesis #computervision #textsynthesis #multimodalai #multimodal #microsoft #whisperai #aiethics #bemyeyes #bingchat #android #chatgpt #chatgtp #biz #openai #tech #ios #ai

Ars Technica · Sep 25, 2023 · ChatGPT update enables its AI to “see, hear, and speak,” according to OpenAI. Image recognition and voice features aim to make the AI bot's interface more intuitive.

**Norobiik** @Norobiik@noc.social · Mar 8, 2023

#ChatGPT, #DallE & #Midjourney are #Unimodal #AIs - Florence is something else.

" Multimodal models — models that, once again, understand multiple modalities, such as language and images or videos and audio — are able to perform tasks in one shot that unimodal models simply cannot (e.g. captioning videos)."

#Microsoft’s #ComputerVision model will generate #AltText for #Reddit images | #AI #FlorenceAI #MultiModalAI | TechCrunch
https://techcrunch.com/2023/03/07/microsofts-computer-vision-model-will-generate-alt-text-for-reddit-images/

An image captioned by Microsoft's Florence AI with the alt-text "Cheetah sitting on a hill"

**Norobiik** @Norobiik@noc.social · Mar 2, 2023

"Some AI experts point to #MultiModalAI as a potential path toward general artificial intelligence, a hypothetical technology that will ostensibly be able to replace humans at any intellectual task (and any intellectual job). #AGI is the stated goal of #OpenAI, a key business partner of Microsoft in the AI space."

#Microsoft unveils #AI model that understands image content, solves visual puzzles | Ars Technica
https://arstechnica.com/information-technology/2023/03/microsoft-unveils-kosmos-1-an-ai-language-model-with-visual-perception-abilities/

Ars Technica · Microsoft unveils AI model that understands image content, solves visual puzzles. Microsoft believes a multimodal approach paves the way for human-level AI.
