Post by @ff00aa@mastodon.social [#FF00AA]

May 14, 2024

Back when OpenAI announced “multimodal” ChatGPT, I felt that their language was deliberately vague enough to allow for several layers functioning separately, e.g., a discrete image recognizer telling the LLM what’s in a picture.

They’ve finally confirmed that was the case, because *now* GPT-4o is a single, actually omnimodal neural network. And I find the idea that this works, and works so well and so fast, really impressive and terrifying (all over again).

Major ChatGPT-4o update allows audio-video talks with an “emotional” AI chatbot

New GPT-4o model can sing a bedtime story, detect facial expressions, read emotions.

2024/05/14 - 21:38, first published on @ff00aa@mastodon.social
