Llama 3.2 Unveiled at Meta Connect 2024: The New Face of Multimodal and Mobile AI

October 2, 2024

Gizem Argunşah

Meta’s Llama3.2

Llama 3.2 Unveiled at Meta Connect 2024: The New Face of Multimodal and Mobile AI

Meta showcased its latest advancements in artificial intelligence at the Meta Connect 2024 event, introducing the world to Llama 3.2. With this new version, users can now experience Llama Vision, which combines text and visual data, directly on their phones. Llama 3.2 comes with ten new models, ranging from a 1B text-focused model to a 90B multimodal (text + image) model. However, due to certain restrictions within the European Union, there are some details that need attention.

En yeni PHI 3.5, Microsoft’un PHI serisinin en sonuncusunu temsil ediyor ve bu yapay zeka dil modelleri için teknolojide dev bir adım atıyor. PHI 2’nin iyi bilinen temelinden inşa edilen bu nesilde PHI serisi, anlayabildiği ve üretebildiği doğal dilin üstün olması nedeniyle birçok alkış aldı. Bu nedenle, PHI serisi tarafından birçok alkış toplandı.

Llama 3.2 and Revamped Meta AI Features

The world of social media is undergoing a major transformation with Meta’s latest innovations. At the heart of these innovations are Llama 3.2, the first open-source AI model, and the updated Meta AI features. In this blog post, we will delve into these exciting developments in detail.

Llama 3.2 stands out as a multimodal AI model capable of processing both text and image data. Offering new possibilities for developers, Llama 3.2 paves the way for more advanced AI applications.

Meta CEO Mark Zuckerberg emphasizes that open-source models are shaping the future and are more cost-effective than closed-source alternatives.

Llama 3.2 models are available for download on llama.com and Hugging Face and are ready for development in a wide range of partner platform ecosystems. Meta has collaborated with leading companies like Google, Microsoft, Intel, IBM, NVIDIA, Oracle Cloud, and Amazon Web Services (AWS) to develop this model.

New Meta AI: Features Reshaping Social Media

The new Meta AI, powered by Llama 3.2, brings a host of innovations that will fundamentally change the social media experience. Here are some of them:

Automatic Video Dubbing (Reels): Users can translate videos they shoot in one language into other languages using AI, with their own voices. Lip synchronization is also flawlessly implemented, eliminating language barriers and enabling reaching wider audiences.
AI-Powered Images (Instagram and Facebook Feed): Personalized, AI-generated images based on user interests will appear on homepages. Some images will be created using the user’s face. Presented under the title “Imagined for You,” this feature will offer options for visual sharing or real-time new visual creation.
AI-Powered Content Creator Personalities: Content creators will be able to create their own images using AI, and users will be able to chat with these AI-powered personalities through video.
Celebrity-Voiced AI Assistants: AI voices featuring celebrities like Kristen Bell, Awkwafina, and John Cena are added to the AI chatbot on Instagram, WhatsApp, and Facebook. This feature will initially be available to US users.
AI-Powered Photo Editing: Users will be able to make changes to their photos by entering text prompts. Features like adding clothes, adding/removing objects, changing colors/styles, and updating backgrounds are offered.

Reasons to Try Llama 3.2

Benchmark between Lllama 3.2 models and Gemma 2 & Phi 3.5 mini models

Llama 3.2 stands out with its numerous innovative features:

Multimodal Capability: Llama 3.2 Vision offers 11B and 90B models capable of processing text and image data. These models were trained on Llama 3.1 text models using data from 6 billion image-text pairs. This opens new doors for those who want to work with both text and visual content.
Edge Usage: Llama 3.2 Edge offers multilingual text-focused models in 1B and 3B sizes, creating suitable alternatives for local use. These models, capable of running efficiently on mobile devices, provide users with a faster and more accessible experience.
Extensive Context Length: All Llama 3.2 models have a context length of 128,000 tokens. This allows users to process more complex and longer content.
Knowledge Distillation and Fine-Tuning: Llama 3.2 enabled the training of 1B and 3B models from 8B and 70B models through knowledge distillation and fine-tuning. This resulted in models that occupy less space but are equally effective.
Security Measures: Llama Guard 3.2 has two new enhanced security models with visual support. This makes the user experience safer.
Performance Evaluations: The Llama 3.2 3B model performs on par with the Llama 3.1 8B model on IFEval. This offers powerful use cases for on-device information retrieval (RAG) or agencies.

Conclusion

Meta’s Llama 3.2 and new Meta AI features demonstrate the potential of AI technology to revolutionize social media and other fields. With its open-source approach, Meta aims to promote the widespread adoption of this technology and foster further innovation. The widespread use and impact of these innovations will become clearer over time. The metaverse world is becoming even more exciting with these new advancements.

With multimodal capabilities, mobile usage possibilities, and enhanced security measures, Llama 3.2 is a powerful tool designed to meet user needs. Getting acquainted with Llama 3.2 could take your AI experience to the next level!

To try Llama 3.2 on SkyStudio soon, register here!

References:

https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/

https://github.com/meta-llama/llama-models/blob/main/models/llama3_2/MODEL_CARD.md

https://ollama.com/library/llama3.2

Llama 3.2, Meta

Llama 3.2 Unveiled at Meta Connect 2024: The New Face of Multimodal and Mobile AI

Llama 3.2 Unveiled at Meta Connect 2024: The New Face of Multimodal and Mobile AI

Llama 3.2 and Revamped Meta AI Features

New Meta AI: Features Reshaping Social Media

Reasons to Try Llama 3.2

Conclusion

Join us for updates on our latest AI innovations

Read our latest content on chatbots, generative ai, customer experience, and many more.

ABOUT skymod

Solutıons

Resources

Social

Contacts