• Gen AI Mastery Hub
  • Posts
  • 🤖 This Week in AI: Autonomous Agents, Quantized Models, & Creative Tools

🤖 This Week in AI: Autonomous Agents, Quantized Models, & Creative Tools

Welcome to your weekly dose of the latest news in the world of AI & more.

Hello AI Enthusiasts,

Imagine a world where you could have as many robot colleagues and assistants (agents), as you could possibly need, doing work for you? That's the future we're slowly stepping into, with the advent of AI agents. The tech has been advancing to an extent where AI agents could be collaborate closely with you, as soon as 2025.

This past week has been an exciting one in the world of artificial intelligence, with groundbreaking advancements across various domains!

Here’s your curated update on the latest developments that are sure to impact both the AI and business landscapes.

THIS WEEK’S MENU:

  • Top 5 AI News of the Week 📰

  • Top AI Tool of the Week ✂️

  • Gen AI Art Picks 🎨

Read time: 5 minutes

Top 5 AI News of the Week

🤝 This Week’s News…

1. Claude’s Next Leap in Automation from Anthropic

Image Source: Claude 3.5 Updates

Anthropic’s Claude 3.5 AI has taken a significant step forward, now capable of autonomously interacting with your computer to complete tasks. In a recent demonstration, Claude filled out vendor forms by analyzing a spreadsheet on the desktop, using screenshots to track actions and verify accuracy. This innovative functionality includes:

  • Intelligent Desktop Control: Screenshots for desktop guidance to ensure tasks are accurate and verified.

  • Task Completion Accuracy: Each step is double-checked, moving the mouse to the exact points required.

  • Enhanced Automation Potential: Repeats the process until tasks are complete, opening new avenues for business automation.

Claude 3.5 also introduced an analysis tool, simplifying data analysis directly within its interface. Users can visualize data from a CSV, automate reports, and more with minimal setup.

2. Microsoft Copilot Studio: Autonomous Agents for Business Automation

Image Source: Microsoft Copilot

This week, Microsoft rolled out new autonomous agent capabilities within Copilot Studio, aimed at revolutionizing task automation for business users. This update enables agents to operate independently, reacting to various signals from across an organization’s systems without requiring human intervention. Highlights include:

  • Autonomous Triggers: Agents can now be programmed to respond to signals from databases, tools, or scheduled events, initiating workflows in real-time.

  • Dynamic Path Creation: Each business process can follow unique paths as agents autonomously decide the next steps based on real-time information.

  • Integration with Advanced Models: Using the latest OpenAI models, including GPT-4, these agents bring enhanced reasoning to complex workflows.

With this upgrade, Microsoft positions Copilot Studio as a powerful tool for seamless business automation, promising to cut down on time spent on repetitive tasks while enhancing productivity. Expect a demonstration at Microsoft’s upcoming Ignite event, where more details will likely be unveiled.

3. Meta’s Quantized Llama Models

Image source: Meta

The new quantized Llama 3.2 models by Meta offer powerful on-device AI functionality, with a 56% reduction in model size and up to 4x faster processing. Leveraging Quantization-Aware Training and the SpinQuant method, Meta’s compact models perform well on mobile hardware, maintaining accuracy while reducing memory demands. These models, optimized for Qualcomm and MediaTek chips, support a range of applications from mobile assistants to edge computing

4. Video Generators on the Rise: Runway, Mochi 1 & Genmo

Image Source: Genmo video generator

The video generation field is buzzing with activity. Runway recently announced its Act One tool, enabling users to sync animated characters with real-life expressions. Ideal for animated storytelling, this tool aligns facial expressions and spoken words with animated avatars, promising an exciting future for virtual influencers and storytellers.

For those looking to experiment, Mochi 1 has emerged as an open-source video generator. Users with a powerful GPU can run it locally, which allows for uncensored and customized video creations, though with some variability in quality. Additionally, Genmo offers more accessible text-to-video functionality with creative options for realistic animations, available to anyone with 300 free credits.🖼️ Midjourney Has a New Open Source Competitor: FLUX.1

5. OpenAI Expands Voice Capabilities & Talent Departures

In major news for OpenAI users in the EU, Switzerland, Iceland, Norway, and Liechtenstein, voice mode is now accessible for Plus subscribers. This feature, aimed at enhancing accessibility, allows users to engage in voice-driven interactions directly with ChatGPT.

On a different note, OpenAI’s senior adviser, Miles Brundage, has announced his departure. In his farewell blog post, Brundage shared his views on the readiness (or lack thereof) for AGI. He speculated that while OpenAI has advanced models under wraps, they’re not vastly superior to those currently available to the public. His insights underscore ongoing concerns about AGI readiness and transparency.

And a few more quick updates 🚀

  • Perplexity AI’s New Mac App Launched: Perplexity AI has introduced a Mac app, allowing users quick access to AI-powered search with just a keyboard shortcut. This streamlined feature supports real-time answers and insights, making AI-driven information retrieval faster and more accessible for Mac users.

  • Eleven Labs Unveils Voice Design Tool: Eleven Labs launched a unique voice design tool, letting users create custom voices from text prompts. Whether you want a “smooth and calming” tone or a “playful and bright” sound, this tool opens new possibilities for personalized audio content in various industries.

  • Apple Intelligence Expands in iOS 18.2: The latest iOS 18.2 update adds Apple Intelligence features, like AI-driven emoji generation, visual intelligence that interprets on-screen content, and chat-like functionalities, bringing Apple’s devices closer to personalized AI assistance.

  • IBM Introduces Granite Models for Enterprises: IBM released its Granite 3 models, specifically tailored for enterprise tasks like retrieval, classification, and summarization. With a focus on cost efficiency, these models offer companies a more affordable AI solution, trained on their proprietary data for optimized performance.

  • Asana’s No-Code AI Agent Tool: Asana launched a no-code AI agent tool, enabling users to automate complex workflows without coding. By configuring task prerequisites and responses, users can streamline project management processes, making Asana a more versatile tool for businesses.

  • xAI API Launched: xAI's new API introduces “grok-beta,” a model with flexible, open-ended functionality for developers. Priced at $5 per million input tokens, the API includes function calling, connecting Grok to external databases and tools. Future multimodal features are anticipated, as xAI leverages data from X and Musk's other ventures to bolster Grok's real-world applications.

Top AI Tool of the Week

🖌️ Ideogram: Turn Text into Art with AI

Image Source: Ideogram

Ideogram AI generates stunning images from text prompts, allowing users to create everything from artwork to diagrams.

  • Text-to-Image Generation: Create visuals by simply typing a description, and Ideogram generates matching images.

  • Deep Learning Models: The platform’s deep learning neural networks analyze your text for accurate visuals

  • Flexible Plans: Generate up to 25 sets of images daily for free, or upgrade for more features.

 

GenAI Art Picks: Your Weekly Dose of AI Inspiration

We’re back with our top AI art picks this week to get you inspired to create your own AI masterpieces 🖍️⭐

Image source: Midjourney

Isn't it incredible that these stunning images were entirely AI-generated? Believe it or not, every single pixel was crafted by a machine!

That's it for this week's AI tips & news! Hope you found some gems to spark your curiosity and imagination. Stay tuned for more AI news and tips next week. Keep exploring and stay awesome in the world of AI!

How Did We Do?

Got a minute? Tell us how we did this week by replying to this email. Your feedback's like gold – helps us make this newsletter a hit for you! Share, suggest, and let's make this AI journey epic together.