• Superpower Daily
  • Posts
  • ChatGPT unexpectedly began speaking in a user’s cloned voice during testing 😳

ChatGPT unexpectedly began speaking in a user’s cloned voice during testing 😳

Forget Midjourney — Flux is the new king of AI image generation

In partnership with

In today’s email:

  • 👨🏻‍🏫 Sakana AI announced the world's first fully autonomous AI scientist.

  • 🤯 What if I told you nothing in this video is real?

  • 👀 Girls are using ChatGPT to see if men are lying about their height on dating apps.

  • 🧰 11 new AI-powered tools and resources. Make sure to check the online version for the full list of tools.

Top News

Recently, OpenAI revealed in its system card for the new GPT-4o AI model that during testing, the model's Advanced Voice Mode occasionally mimicked users' voices without permission. This rare but concerning occurrence happened due to audio input noise, which unintentionally triggered the model to imitate a user's voice, bypassing the safeguards meant to prevent this. OpenAI has since implemented stricter controls to ensure this doesn't happen again, highlighting the complexity of safely managing AI that can replicate voices from short clips.

The incident underscores potential risks in AI voice synthesis, where a user’s audio could inadvertently be used as a prompt for voice imitation. While OpenAI now uses a standalone output classifier to prevent unauthorized voice generation, the episode reveals the challenges of controlling such powerful technology. The system card suggests that the residual risk of this occurring again is minimal, thanks to the robust safeguards in place.

Despite these concerns, the ability to imitate voices remains a significant advancement in AI, albeit with serious security implications. While OpenAI has restricted these capabilities for safety reasons, experts believe that similar technology will become available to the public through other sources soon. This points to a future where AI-driven voice synthesis could become a common tool, potentially leading to both innovative uses and new challenges in digital security.

Understand AI’s Risks and Opportunities

With the rapid growth of AI, it's crucial for businesses to understand both the risks and opportunities it presents. Discover how you can utilize AI to drive success while minimizing potential pitfalls.

In the MIT Artificial Intelligence: Implications for Business Strategy online short course you’ll gain:

  • Practical knowledge and a foundational understanding of AI's current state

  • The ability to identify and leverage AI opportunities for organizational growth

  • A focus on the managerial rather than technical aspects of AI to prepare you for strategic decision making

Flux, developed by the startup Black Forest Labs, is a new AI image generation model that has quickly gained popularity, being hailed as the rightful successor to Stable Diffusion. Unlike its competitor Midjourney, Flux is open-source and can run on a decently powerful laptop, making it accessible on various multi-model platforms like Poe, NightCafe, and FreePik. The model is praised for its ability to render people more effectively, although its skin textures aren’t as refined as those in Midjourney v6.1.

Black Forest Labs, a German-based company founded by former engineers of Stability AI, including Robin Rombach, Andreas Blattmann, and Dominik Lorenz, has released three versions of Flux.01: Pro, Dev, and Schnell. These models cater to different needs, with the Pro version offering a commercial license. Flux.01 outperforms many competitors in terms of adherence to prompts, image quality, and text rendering, especially when run on a laptop equipped with an RTX 4090 GPU.

Flux.01 is accessible in several ways, including local installation through the Pinokio launcher or via various online platforms like NightCafe, Poe, and Hugging Face. These platforms allow users to generate images easily, with Flux.01 standing out for its versatility and quality. Additionally, Black Forest Labs is working on a text-to-video model that promises to deliver high-quality, open-source video content generation.

Sakana AI has introduced "The AI Scientist," a pioneering system that fully automates the scientific research process using advanced foundation models like Large Language Models (LLMs). This system autonomously generates innovative research ideas, writes code, conducts experiments, and produces comprehensive scientific manuscripts, all while refining its outputs through an automated peer review process. In collaboration with leading AI researchers, The AI Scientist has demonstrated the ability to produce research papers in machine learning with near-human accuracy and at a significantly reduced cost, marking a major advancement in the democratization of scientific discovery.

The AI Scientist operates through a four-step process: idea generation, experimental iteration, paper writing, and automated reviewing. Starting with a template, it brainstorms new research directions, executes experiments, generates visualizations, writes scientific papers, and evaluates its own work. While the system has achieved notable success, it also has limitations, such as the inability to interpret visual data and occasional errors in writing and evaluating results. Despite these challenges, this innovation opens new possibilities for accelerating scientific progress, raising important ethical considerations and highlighting the need for careful deployment.

Looking to the future, The AI Scientist has the potential to transform the role of human scientists, allowing them to focus on higher-level creative tasks while the AI handles routine research activities. However, questions remain about the system's ability to make paradigm-shifting discoveries and the broader implications of AI-driven research. As The AI Scientist continues to evolve, it may become an indispensable tool in addressing the world's most complex scientific challenges.

Other stuff

All your ChatGPT images in one place 🎉

You can now search for images, see their prompts, and download all images in one place.

Tools & LinkS
Editor's Pick ✨

postgres.new - In-browser Postgres with an AI interface

Cosine - World's best AI Software Engineer

Google Vids - A rich new medium that unlocks powerful video storytelling

Bardeen 3.0 - The AI agent for your browser

LandingPro AI (Buildspace S5) - AI that writes website copy that actually sells

Ultra AI - AI command center for your product

Clips to Videos by VMEG - Transform clips into captivating marketing videos with AI

AI SaaS Launcher - Build SaaS MVPs fast: AI + next-gen low-code power

Gan.AI TTS Model & API Playground - First TTS model to support all 22 Indic languages + English

Langtrace AI - Improve your LLM apps with an open-source observability tool

PolygrAI - Digital polygraph powered by AI

Unclassified 🌀 

  • 🎙️ Receive bite-sized summaries of your favorite podcasts. Generated by AI. Delivered directly to your inbox. Check Out PodSnacks!

How did you like today’s newsletter?

Login or Subscribe to participate in polls.

Help share Superpower

⚡️ Be the Highlight of Someone's Day - Think a friend would enjoy this? Go ahead and forward it. They'll thank you for it!

Hope you enjoyed today's newsletter

Follow me on Twitter and Linkedin for more AI news and resources.

Did you know you can add Superpower Daily to your RSS feed https://rss.beehiiv.com/feeds/GcFiF2T4I5.xml

⚡️ Join over 200,000 people using the Superpower ChatGPT extension on Chrome and Firefox.

OR