OpenAI brings ChatGPT’s Advanced Voice Mode to the web

Google’s Gemini chatbot now has memory

In today’s email:

  • 💸 Perplexity introduces a shopping feature for Pro users in the US

  • 👨🏻‍⚖️ Sam Altman will co-chair San Francisco mayor-elect Daniel Lurie’s transition team

  • 🗣️ Want to speak Italian? Microsoft AI can make it sound like you do.

  • 🧰 10 new AI-powered tools and resources. Make sure to check the online version for the full list of tools.

Top News

OpenAI is bringing its Advanced Voice Mode feature for ChatGPT to the web, enabling users to engage in natural, real-time conversations directly from their browsers. The feature, previously available on iOS and Android since September, is rolling out this week to ChatGPT’s paying customers, including Plus, Enterprise, Teams, and Edu subscribers. Kevin Weil, OpenAI’s chief product officer, announced the launch on X.

The Advanced Voice Mode uses OpenAI’s GPT-4o’s audio capabilities, making conversations more dynamic and lifelike. ChatGPT can understand non-verbal cues like speaking speed and respond with emotion, enhancing the interaction. To start a voice chat, users simply click the Voice icon at the bottom of the prompt window, allow microphone access, and proceed to a screen featuring a blue orb. OpenAI offers nine distinct output voices, such as “Arbor,” described as “easygoing and versatile,” and “Ember,” known for being “confident and optimistic.”

Paying users have daily limits on Advanced Voice Mode, and OpenAI will notify them when only 15 minutes remain for the day. Free users will gain access to a monthly preview of the feature, allowing them to try it out. OpenAI also plans to roll out the feature to all free users in the coming weeks, expanding access to this innovative conversational tool.

Other developer tools can’t tell you how your codebase works and why. Unblocked can. We augment your source code with context from Slack, Confluence, Jira (and more), so your team gets helpful answers without having to search for them.

Unblocked gives you:

  • Instant answers about every aspect of your codebase 

  • Relevant, historical context for any code open in an IDE

  • Automated answers to questions asked in Slack, removing interruptions to you

Engineers who use Unblocked save an hour or more a day.

“Every developer now has the ability to tap into past discussions and decisions to fill their knowledge gaps, regardless of their tenure. We are moving faster and making more accurate decisions as a team as a result.” - Alex Mallet, EVP of Engineering at Forto

Google’s Gemini chatbot has introduced a memory feature, enabling it to retain and recall personal information like your preferences, work details, and favorite foods. Similar to OpenAI's ChatGPT, this functionality enhances conversational context. For instance, if you inform Gemini of your preferred cuisines, it can tailor restaurant recommendations based on your tastes in future interactions. Currently, the memory feature is available exclusively on the web client for subscribers of Google’s $20-per-month Google One AI Premium plan, with plans to expand to iOS and Android users later.

Gemini’s memory system is designed with flexibility in mind. Users can provide specific instructions, such as simplifying language, focusing on JavaScript for coding queries, or including daily costs in trip planning. While the memory feature supports only English for now, it can be toggled off or edited at any time. Memories are stored indefinitely unless manually deleted, but Google emphasizes that these saved details are neither shared externally nor used to train the AI model, addressing privacy concerns.

However, memory features in AI systems pose potential risks if not carefully implemented. Earlier this year, a security researcher demonstrated how hackers could plant “false” memories in ChatGPT, potentially compromising user data. Google’s safeguards aim to prevent such vulnerabilities, ensuring that Gemini’s memory capability remains both practical and secure. This update represents a significant step in making AI interactions more personalized and contextually aware, albeit with a need for ongoing vigilance around data protection.

AI-powered search engine Perplexity has ventured into e-commerce with the launch of a new shopping feature for Pro users in the U.S. This service integrates shopping recommendations directly into search results, allowing users to view product details, prices, seller information, and reviews through visual cards. Shoppers can even complete purchases using a one-click checkout system without leaving Perplexity’s platform. The tool ensures convenience by enabling users to store their address and payment information while offering free shipping to Pro subscribers. The company emphasizes that its product recommendations are currently unbiased, with no sponsored slots.

Perplexity’s shopping feature relies on integrations with platforms like Shopify, providing detailed information on items shipped in the U.S. Users can search using text or images, similar to Google’s search capabilities, enhancing the overall shopping experience. Additionally, Perplexity has launched a merchant program that allows businesses to be better indexed and recommended by sharing detailed product information. Merchants gain free API access to power search on their own websites, potentially increasing Perplexity’s influence in e-commerce without taking affiliate commissions for now.

This move aligns Perplexity with other tech giants like Google and Amazon, which have also invested in AI-powered shopping tools. Amazon’s AI assistant Rufus and Google’s AI-enhanced Shopping tab have already demonstrated the potential of AI in improving e-commerce search. By leveraging large language models, Perplexity and similar startups aim to streamline the process of finding products through natural language queries. The promise of faster, more accurate search results comes with the challenge of building user trust by maintaining transparency and avoiding biases in search recommendations.

Other stuff

All your ChatGPT images in one place 🎉

You can now search for images, see their prompts, and download all images in one place.

Tools & LinkS
Editor's Pick ✨

marimo - The next-generation Python notebook

Layer - Brain-Inspired planner

BetterBugs.io - Capture bugs, record sessions, and fix with AI

Documind - Transform your PDF documents into structured data

Superchat - AI Agents for WhatsApp Business, Instagram & Co

Vozo Video Translator - Precise video translation, perfected with AI pilot

Lightscreen AI - Uncheatable tech screens, driven by a voice agent

HeyGen - iOS Mobile App - Advanced AI video tools for creators on the go

Sandra AI - Multilingual voice AI receptionist for car dealers

BharatDiffusion - Bringing India's culture to life with AI-Visuals

Unclassified 🌀 

How did you like today’s newsletter?

Login or Subscribe to participate in polls.

Help share Superpower

⚡️ Be the Highlight of Someone's Day - Think a friend would enjoy this? Go ahead and forward it. They'll thank you for it!

Hope you enjoyed today's newsletter

Follow me on Twitter and Linkedin for more AI news and resources.

Did you know you can add Superpower Daily to your RSS feed https://rss.beehiiv.com/feeds/GcFiF2T4I5.xml

⚡️ Join over 200,000 people using the Superpower ChatGPT extension on Chrome and Firefox.

OR