Google is shaking up its AI ecosystem with fresh capabilities for Gemini, delivering on top user requests and expanding the reach of its AI-powered tools. These updates include major improvements to audio handling, language support, and document processing, making Gemini more versatile for users worldwide.
Gemini App Now Accepts Audio Files
Responding to user feedback, Google has enabled audio file uploads in the Gemini app, a feature that was the “#1 request” according to Google Labs leadership. Free Gemini users can now upload and process audio files up to 10 minutes long, with a daily limit of five free prompts.
For those subscribed to Gemini’s AI Pro or AI Ultra plans, the app supports audio uploads up to three hours in length per file. All users can attach up to 10 files per prompt, and the app supports an array of formats, even ZIP files.
- Free users: 10-minute audio files, five prompts per day
- Pro/Ultra users: Up to three-hour audio files
- File support: Up to 10 files per prompt, multiple formats and ZIPs
Google Search AI Expands Multilingual Capabilities
Gemini is also powering a significant expansion in the language options for Google Search’s AI Mode. Thanks to the integration of Gemini 2.5, users can now interact with Search in five additional languages: Hindi, Indonesian, Japanese, Korean, and Brazilian Portuguese. This update allows a broader global audience to ask complex questions and explore the web in their preferred language, making the AI experience more accessible and inclusive.
- New languages: Hindi, Indonesian, Japanese, Korean, Brazilian Portuguese
- Gemini 2.5 integration enhances question complexity and depth
NotebookLM Gets Smarter with New Report Styles
Google’s Gemini-powered NotebookLM is evolving into a more comprehensive research assistant. The latest update lets users generate reports in a variety of formats (including study guides, blog posts, briefing documents, flashcards, and quizzes) across more than 80 languages. Users can upload their own documents and media, then customize the structure, tone, and style of the output to suit their needs.
- Report types: Study guides, blogs, briefings, flashcards, quizzes
- Language support: Over 80 languages
- Customization: Structure, tone, and style adjustments
NotebookLM already supported audio files, positioning it as an advanced tool for extracting insights and patterns from mixed-format content. The new reporting features aim to streamline research and presentation workflows for students, professionals, and content creators alike.
Recent AI Momentum at Google
These updates come amid a flurry of AI enhancements across Google’s product suite. In August, Gemini introduced automatic memory for user preferences, and free users gained access to Workspace’s video generation tool, Vids.
In September, Google Photos upgraded to the Veo 3 video engine, allowing users to turn still images into short, silent video clips. This rapid pace of innovation underscores Google’s commitment to making AI more practical and accessible for everyday tasks.
Gemini’s Growing Role in Everyday AI
With these latest updates, Google is positioning Gemini as a central hub for multimedia processing, multilingual interaction, and flexible document creation. The new features not only respond to user demands but also set the stage for more seamless, AI-driven experiences across education, research, and content generation. As Google continues to expand Gemini’s capabilities, users can expect even more powerful and accessible tools in the near future.
Source: The Verge
Gemini’s Powerful New Audio and Language Features: What’s New for Google’s AI