Skip to main content
← Back to Blog
2026-05-13googlegeminirag

Google made Gemini RAG multimodal — images, PDFs, and code in one search

Google quietly made Gemini API’s File Search multimodal this week. Agents can now search across text, images, PDFs, and code in a single RAG query. This is a significant capability upgrade.

Before vs After

Before
• Text-only RAG
• Separate image pipeline
• Manual PDF extraction
After (Gemini)
• Text + Image + PDF in one query
• Unified search index
• Native multimodal RAG

Why apitree users should care

Agents processing documents (contracts, invoices, reports) can now use Gemini for understanding + apitree for action. Read a PDF → extract data → call search_apis("send invoice data to accounting") → apitree routes to the right API.

Source: Greeden Weekly Summary

Try apitree yourself

1,950+ APIs via MCP. No signup for demo.

Run Demo
Blog — apitree · apitree