gemini-mcp-server
Gemini MCP Server is a multi-context processing server using the Gemini API for analyzing webpages.
Gemini MCP Server
Multi-context processing (MCP) server using Gemini API for analyzing webpages.
Features
- Screenshot and analyze webpages
- Tag and metadata for screenshots
- Retrieve screenshots and associated data via API
API Endpoints
Process a webpage
POST /process
Request body:
{
"url": "https://example.com",
"prompt": "Describe what you see on this webpage"
}
Get screenshots
Get screenshot by ID
GET /screenshots/:id
Example:
GET /screenshots/123
Response:
{
"id": 123,
"url": "https://example.com",
"prompt": "Describe what you see on this webpage",
"created_at": "2023-06-15T12:34:56Z",
"tags": ["initial_load", "page_entry"],
"metadata": {
"pageState": "initial",
"pageTitle": "Example Domain"
},
"download_url": "https://yoursupabaseproject.supabase.co/storage/v1/object/public/screenshots/screenshot_1234.png"
}
Get screenshots with filters
GET /screenshots?tag=initial_load
or
GET /screenshots?key=pageTitle&value=Example%20Domain
Response:
{
"count": 2,
"screenshots": [
{
"id": 123,
"url": "https://example.com",
"prompt": "Describe what you see on this webpage",
"created_at": "2023-06-15T12:34:56Z",
"tags": ["initial_load", "page_entry"],
"metadata": {
"pageState": "initial",
"pageTitle": "Example Domain"
},
"download_url": "https://yoursupabaseproject.supabase.co/storage/v1/object/public/screenshots/screenshot_1234.png"
},
// Additional screenshots...
]
}
Installation
- Clone the repository
- Create a
.env
file based on.env.example
- Run
npm install
- Run
npm run build
- Start the server with
npm start
Related MCP Servers
View all browser_automation servers →Fetch
by modelcontextprotocol
A Model Context Protocol server that provides web content fetching capabilities, enabling LLMs to retrieve and process content from web pages.
markdownify-mcp
by zcaceres
Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format.
mcp-playwright
by executeautomation
A Model Context Protocol server that provides browser automation capabilities using Playwright.
web-eval-agent
by Operative-Sh
operative.sh's MCP Server is a tool for autonomous debugging of web applications directly from your code editor.
cursor-talk-to-figma-mcp
by sonnylazuardi
This project implements a Model Context Protocol (MCP) integration between Cursor AI and Figma, allowing Cursor to communicate with Figma for reading designs and modifying them programmatically.
ai-agent-marketplace-index-mcp
by AI-Agent-Hub
MCP Server for AI Agent Marketplace Index from DeepNLP, allowing AI assistants to search available AI agents by keywords or categories.
playwright-mcp
by microsoft
Playwright MCP is a Model Context Protocol server that provides browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots.