webscraperCertified
Webscraper MCP is an MCP server designed for Claude desktop, enabling it to scrape text from websites, YouTube transcripts, and PDFs.
Webscraper MCP is a powerful tool for Claude desktop users, allowing seamless extraction of text content from various online sources. By simply providing a link, users can obtain text from standard web pages, YouTube videos, and PDF documents. This functionality is particularly useful for users who need to gather information from diverse sources quickly and efficiently. The server is designed to handle different types of content, ensuring that users can access the information they need without manual intervention. Whether it's extracting a transcript from a YouTube video or converting a PDF to markdown text, Webscraper MCP simplifies the process, making it accessible and user-friendly.
Features
- Extracts text from standard web pages using provided URLs.
- Converts PDF documents to markdown text for easy readability.
- Retrieves transcripts from YouTube videos for content analysis.
- Supports a wide range of online content types for comprehensive data extraction.
- User-friendly interface for seamless integration with Claude desktop.
MCP Tools
- get_pdf: Convert a URL that leads to a PDF file to markdown text.
- get_webpage_content: Returns the text content on a webpage based on the link provided.
- get_youtube_transcript: Extracts the transcript from a YouTube video.
Usage with Different Platforms
Claude Desktop
mcp: {
"get_webpage_content": "https://example.com",
"get_pdf": "https://example.com/document.pdf",
"get_youtube_transcript": "https://youtube.com/watch?v=example"
}
Related MCP Servers
View all browser_automation servers →Fetch
by modelcontextprotocol
A Model Context Protocol server that provides web content fetching capabilities, enabling LLMs to retrieve and process content from web pages.
markdownify-mcp
by zcaceres
Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format.
deepwiki-mcp
by regenrek
This is an unofficial Deepwiki MCP Server that processes Deepwiki URLs, crawls pages, converts them to Markdown, and returns documents or lists by page.
mcp-playwright
by executeautomation
A Model Context Protocol server that provides browser automation capabilities using Playwright.
fetch-mcp
by zcaceres
This MCP server provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
web-eval-agent
by Operative-Sh
operative.sh's MCP Server is a tool for autonomous debugging of web applications directly from your code editor.
cursor-talk-to-figma-mcp
by sonnylazuardi
This project implements a Model Context Protocol (MCP) integration between Cursor AI and Figma, allowing Cursor to communicate with Figma for reading designs and modifying them programmatically.