mcp-web-extractor
MCP Web Extractor is a Model Context Protocol server that extracts web content using Readability.js, ideal for saving clean, readable versions of articles to Obsidian notes.
MCP Web Extractor fetches a web page and extracts its main content with Readability.js. It suits users who want to save clean, readable versions of articles, free of ads and other distractions, directly into their Obsidian notes. Given a URL, the server returns the main text along with metadata such as the title and an excerpt. It connects to Obsidian through the Model Context Protocol, which makes it useful for anyone who regularly saves web content for later reference or research.
Features
- Extracts readable content from any URL
- Removes ads, sidebars, and other distractions
- Returns clean text along with metadata (title, excerpt, etc.)
- Easy integration with Obsidian via MCP
Usage with Different Platforms
Standalone service
bash
ts-node-esm client-example.ts
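For orientation, an MCP client invokes a server's tools with JSON-RPC 2.0 `tools/call` requests. The sketch below builds one such request for a URL. The tool name `extract_content` and the `url` argument are assumptions made for illustration and are not taken from this project; the names a server actually exposes can be discovered with a `tools/list` request.

```typescript
// Shape of an MCP tool invocation (JSON-RPC 2.0, method "tools/call").
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: {
    name: string;
    arguments: Record<string, unknown>;
  };
}

// Hypothetical tool name; the real server may expose a different one.
const ASSUMED_TOOL_NAME = "extract_content";

function buildExtractRequest(id: number, url: string): ToolCallRequest {
  // Reject anything that is not an http(s) address before sending it.
  if (!/^https?:\/\//i.test(url)) {
    throw new Error(`Expected an http(s) URL, got: ${url}`);
  }
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: ASSUMED_TOOL_NAME, arguments: { url } },
  };
}

// Serialize the request as it would travel over the transport.
const request = buildExtractRequest(1, "https://example.com");
console.log(JSON.stringify(request));
```

In practice an MCP client library handles this framing and the transport, so hand-built requests like this are mainly useful for debugging.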
Obsidian integration
typescript
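// obsidian-integration.ts example
import { MCPWebExtractor } from './mcp-web-extractor';

const extractor = new MCPWebExtractor();

// Log the extracted content; report failures instead of leaving the rejection unhandled.
extractor.extractContent('https://example.com')
  .then(content => {
    console.log(content);
  })
  .catch(error => console.error('Extraction failed:', error));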
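Extracted content still has to become a note. A minimal sketch of that step follows, assuming the result carries `title`, `excerpt`, and `textContent` fields. The source only says that a title, an excerpt, and the main text are returned, so these field names and both helper functions are hypothetical. The output is a Markdown note with YAML front matter, which Obsidian reads as note properties.

```typescript
// Assumed result shape; field names are illustrative, not the server's confirmed API.
interface ExtractedArticle {
  title: string;
  excerpt: string;
  textContent: string;
}

// Render a value as a single-line, double-quoted YAML string.
function yamlString(value: string): string {
  const singleLine = value.replace(/\s+/g, " ").trim();
  const escaped = singleLine.replace(/\\/g, "\\\\").replace(/"/g, '\\"');
  return `"${escaped}"`;
}

// Build the note body: front matter, a heading, then the article text.
function toObsidianNote(article: ExtractedArticle, sourceUrl: string): string {
  const frontMatter = [
    "---",
    `title: ${yamlString(article.title)}`,
    `source: ${yamlString(sourceUrl)}`,
    `excerpt: ${yamlString(article.excerpt)}`,
    "---",
  ].join("\n");
  return `${frontMatter}\n\n# ${article.title}\n\n${article.textContent.trim()}\n`;
}

// Replace characters that commonly cause trouble in file names or Obsidian links.
function toFileName(title: string): string {
  const safe = title
    .replace(/[\\\/:*?"<>|#^[\]]/g, " ")
    .replace(/\s+/g, " ")
    .trim();
  return `${safe || "Untitled"}.md`;
}

const sample: ExtractedArticle = {
  title: "Example Domain",
  excerpt: "This domain is for use in illustrative examples.",
  textContent: "This domain is for use in illustrative examples in documents.",
};
console.log(toFileName(sample.title));
console.log(toObsidianNote(sample, "https://example.com"));
```

Keeping the formatting in pure functions, separate from fetching and file writing, makes it easy to adjust the front matter fields to match an existing vault's conventions.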
Related MCP Servers
Fetch
by modelcontextprotocol
A Model Context Protocol server that provides web content fetching capabilities, enabling LLMs to retrieve and process content from web pages.
markdownify-mcp
by zcaceres
Markdownify is a Model Context Protocol (MCP) server that converts various file types and web content to Markdown format.
mcp-playwright
by executeautomation
A Model Context Protocol server that provides browser automation capabilities using Playwright.
playwright-mcp
by microsoft
Playwright MCP is a Model Context Protocol server that provides browser automation capabilities using Playwright, enabling LLMs to interact with web pages through structured accessibility snapshots.
mcp-server-weibo
by Selenium39
The Weibo MCP Server (TypeScript Version) is designed for scraping Weibo user information, feeds, and search functionality using the Model Context Protocol.
web-eval-agent
by Operative-Sh
operative.sh's MCP Server is a tool for autonomous debugging of web applications directly from your code editor.
browser-tools-mcp
by AgentDeskAI
BrowserTools MCP is a browser monitoring and interaction tool that enhances AI applications by capturing and analyzing browser data through a Chrome extension.