novada-mcp
One MCP server. All web data. Search, scrape, crawl, proxy, and AI research — in a single npx command.
README
novada
One MCP server. All web data. Search, scrape, crawl, proxy, and AI research — in a single
npxcommand.
The Problem
AI agents need web data but the tools are fragmented:
- Tavily does search but can't scrape or proxy
- Firecrawl does scrape but can't search or proxy
- BrightData does everything but ships 69 tools that bloat your context window
- Building it yourself means maintaining proxies, anti-bot bypass, retry logic, and 10 different APIs
The Fix
npx novada-mcp
One server. One API key. Tools that cover every web data need an AI agent has:
| Need | Tool | What it does |
|---|---|---|
| Find information | novada_search |
Web search across Google, Bing, DuckDuckGo, Yandex, Yahoo |
| Read a page | novada_extract |
Any URL → clean markdown, batch up to 10 in parallel |
| Deep research | novada_research |
One call → parallel searches → dedup → cited multi-source report |
| Crawl a site | novada_crawl |
BFS/DFS up to 20 pages with regex path filtering |
| Discover URLs | novada_map |
Sitemap + BFS discovery without reading content |
| Platform data | novada_scrape |
Amazon, LinkedIn, TikTok, GitHub, Zillow — 129 platforms |
| Monitor changes | novada_monitor |
Track price/content/availability changes between checks |
| Verify claims | novada_verify |
Parallel fact-checking against live web sources |
| Raw HTML | novada_unblock |
JS render or full browser CDP for bot-protected pages |
| Browser automation | novada_browser |
Navigate, click, type, fill forms, screenshot in cloud browser |
| Browser flows | novada_browser_flow |
Multi-step browser automation sequences |
| Proxy credentials | novada_proxy |
Residential, mobile, ISP, datacenter, static, dedicated — 195 countries |
| AI brand monitoring | novada_ai_monitor |
Check how ChatGPT, Perplexity, Grok, Claude, Gemini mention your brand |
| Health check | novada_health |
Check which API products are active on your key |
| Async scraping | novada_scraper_submit |
Submit async scraping task → poll → retrieve results |
What Makes This Different
novada_research is unique. No other MCP server turns one question into a cited multi-source report. It searches across Google, Bing, and DuckDuckGo in parallel, deduplicates, extracts full content from the top 5 sources, and synthesizes with citations. One tool call replaces an entire research workflow. Depth options: quick (3 queries), deep (5-6), comprehensive (8-10).
Auto-escalation handles anti-bot automatically. Static fetch → JS render → Browser CDP. Known hard targets (Amazon, LinkedIn, G2, Zillow, Glassdoor, Walmart, Instagram, TikTok, Shein) skip straight to the right method based on a 30+ domain registry. You never think about Cloudflare, DataDome, Kasada, or PerimeterX — the tool handles it.
Agent-first design (8.5/10 benchmark score). Every response includes agent_instruction with structured next-step guidance, source field (live/cache/wayback), structured errors with failure_class, cross-tool hints suggesting better alternatives, and a ## Agent Action block with machine-parseable status codes.
Quick Start
-
Get a key at novada.com
-
Add to your MCP client:
Claude Code:
claude mcp add novada -e NOVADA_API_KEY=your_key -- npx -y novada-mcp
Claude Desktop / Cursor / VS Code / Windsurf:
{
"mcpServers": {
"novada": {
"command": "npx",
"args": ["-y", "novada-mcp"],
"env": { "NOVADA_API_KEY": "your_key" }
}
}
}
- Try it:
novada_search({query: "Claude MCP tutorials", num: 5})
novada_research({question: "How do MCP servers work?", depth: "deep"})
novada_extract({url: "https://news.ycombinator.com", format: "markdown"})
novada_monitor({url: "https://amazon.com/dp/B09...", fields: ["price", "availability"]})
Tool Reference
Search & Research
| Tool | Purpose | Key Params | Example |
|---|---|---|---|
novada_search |
Web search via 5 engines | query, engine, num, time_range, include_domains |
novada_search({query: "best API gateways 2026", engine: "google", num: 10}) |
novada_research |
Multi-source parallel research | question, depth, focus |
novada_research({question: "Kong vs Traefik vs APISIX", depth: "comprehensive", focus: "performance benchmarks"}) |
novada_verify |
Fact-check claims against web | claim |
novada_verify({claim: "GPT-5 was released in 2026"}) |
Extract & Crawl
| Tool | Purpose | Key Params | Example |
|---|---|---|---|
novada_extract |
Extract content from URL(s) | url (single or array), format, render, fields |
novada_extract({url: "https://example.com", fields: ["price", "rating"]}) |
novada_crawl |
Crawl multiple pages from a domain | url, max_pages, strategy, select_paths |
novada_crawl({url: "https://docs.example.com", max_pages: 10, select_paths: "/api/.*"}) |
novada_map |
Discover URLs on a site | url, search, limit |
novada_map({url: "https://example.com", search: "pricing"}) |
novada_monitor |
Detect page changes over time | url, fields |
novada_monitor({url: "https://amazon.com/dp/B09...", fields: ["price"]}) |
Structured Platform Data
novada_scrape supports 129 platforms with structured data extraction. Returns clean tabular records, not raw HTML.
| Platform | Operation Examples | Data Returned |
|---|---|---|
| Amazon | amazon_product_keywords, amazon_product_asin |
Title, price, rating, reviews, BSR, availability |
linkedin_company_information_url, linkedin_profile_url |
Company info, employee count, profile data | |
| TikTok | tiktok_posts_url, tiktok_profile_url |
Video stats, engagement, profile data |
| GitHub | github_repository_repo-url |
Stars, forks, issues, description, languages |
reddit_subreddit_posts |
Posts, scores, comments, timestamps | |
| Zillow | zillow_property_url |
Price, beds, baths, sqft, Zestimate |
| Glassdoor | glassdoor_company_reviews_url |
Reviews, ratings, salary data |
| YouTube | youtube_video_search_label |
Video titles, views, duration, channel |
instagram_profile_url |
Posts, followers, engagement | |
| Google Shopping | google_shopping_search |
Products, prices, merchants |
Full platform list: call novada_discover or read the novada://scraper-platforms MCP resource.
Proxy Network
Route your own HTTP requests through Novada's proxy infrastructure. 100M+ IPs across 195 countries.
| Tool | Proxy Type | Best For |
|---|---|---|
novada_proxy_residential |
Real home ISP IPs | Anti-bot bypass, geo-restricted content |
novada_proxy_isp |
ISP-assigned IPs | Social media, ecommerce platforms |
novada_proxy_datacenter |
Datacenter IPs | High-volume, non-protected targets |
novada_proxy_mobile |
4G/5G mobile IPs | Mobile-targeted content, app APIs |
novada_proxy_static |
Dedicated static ISP IP | Account management, login flows |
novada_proxy_dedicated |
Exclusive datacenter IP | High-trust platforms, clean reputation |
Each proxy tool returns connection credentials in url, env, or curl format. Params: country (ISO 2-letter), city (optional), session_id (for sticky sessions).
Browser Automation
| Tool | Purpose | Example |
|---|---|---|
novada_browser |
Full browser interaction via CDP | novada_browser({actions: [{type: "navigate", url: "..."}, {type: "click", selector: "#btn"}]}) |
novada_browser_flow |
Multi-step automation sequences | Click, scroll, wait, type, screenshot — up to 20 actions per call |
novada_unblock |
Raw rendered HTML from protected pages | novada_unblock({url: "...", method: "browser"}) |
Sessions persist across calls via session_id. Cookies, login state, and page context are maintained.
Use Cases
AI Agent Research & RAG Pipelines
novada_research({question: "What are the latest developments in quantum computing?", depth: "comprehensive"})
Returns a cited multi-source report. Feed directly into RAG vector stores or use as context for agent reasoning.
E-Commerce Price Monitoring
novada_monitor({url: "https://amazon.com/dp/B0XXXXXX", fields: ["price", "availability"]})
First call records baseline. Call again later — returns field-level diffs with percentage change (e.g., price: $999 → $899, ↓10%).
Competitive Intelligence
novada_scrape({platform: "amazon.com", operation: "amazon_product_keywords", params: {keyword: "wireless earbuds"}, limit: 20})
Get structured product data (price, rating, reviews, BSR) for competitive analysis across 129 platforms.
Lead Generation
novada_scrape({platform: "linkedin.com", operation: "linkedin_company_information_url", params: {url: "https://linkedin.com/company/..."}, limit: 1})
Extract company info, employee count, and industry data from LinkedIn company pages.
Content Extraction for LLM Training
novada_crawl({url: "https://docs.example.com", max_pages: 20, select_paths: "/docs/.*"})
Crawl documentation sites and extract clean markdown for fine-tuning datasets or knowledge bases.
AI Brand Monitoring
novada_ai_monitor({brand: "YourProduct", models: ["chatgpt", "perplexity", "claude"]})
Check how AI models reference your brand: sentiment, claims, competitor mentions, source URLs.
Geo-Targeted Data Collection
novada_proxy_residential({country: "DE", city: "berlin", format: "curl"})
Get proxy credentials for any of 195 countries. Use with your own HTTP client for geo-specific content access.
Honest Comparison
| Novada | Firecrawl | Tavily | BrightData | |
|---|---|---|---|---|
| Tools | 25 | 14 | 2 | 69 |
| Search engines | 5 | 0 | 1 | 3 |
| Multi-source research | Yes | No | No | No |
| Proxy as MCP tool | Yes | No | No | No |
| Auto anti-bot escalation | Yes | No | N/A | No |
| Change monitoring | Yes | No | No | No |
| Platform scraping | 129 platforms | No | No | 437 platforms |
| Browser automation | Yes (CDP) | No | No | Yes |
| MCP Prompts & Resources | Yes (5+4) | No | No | No |
| Hosted MCP (no install) | No | No | No | Yes |
| Agent-first score | 8.5/10 | 6.0 | 6.0 | N/A |
What we don't have yet: hosted HTTP endpoint (requires terminal install for now), and some Scraper API platforms need separate activation. BrightData has more structured scrapers (437 vs 129).
Anti-Bot Support
Novada automatically handles these anti-bot systems via its escalation chain:
| Anti-Bot System | Detection | Escalation Method |
|---|---|---|
| Cloudflare | cf_chl_, __cf_bm, challenge pages |
Auto-render via Web Unblocker |
| DataDome | datadome cookie/script |
Auto-render |
| Kasada | Script path detection | Browser CDP |
| PerimeterX | _px cookie variants |
Auto-render |
| Akamai | _abck, ak_bmsc cookies |
Auto-render |
| Imperva/Incapsula | incap_ses_, visid_incap_ |
Auto-render |
30+ domains are pre-tagged in the hard target registry — these skip static fetch entirely and go straight to the right method.
Configuration
| Variable | Required | Purpose |
|---|---|---|
NOVADA_API_KEY |
Yes | API key — covers search, extract, crawl, scrape, research, verify, monitor |
NOVADA_BROWSER_WS |
No | Browser API WebSocket URL for novada_browser and novada_browser_flow |
NOVADA_PROXY_USER |
No | Proxy username for novada_proxy_* tools |
NOVADA_PROXY_PASS |
No | Proxy password |
NOVADA_PROXY_ENDPOINT |
No | Proxy host:port endpoint |
NOVADA_WEB_UNBLOCKER_KEY |
No | Separate key for Web Unblocker (if different from main API key) |
NOVADA_TOOLS |
No | Load specific tools only: "extract,search,research,monitor" |
NOVADA_GROUPS |
No | Load tool groups: "search,proxy,browser" — groups: search, proxy, browser, scraper, health |
Links
- Docs + API key: novada.com
- npm: npmjs.com/package/novada-mcp
- GitHub: github.com/NovadaLabs/novada-mcp
- Issues: github.com/NovadaLabs/novada-mcp/issues
- Tool details: call
novada_discoverornovada_healthfrom any MCP client
License
MIT
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。