Hugging Face MCP Server Details

Hugging Face Official MCP Server connects your large language models (LLMs) to the Hugging Face Hub and thousands of Gradio AI Applications, enabling seamless MCP (Model Context Protocol) integration across multiple transports. It supports STDIO, SSE (to be deprecated but still commonly deployed), StreamableHTTP, and StreamableHTTPJson, with the Web Application allowing dynamic tool management and status updates. This MCP server is designed to be run locally or in Docker, and it provides integrations with Claude Desktop, Claude Code, Gemini CLI (and its extension), VSCode, and Cursor, making it easy to configure and manage MCP-enabled tools and endpoints. Tools such as hf_doc_search and hf_doc_fetch can be enabled to enhance document discovery, and an optional Authenticate tool can be included to handle OAuth challenges when called.

Use Case

The MCP Server acts as a bridge between LLM clients and MCP-enabled endpoints, orchestrating tool availability and communication across multiple transports. It is capable of running in STDIO, SSE, Streamable HTTP, or JSON-mode HTTP, allowing flexible deployments from local development to production-grade configurations. The Web UI lets you switch tools on and off, and the server can automatically enable document-related tools when document search is enabled. Example deployment patterns include installing via Claude or Gemini CLI, or integrating with VSCode or Cursor for seamless tooling within development environments.

Key usage patterns from the documentation include:

  • Running locally with npx to start in different modes:
  • npx @llmindset/hf-mcp-server       # Start in STDIO mode
    npx @llmindset/hf-mcp-server-http # Start in Streamable HTTP mode
    npx @llmindset/hf-mcp-server-json # Start in Streamable HTTP (JSON RPC) mode

  • Running with Docker:
  • docker pull ghcr.io/evalstate/hf-mcp-server:latest
    docker run --rm -p 3000:3000 ghcr.io/evalstate/hf-mcp-server:latest

  • Installing in Claude Desktop / Claude Code / Gemini CLI / VSCode / Cursor, with example commands:
  • claude mcp add hf-mcp-server -t http https://huggingface.co/mcp?login

    claude mcp add hf-mcp-server \
    -t http https://huggingface.co/mcp \
    -H "Authorization: Bearer <YOUR_HF_TOKEN>"

    gemini mcp add -t http huggingface https://huggingface.co/mcp?login

    gemini extensions install https://github.com/huggingface/hf-mcp-server

    To configure VSCode manually, the example mcp.json snippet is shown as:

    "huggingface": {
    "url": "https://huggingface.co/mcp",
    "headers": {
    "Authorization": "Bearer <YOUR_HF_TOKEN>"
    }

    Similarly, Cursor users can install via a provided link and use a config snippet like:

    "huggingface": {
    "url": "https://huggingface.co/mcp",
    "headers": {
    "Authorization": "Bearer <YOUR_HF_TOKEN>"
    }

    Available Tools (3)

    Examples & Tutorials

    Real examples and usage patterns directly from the docs:

  • Install and connect via Claude Desktop / Claude Code:
  • claude mcp add hf-mcp-server -t http https://huggingface.co/mcp?login

    claude mcp add hf-mcp-server \
    -t http https://huggingface.co/mcp \
    -H "Authorization: Bearer <YOUR_HF_TOKEN>"

  • Install via Gemini CLI:
  • gemini mcp add -t http huggingface https://huggingface.co/mcp?login

  • Install the Gemini CLI extension that bundles the MCP server:
  • gemini extensions install https://github.com/huggingface/hf-mcp-server

  • VSCode integration snippet (mcp.json):
  • "huggingface": {
    "url": "https://huggingface.co/mcp",
    "headers": {
    "Authorization": "Bearer <YOUR_HF_TOKEN>"
    }

  • Cursor integration snippet (mcp.json):
  • "huggingface": {
    "url": "https://huggingface.co/mcp",
    "headers": {
    "Authorization": "Bearer <YOUR_HF_TOKEN>"
    }

  • Run locally in different modes:
  • npx @llmindset/hf-mcp-server       # Start in STDIO mode
    npx @llmindset/hf-mcp-server-http # Start in Streamable HTTP mode
    npx @llmindset/hf-mcp-server-json # Start in Streamable HTTP (JSON RPC) mode

  • Docker-based run:
  • docker build -t hf-mcp-server .

    docker run --rm -p 3000:3000 -e DEFAULT_HF_TOKEN=hf_xxx hf-mcp-server

    Installation Guide

    Follow these steps from the documentation to install and run the MCP Server:

  • Install and run locally with npx (choose mode):
  • npx @llmindset/hf-mcp-server       # Start in STDIO mode
    npx @llmindset/hf-mcp-server-http # Start in Streamable HTTP mode
    npx @llmindset/hf-mcp-server-json # Start in Streamable HTTP (JSON RPC) mode

  • Run with Docker:
  • docker pull ghcr.io/evalstate/hf-mcp-server:latest
    docker run --rm -p 3000:3000 ghcr.io/evalstate/hf-mcp-server:latest

    docker build -t hf-mcp-server .

    docker run --rm -p 3000:3000 -e DEFAULT_HF_TOKEN=hf_xxx hf-mcp-server

  • Transport endpoints overview:

  • STDIO uses stdin/stdout; SSE is available at /sse with /message endpoint; Streamable HTTP at /mcp (JSON mode when using streamableHttpJson).

  • Integration Guides

    Frequently Asked Questions

    Is this your MCP?

    Claim ownership and get verified badge

    Repository Stats
    Important Notes

    SSE is marked as To be deprecated, but it is still commonly deployed. The Web Application can switch tools on and off, and in certain transports (STDIO, SSE, StreamableHTTP) the ToolListChangedNotification is sent when tools change. In JSON mode for StreamableHTTPJSON, a tool may not be listed when the client requests tool lists. Environment variables include MCP_STRICT_COMPLIANCE (GET 405 rejects in JSON mode) and AUTHENTICATE_TOOL (whether to include an Authenticate tool).

    Prerequisites

    pnpm is used for build and development; Corepack is used to ensure everyone uses the same pnpm version (10.12.3).

    Details
    Last Updated1/2/2026
    SourceGitHub

    Compare Alternatives

    Similar MCP Tools

    6 related tools
    Anki MCP Server

    Anki MCP Server

    A Model Context Protocol (MCP) server that enables AI assistants to interact with Anki, the spaced repetition flashcard application. The Anki MCP Server allows AI models to access Anki's card data, enabling features like automated flashcard creation, review, and management.

    1MCP Agent

    1MCP Agent

    A unified Model Context Protocol server implementation that aggregates multiple MCP servers into one. The 1mcp-app/agent is an open-source project that provides a single entry point for multiple MCP servers, making it easier to manage and interact with various AI models and tools.

    Roundtable AI MCP Server

    Roundtable AI MCP Server

    Roundtable AI MCP Server is a zero-configuration local MCP server that unifies multiple AI coding assistants (Codex, Claude Code, Cursor, Gemini) through intelligent auto-discovery and a standardized interface. It coordinates specialized sub-agents from within your IDE to solve engineering problems in parallel, sharing context and synthesizing responses into a single, high-quality output. This documentation details installation, available MCP tools, integration with popular IDEs, and a broad ecosystem of specialized tools and CLIs that can be invoked as part of a roundtable-powered workflow, enabling developers to delegate tasks to the right AI for each facet of a problem without leaving their development environment.

    MCPJungle

    MCPJungle

    MCPJungle is a self-hosted MCP Gateway and Registry for AI agents. It serves as a central registry and gateway to manage Model Context Protocol (MCP) servers and the tools they expose. By consolidating MCP server registration, tool discovery, and access control, MCPJungle enables AI agents and clients to discover, group, and securely invoke tools from a single, unified gateway. The project provides a CLI, Docker-based deployment options, and enterprise-ready features such as tool grouping, access control, and observability to streamline MCP-based workflows across organizations.

    mcpmcp-server

    mcpmcp-server

    mcpmcp-server is a focused solution for discovering, setting up, and integrating MCP servers with your favorite clients to unlock AI-powered workflows. It streamlines how you connect MCP-powered servers to popular clients, enabling seamless AI-assisted interactions across your daily tools. The project emphasizes an approachable, config-driven approach to linking MCP servers with clients like Claude Desktop, while directing you to the homepage for variations across apps and platforms. This README highlights a practical JSON configuration example and notes on supported environments, helping you get started quickly and confidently.

    Imagen3-MCP

    Imagen3-MCP

    Imagen3-MCP is an image generation service based on Google's Imagen 3.0 that exposes its functionality through MCP (Model Control Protocol). The project provides a server to run a local MCP service that accesses Google Gemini-powered image generation, enabling developers to integrate advanced image synthesis into their applications. The documentation covers prerequisites (Gemini API key), installation steps for Cherry Studio, and a Cursor-based JSON configuration example for embedding the MCP server in broader tooling. This MCP is designed to be deployment-friendly, with configurable environment variables and optional proxy settings to adapt to various network environments.