Is Video Vision free?

Yes, Video Vision is free to use.

How do I install Video Vision?

Video Vision is a local plugin. Install it using PyPI package: video-vision-mcp and add the generated configuration snippet to your AI app's MCP config file. Then restart your AI app.

Is Video Vision safe to use?

Yes. Video Vision passed MCP Marketplace's automated security scan with a score of 10/10 (low risk). Every server on MCP Marketplace is security-scanned before it's listed; see the full security report on this page for the findings and permissions.

What credentials does Video Vision need?

Video Vision requires the following credentials or environment variables: VIDEO_MCP_ENV, GEMINI_API_KEY, OPENAI_API_KEY, GROQ_API_KEY, JIRA_URL, JIRA_USERNAME, JIRA_API_TOKEN. You can find setup instructions on the server detail page.

What AI apps work with Video Vision?

Video Vision uses the Model Context Protocol (MCP) and works with any MCP-compatible AI app, including Claude, ChatGPT / Codex, Gemini, Copilot, Cursor, and more.

Back to Browse

Video Vision MCP Server

by KitDevUA

Developer ToolsLow Risk10.0MCP RegistryLocal

Free

Server data from the Official MCP Registry

Analyze any video (file, URL, or Jira attachment) into frames + transcript for Claude Code

About

Analyze any video (file, URL, or Jira attachment) into frames + transcript for Claude Code

Security Report

10.0

Low Risk10.0Low Risk

Valid MCP server (1 strong, 4 medium validity signals). No known CVEs in dependencies. Package registry verified. Imported from the Official MCP Registry.

12 files analyzed · 1 issue found

Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.

Permissions Required

This plugin requests these system permissions. Most are normal for its category.

file_system

Check that this permission is expected for this type of plugin.

Shell Command Execution

Runs commands on your machine. Be cautious — only use if you trust this plugin.

env_vars

Check that this permission is expected for this type of plugin.

HTTP Network Access

Connects to external APIs or services over the internet.

What You'll Need

Set these up before or after installing:

Optional path to a .env file with config (Jira creds, API keys).Optional

Environment variable: VIDEO_MCP_ENV

Enables tier-3 native Gemini video analysis (default when set).Required

Environment variable: GEMINI_API_KEY

Enables tier-2 OpenAI Whisper transcription.Required

Environment variable: OPENAI_API_KEY

Enables tier-2 Groq whisper-large-v3 transcription.Required

Environment variable: GROQ_API_KEY

Jira base URL — only for the jira_issue_key input path.Optional

Environment variable: JIRA_URL

Jira account email — only for the jira_issue_key input path.Optional

Environment variable: JIRA_USERNAME

Jira API token — only for the jira_issue_key input path.Required

Environment variable: JIRA_API_TOKEN

How to Install

Add this to your MCP configuration file:

{
  "mcpServers": {
    "io-github-kitdevua-video-vision-mcp": {
      "env": {
        "JIRA_URL": "your-jira-url-here",
        "GROQ_API_KEY": "your-groq-api-key-here",
        "JIRA_USERNAME": "your-jira-username-here",
        "VIDEO_MCP_ENV": "your-video-mcp-env-here",
        "GEMINI_API_KEY": "your-gemini-api-key-here",
        "JIRA_API_TOKEN": "your-jira-api-token-here",
        "OPENAI_API_KEY": "your-openai-api-key-here"
      },
      "args": [
        "video-vision-mcp"
      ],
      "command": "uvx"
    }
  }
}

Documentation

View on GitHub

From the project's GitHub README.

video-vision-mcp

An MCP server that gives Claude Code the ability to analyze any video — a local file, a URL, or a Jira ticket attachment — through one set of tools.

Claude can't watch video natively (only text + the first frame of an image). This server converts a video into sampled frame images + an audio transcript, or — when a Gemini key is present — a native Gemini analysis of the whole video. It works alongside mcp-atlassian and shares its .env.

Scenario: open a Jira ticket with a video bug report → one command (analyze_video jira_issue_key=DEV-123) → you see the frames and the transcript (or Gemini's analysis if a key is configured), without juggling two MCP servers.

Three backend tiers (auto-selected)

Tier	Needs	What it does
1 — local (default)	nothing	`ffmpeg` frames + `whisper.cpp` transcript. Free, fully local, always works.
2 — cloud ASR	`OPENAI_API_KEY` or `GROQ_API_KEY`	Local frames, but transcription via OpenAI Whisper / Groq for higher quality.
3 — native Gemini	`GEMINI_API_KEY`	Gemini ingests the whole video (visual + audio) in one call, with MM:SS timestamps. Default when the key is set.

Precedence: Gemini > OpenAI > Groq > local. Set VIDEO_MCP_DISABLE_GEMINI=true to force tiers 1/2 even with a Gemini key. The backend used is named in every result.

Privacy: tier 1 never uploads anything. Tiers 2/3 print a one-time notice in the session the first time video content is sent to a third party.

Tools

analyze_video — frames + transcript + metadata (the main tool).
get_video_transcript_only — transcript text only.
extract_frames_at — frames at specific timestamps ("00:42", "1:05", 12.5).
list_recent_analyses — cached analyses + backend used.
compare_backends — same video via tier 1 and tier 3 side by side.

Install

Requires Python ≥ 3.10. A single install pulls everything — backends, plus the ffmpeg and whisper.cpp dependencies. Nothing is ever installed globally on your machine (no brew/apt/winget, no sudo).

Use it (recommended)

With uv you don't install it explicitly — uvx runs the published package on demand (see Register in Claude Code). To install into an environment instead:

uv pip install video-vision-mcp     # or: pip install video-vision-mcp

From source (development)

git clone https://github.com/KitDevUA/video-vision-mcp.git
cd video-vision-mcp
uv venv && source .venv/bin/activate
uv pip install -e ".[dev]"          # all backends bundled

Dependencies — fully self-contained

ffmpeg / ffprobe: if they are already on your PATH, those system binaries are used. Otherwise the bundled static-ffmpeg package supplies them (fetched once into its own local cache — never a system-wide install).
whisper.cpp (tier 1 transcription): shipped as the bundled pywhispercpp binding (prebuilt wheels; builds from source only if no wheel exists for your platform/Python). A whisper-cli already on PATH is used if present.
whisper model: the ggml model (base by default) downloads from Hugging Face into the cache on first transcription. Override with VIDEO_MCP_WHISPER_MODEL (tiny/base/small/medium/large-v3) or VIDEO_MCP_WHISPER_MODEL_PATH.
cloud-only: set OPENAI_API_KEY / GROQ_API_KEY (tier 2) or GEMINI_API_KEY (tier 3); whisper.cpp is then never invoked.

Configure

cp env.example .env
# edit .env — nothing is required for tier 1

See env.example for every variable. The .env format matches mcp-atlassian, so Jira creds (JIRA_URL / JIRA_USERNAME / JIRA_API_TOKEN) can be shared.

Register in Claude Code

Add to your project .mcp.json (or global config), next to mcp-atlassian — see .mcp.json.example:

{
  "mcpServers": {
    "video-vision": {
      "command": "uvx",
      "args": ["video-vision-mcp"],
      "env": { "VIDEO_MCP_ENV": "/abs/path/to/.env" }
    }
  }
}

uvx downloads and runs the published package automatically — no manual install step. VIDEO_MCP_ENV is optional (tier 1 needs no keys); point it at your .env if you use Jira or cloud backends. For local development against a checkout, use "args": ["--from", "/abs/path/to/video-vision-mcp", "video-vision-mcp"] instead. Restart Claude Code; the video-vision tools then appear.

Cache

Results are cached at ~/.cache/video-vision-mcp/ keyed by (file hash, backend) — re-analyzing the same video is instant, and switching backends keeps each result separately. Downloaded URLs/Jira files and whisper models live under the same dir. Override with VIDEO_MCP_CACHE_DIR.

How it fits with mcp-atlassian

mcp-atlassian can download a Jira attachment but can't analyze it. This server takes over from there: pass jira_issue_key and it fetches the attachment over Jira REST itself (same creds), so you stay in one tool call. If the Jira token is missing/invalid you get a clear error pointing at .env, not a silent failure.

Reviews

No reviews yet

Be the first to review this server!

More Developer Tools MCP Servers

Fetch

Free

by Modelcontextprotocol · Developer Tools

Web content fetching and conversion for efficient LLM usage

Git

Free

by Modelcontextprotocol · Developer Tools

Read, search, and manipulate Git repositories programmatically

Toleno

Free

by Toleno · Developer Tools

Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.

Video Vision MCP Server

About

Security Report

Findings (1)

Permissions Required

What You'll Need

How to Install

Documentation

video-vision-mcp

Three backend tiers (auto-selected)

Tools

Install

Use it (recommended)

From source (development)

Dependencies — fully self-contained

Configure

Register in Claude Code

Cache

How it fits with mcp-atlassian

Reviews

No reviews yet

More Developer Tools MCP Servers

Fetch

Git

Toleno

mcp-creator-python

MarkItDown

MCP Marketplace