by jordanrendric · MCP Server · ★ 640
Indexed by AgentSkillsHub · Auto-synced every 8h
Claude Code Video Vision

Give Claude the ability to watch and understand videos. A Claude Code plugin that extracts frames via ffmpeg and processes audio via multiple backends (Gemini API, local Whisper, or OpenAI API). Claude receives frames as images and audio transcription with timestamps; the plugin is a perception layer, not an interpretation layer.

Features

Multimodal perception: Claude sees video frames directly and reads audio transcriptions with timestamps.
Flexible backends: choose between cloud APIs or fully local processing.
Adaptive extraction: Claude adjusts fps, time range, and resolution based on your question.
Auto-installation: Whisper models download automatically on first use.
Interactive setup wizard: walks you through configuration.

Quick Start

Install the plugin
Inside Claude Code, run the install commands one at a time (the exact commands are in the GitHub repository). The MCP server then auto-installs from npm on first use; no build step is required.

Alternative: local development

Configure I
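The adaptive frame extraction described above (fps, time range, and resolution passed to ffmpeg) can be illustrated with a minimal sketch. This is not the plugin's actual code: the `FrameExtractionOptions` interface, its field names, and `buildFrameExtractionArgs` are hypothetical names introduced for illustration. The only parts taken from ffmpeg itself are the `-ss`, `-t`, `-i`, and `-vf` options and the `fps` and `scale` filters.

```typescript
// Hypothetical option names; the plugin's real parameters may differ.
interface FrameExtractionOptions {
  input: string;         // path to the video file
  fps: number;           // frames to sample per second of video
  startSec?: number;     // optional start of the time range, in seconds
  endSec?: number;       // optional end of the time range, in seconds
  width?: number;        // optional output width in pixels
  outputPattern: string; // numbered output files, e.g. "frame_%04d.jpg"
}

// Builds the argument list for an ffmpeg call that samples frames as images.
function buildFrameExtractionArgs(opts: FrameExtractionOptions): string[] {
  if (!(opts.fps > 0)) {
    throw new RangeError("fps must be positive");
  }
  const start = opts.startSec ?? 0;
  if (start < 0) {
    throw new RangeError("startSec must not be negative");
  }
  if (opts.endSec !== undefined && opts.endSec <= start) {
    throw new RangeError("endSec must be greater than startSec");
  }

  const args: string[] = [];
  // Placing -ss and -t before -i applies them to the input (seek, then limit duration).
  if (start > 0) args.push("-ss", String(start));
  if (opts.endSec !== undefined) args.push("-t", String(opts.endSec - start));
  args.push("-i", opts.input);

  // fps=N samples N frames per second; scale=W:-1 resizes and keeps the aspect ratio.
  const filters = [`fps=${opts.fps}`];
  if (opts.width !== undefined) filters.push(`scale=${opts.width}:-1`);
  args.push("-vf", filters.join(","), opts.outputPattern);

  return args;
}
```

The returned array could be handed to Node's `child_process.spawn("ffmpeg", args)`. Keeping argument construction separate from process spawning makes it possible to check the sampling logic without ffmpeg installed.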
| Field | Value |
|---|---|
| Stars | 640 |
| Forks | 77 |
| Language | TypeScript |
| Category | MCP Server |
| License | MIT |
| Quality Score | 55.92/100 |
| Open Issues | 10 |
| Last Updated | 2026-05-18 |
| Created | 2026-03-31 |
| Platforms | claude-code, gemini, mcp, node |
| Est. Tokens | ~1048k |
Looking for a claude-video-vision alternative? If you're comparing claude-video-vision with other MCP server tools, these 6 projects are the closest alternatives on Agent Skills Hub, ranked by topic overlap, star count, and community traction.
Overture is an open-source, locally running web interface delivered as an MCP (Model Context Protocol) server
🏛 [UNDER CONSTRUCTION] A (roman) claude plugin marketplace
MCP Server that enables Claude code to interact with Gemini
A single hub to find Claude Skills, Agents, Commands, Hooks, Plugins, and Marketplace collections to extend Cl
Open-source MCP server for LinkedIn. Give Claude and any MCP-compatible AI agent access to profiles, companies
The ultimate RAG for your monorepo. Query, understand, and edit multi-language codebases with the power of AI
claude-video-vision gives Claude the ability to watch and understand videos. It is a Claude Code plugin with frame extraction and multimodal audio analysis, categorized as an MCP Server with 640 GitHub stars.
claude-video-vision is primarily written in TypeScript. It covers topics such as claude-code, claude-code-plugin, and ffmpeg.
You can find installation instructions and usage details in the claude-video-vision GitHub repository at github.com/jordanrendric/claude-video-vision. The project has 640 stars and 77 forks, indicating an active community.
claude-video-vision is released under the MIT license, making it free to use and modify according to the license terms.
The top alternatives to claude-video-vision on Agent Skills Hub include Overture, claude-emporium, and gemini-mcp. Each takes a different approach to the same problem space; compare them side by side by stars, quality score, and community activity.