by jordanrendric · MCP Server · ★ 455
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
Claude Code Video Vision Give Claude the ability to watch and understand videos. A Claude Code plugin that extracts frames via ffmpeg and processes audio via multiple backends (Gemini API, local Whisper, or OpenAI API). Claude receives frames as images and audio transcription with timestamps — the plugin is a perception layer, not an interpretation layer. Features Multimodal perception — Claude sees video frames directly and reads audio transcriptions with timestamps Flexible backends — Choose between cloud APIs or fully local processing Adaptive extraction — Claude adjusts fps, time range, and resolution based on your question Auto-installation — Whisper models download automatically on first use Interactive setup wizard — walks you through configuration Quick Start Install the plugin Inside Claude Code, run these commands one at a time: Then: The MCP server will auto-install via from npm on first use — no build step required. Alternative: local development Configure I
| Stars | 455 |
| Forks | 58 |
| Language | TypeScript |
| Category | MCP Server |
| License | MIT |
| Quality Score | 55.92/100 |
| Open Issues | 4 |
| Last Updated | 2026-05-04 |
| Created | 2026-03-31 |
| Platforms | claude-code, gemini, mcp, node |
| Est. Tokens | ~1037k |
These tools work well together with claude-video-vision for enhanced workflows:
Looking for a claude-video-vision alternative? If you're comparing claude-video-vision with other mcp server tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Cross-Code Organizer (formerly Claude Code Organizer): cross-harness config dashboard for Claude Code, Codex C
Dashboard to manage Claude Code memories, configs, and MCP servers — security scanner for tool poisoning, cont
🏛 [UNDER CONSTRUCTION] A (roman) claude plugin marketplace
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview prepara
MCP server and Claude plugin for Postgres skills and documentation. Helps AI coding tools generate better Post
Natural (2-way) voice conversations with Claude Code
Explore other popular mcp server tools:
claude-video-vision is Give Claude the ability to watch and understand videos — Claude Code plugin with frame extraction and multimodal audio analysis. It is categorized as a MCP Server with 455 GitHub stars.
claude-video-vision is primarily written in TypeScript. It covers topics such as claude-code, claude-code-plugin, ffmpeg.
You can find installation instructions and usage details in the claude-video-vision GitHub repository at github.com/jordanrendric/claude-video-vision. The project has 455 stars and 58 forks, indicating an active community.
claude-video-vision is released under the MIT license, making it free to use and modify according to the license terms.
The top alternatives to claude-video-vision on Agent Skills Hub include cross-code-organizer, claude-code-organizer, claude-emporium. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.