Best AI Agent Skills for Data Pipeline

Find AI tools for building data pipelines, ETL processes, and data transformation workflows.

Top 10 Data Pipeline Tools

1 huginn by huginn
★ 49.0k Ruby AI Tool

Create agents that monitor and act on your behalf. Your agents are standing by!

View Details → GitHub →
2 skills by video-db
★ 48 Python Codex Skill

Server-side video workflows for agents: ingest, understand, search, edit, stream.

View Details → GitHub →
3 FastGPT by labring
★ 27.6k TypeScript MCP Server

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

View Details → GitHub →
4 firecrawl-mcp-server by firecrawl
★ 5.8k JavaScript MCP Server

🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients.

View Details → GitHub →
5 ruby_llm by crmne
★ 3.8k Ruby Agent Tool

One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, Azure, OpenRouter, DeepSeek, Ollama, VertexAI, Perplexity, Mistral, xAI, GPUStack & OpenAI compatible APIs. Agents, Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming & Rails integration.

View Details → GitHub →
6 excel-mcp-server by haris-musa
★ 3.4k Python MCP Server

A Model Context Protocol server for Excel file manipulation

View Details → GitHub →
7 mcp-proxy by sparfenyuk
★ 2.4k Python MCP Server

A bridge between Streamable HTTP and stdio MCP transports

View Details → GitHub →
8 aide by nicepkg
★ 2.7k TypeScript Agent Tool

Conquer Any Code in VSCode: One-Click Comments, Conversions, UI-to-Code, and AI Batch Processing of Files! 在 VSCode 中征服任何代码:一键注释、转换、UI 图生成代码、AI 批量处理文件!💪

View Details → GitHub →
9 super-agent-party by heshengtao
★ 2.0k JavaScript MCP Server

⭐ All-in-one AI companion! Super Agent Party = Self hosted neuro sama + openclaw! ⭐ 全能AI伴侣!超级智能体派对 = 自托管neuro sama + openclaw!

View Details → GitHub →
10 DemoGPT by melih-unsal
★ 1.9k Python Agent Tool

🤖 Everything you need to create an LLM Agent—tools, prompts, frameworks, and models—all in one place.

View Details → GitHub →

Comparison

Tool Stars Language License Score
huginn ★ 49.0k Ruby MIT 49
skills ★ 48 Python 40
FastGPT ★ 27.6k TypeScript 48
firecrawl-mcp-server ★ 5.8k JavaScript MIT 57
ruby_llm ★ 3.8k Ruby MIT 49
excel-mcp-server ★ 3.4k Python MIT 44
mcp-proxy ★ 2.4k Python MIT 48
aide ★ 2.7k TypeScript MIT 30
super-agent-party ★ 2.0k JavaScript AGPL-3.0 41
DemoGPT ★ 1.9k Python MIT 40

Related Categories

Web Scraping Document Parsing MCP Database Tools

Frequently Asked Questions

What are the best AI tools for data pipeline?

The top data pipeline tools include huginn, skills, FastGPT. These are ranked by our composite score based on GitHub stars, community activity, and code quality.

Are these data pipeline tools free to use?

Most tools listed here are open-source. 8 out of 10 have explicit open-source licenses, making them free to use and modify.

How do I choose the right data pipeline tool?

Consider your tech stack (language compatibility), project scale (stars indicate community trust), and specific features you need. Use the comparison table above to evaluate side by side.

Get Weekly AI Tool Picks

Top 20 fastest-growing AI tools delivered every Monday. Free.

No spam, unsubscribe anytime.

Explore All 25,000+ Skills on Agent Skills Hub