by datachain-ai · Codex Skill · ★ 2.7k
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
DataChain - Data Context Layer for Object Storage DataChain is a data context layer for object storage. It gives AI agents and pipelines a typed, versioned, queryable view of your files - what exists, what schema it has, what's already been computed - without copying data or loading it into memory. Metadata queries across 100M+ files execute in milliseconds against a backend database Pipelines checkpoint - re-running the same script resumes compute without duplicating expensive LLM-call or ML scoring makes re-runs incremental — only new or changed files are processed Every registers a named, versioned dataset with schema and lineage A generated knowledge base () reflects the operational layer as markdown for agents to read before writing code Works with S3, GCS, Azure, and local filesystems. bash pip
| Stars | 2,745 |
| Forks | 143 |
| Language | Python |
| Category | Codex Skill |
| License | Apache-2.0 |
| Quality Score | 45.73/100 |
| Open Issues | 65 |
| Last Updated | 2026-05-12 |
| Created | 2024-06-25 |
| Platforms | claude-code, codex, python |
| Est. Tokens | ~1083k |
These tools work well together with datachain for enhanced workflows:
Looking for a datachain alternative? If you're comparing datachain with other codex skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
Code Editor for the AI Agents Era - Run an army of Claude Code, Codex, etc. on your machine
Unlimited FREE AI coding. Connect Claude Code, Codex, Cursor, Cline, Copilot, Antigravity to FREE Claude/GPT/G
A personal knowledge base that builds and maintains itself. Drop in sources — Claude (or Codex/Gemini) reads t
Agent Skills-compatible LLM wiki for Claude Code, Cursor, and Codex. Build a Karpathy-style knowledge base fro
Build Claude Code–style deep agents in Python: tool-calling, sandboxed execution, multi-agent teams, skills, c
Show usage stats for OpenAI Codex and Claude Code, without having to login.
Explore other popular codex skill tools:
datachain is Data Memory: the operational data context layer for AI agents - typed, versioned datasets over images, video, docs and tables. It is categorized as a Codex Skill with 2.7k GitHub stars.
datachain is primarily written in Python. It covers topics such as ai-agents, claude-code, codex.
You can find installation instructions and usage details in the datachain GitHub repository at github.com/datachain-ai/datachain. The project has 2.7k stars and 143 forks, indicating an active community.
datachain is released under the Apache-2.0 license, making it free to use and modify according to the license terms.
The top alternatives to datachain on Agent Skills Hub include superset, 9router, llm-wiki-agent. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.