by TYH-labs · Claude Skill · ★ 235
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
unsloth-buddy /unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air. <img src="https://img.shields.io/badge/Try%20It-1%20minute-black?style=for-the-badge" alt="Tr
| Stars | 235 |
| Forks | 13 |
| Language | Python |
| Category | Claude Skill |
| License | MIT |
| Quality Score | 57.098/100 |
| Last Updated | 2026-05-07 |
| Created | 2026-03-15 |
| Platforms | claude-code, cli, gemini, python |
| Est. Tokens | ~199k |
Looking for a unsloth-buddy alternative? If you're comparing unsloth-buddy with other claude skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
A framework for agentic tool use training with reinforcement learning
A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a m
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Bilingual (中文+EN) ML / LLM / diffusion / agent interview cheat sheets for AI 秋招 — generated by ARIS /interview
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
Unified LLM gateway with weighted load balancing, observability & cost tracking. 统一的 LLM 网关,提供权重负载均衡、可观测性与费用追踪
Explore other popular claude skill tools:
unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 235 GitHub stars.
unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.
You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 235 stars and 13 forks, indicating an active community.
unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.
The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.