by TYH-labs · Claude Skill · ★ 221
Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h
unsloth-buddy /unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air. <img src="https://img.shields.io/badge/Try%20It-1%20minute-black?style=for-the-badge" alt="Tr
| Stars | 221 |
| Forks | 12 |
| Language | Python |
| Category | Claude Skill |
| License | MIT |
| Quality Score | 57.098/100 |
| Last Updated | 2026-04-15 |
| Created | 2026-03-15 |
| Platforms | claude-code, cli, gemini, python |
| Est. Tokens | ~193k |
Looking for a unsloth-buddy alternative? If you're comparing unsloth-buddy with other claude skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.
A framework for agentic tool use training with reinforcement learning
A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a m
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
A private Claude-Code-style coding agent for Apple Silicon — run chat, code, and local model workflows on-devi
OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL,
An open-source RL (DemyAgent & RLAnything) for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-t
Explore other popular claude skill tools:
unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 221 GitHub stars.
unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.
You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 221 stars and 12 forks, indicating an active community.
unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.
The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.