unsloth-buddy — Claude Skill by TYH-labs

by TYH-labs · Claude Skill · ★ 221

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About unsloth-buddy

unsloth-buddy /unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air. <img src="https://img.shields.io/badge/Try%20It-1%20minute-black?style=for-the-badge" alt="Tr

apple-siliconclaude-codedpofine-tuninggaslampgrpohuggingfaceloraqlorarlhf

Quick Facts

Stars221
Forks12
LanguagePython
CategoryClaude Skill
LicenseMIT
Quality Score57.098/100
Last Updated2026-04-15
Created2026-03-15
Platformsclaude-code, cli, gemini, python
Est. Tokens~193k

unsloth-buddy alternative? Top 6 similar tools

Looking for a unsloth-buddy alternative? If you're comparing unsloth-buddy with other claude skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • ToolBrain by ToolBrain · ⭐ 165

    A framework for agentic tool use training with reinforcement learning

  • Travel-Agent-based-on-Qwen2-RLHF by NJUxlj · ⭐ 69

    A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a m

  • LLM-Finetuning-Toolkit by georgian-io · ⭐ 870

    Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

  • ovo-local-llm by ovoment · ⭐ 95

    A private Claude-Code-style coding agent for Apple Silicon — run chat, code, and local model workflows on-devi

  • vllm-mlx by waybarrios · ⭐ 1.1k

    OpenAI and Anthropic compatible server for Apple Silicon. Run LLMs and vision-language models (Llama, Qwen-VL,

  • Open-AgentRL by Gen-Verse · ⭐ 331

    An open-source RL (DemyAgent & RLAnything) for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-t

More Claude Skill Tools

Explore other popular claude skill tools:

View all Claude Skill tools →

Popular Python Agent Tools

Frequently Asked Questions

What is unsloth-buddy?

unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 221 GitHub stars.

What programming language is unsloth-buddy written in?

unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.

How do I install or use unsloth-buddy?

You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 221 stars and 12 forks, indicating an active community.

What license does unsloth-buddy use?

unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to unsloth-buddy?

The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Claude Skill tools