unsloth-buddy — Claude Skill by TYH-labs

Q: What is unsloth-buddy?

unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 235 GitHub stars.

Q: What programming language is unsloth-buddy written in?

unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.

Q: How do I install or use unsloth-buddy?

You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 235 stars and 13 forks, indicating an active community.

Q: What license does unsloth-buddy use?

unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.

Q: What are the best alternatives to unsloth-buddy?

The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

by TYH-labs · Claude Skill · ★ 235

Last updated: 2026-05-07 · Indexed by AgentSkillsHub · Auto-synced every 8h

About unsloth-buddy

unsloth-buddy /unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air. <img src="https://img.shields.io/badge/Try%20It-1%20minute-black?style=for-the-badge" alt="Tr

apple-silicon claude-code dpo fine-tuning gaslamp grpo huggingface lora qlora rlhf

Quick Facts

Stars	235
Forks	13
Language	Python
Category	Claude Skill
License	MIT
Quality Score	57.098/100
Last Updated	2026-05-07
Created	2026-03-15
Platforms	claude-code, cli, gemini, python
Est. Tokens	~199k

unsloth-buddy alternative? Top 6 similar tools

Looking for a unsloth-buddy alternative? If you're comparing unsloth-buddy with other claude skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

ToolBrain by ToolBrain · ⭐ 165
A framework for agentic tool use training with reinforcement learning
Travel-Agent-based-on-Qwen2-RLHF by NJUxlj · ⭐ 69
A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a m
LLM-Finetuning-Toolkit by georgian-io · ⭐ 870
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
ARIS-in-AI-Offer by wanshuiyin · ⭐ 222
Bilingual (中文+EN) ML / LLM / diffusion / agent interview cheat sheets for AI 秋招 — generated by ARIS /interview
Open-AgentRL by Gen-Verse · ⭐ 545
RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios
llmio by atopos31 · ⭐ 307
Unified LLM gateway with weighted load balancing, observability & cost tracking. 统一的 LLM 网关，提供权重负载均衡、可观测性与费用追踪

More Claude Skill Tools

Explore other popular claude skill tools:

claude-codex-settings ⭐ 707
claude-seo ⭐ 8.8k
life-sciences ⭐ 345
claude-ai-mcp ⭐ 322
pilot-shell ⭐ 1.8k
claude-plugins-official ⭐ 30.5k
skyll ⭐ 222
digital-marketing-pro ⭐ 147
awesome-claude-code ⭐ 41.5k
planning-with-files ⭐ 23.4k

View all Claude Skill tools →

Popular Python Agent Tools

TrendRadar ⭐ 59.7k · MCP Server
gpt-researcher ⭐ 27.4k · MCP Server
Scrapling ⭐ 64.6k · MCP Server
serena ⭐ 25.5k · MCP Server
MaxKB ⭐ 21.4k · MCP Server

Frequently Asked Questions

What is unsloth-buddy?

unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 235 GitHub stars.

What programming language is unsloth-buddy written in?

unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.

How do I install or use unsloth-buddy?

You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 235 stars and 13 forks, indicating an active community.

What license does unsloth-buddy use?

unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to unsloth-buddy?

The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Claude Skill tools