unsloth-buddy — Claude Skill by TYH-labs

by TYH-labs · Claude Skill · ★ 235

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About unsloth-buddy

unsloth-buddy /unsloth-buddy I have 500 customer support Q&As and want to fine-tune a summarization model. I only have a MacBook Air. <img src="https://img.shields.io/badge/Try%20It-1%20minute-black?style=for-the-badge" alt="Tr

apple-siliconclaude-codedpofine-tuninggaslampgrpohuggingfaceloraqlorarlhf

Quick Facts

Stars235
Forks13
LanguagePython
CategoryClaude Skill
LicenseMIT
Quality Score57.098/100
Last Updated2026-05-07
Created2026-03-15
Platformsclaude-code, cli, gemini, python
Est. Tokens~199k

unsloth-buddy alternative? Top 6 similar tools

Looking for a unsloth-buddy alternative? If you're comparing unsloth-buddy with other claude skill tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • ToolBrain by ToolBrain · ⭐ 165

    A framework for agentic tool use training with reinforcement learning

  • Travel-Agent-based-on-Qwen2-RLHF by NJUxlj · ⭐ 69

    A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a m

  • LLM-Finetuning-Toolkit by georgian-io · ⭐ 870

    Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

  • ARIS-in-AI-Offer by wanshuiyin · ⭐ 222

    Bilingual (中文+EN) ML / LLM / diffusion / agent interview cheat sheets for AI 秋招 — generated by ARIS /interview

  • Open-AgentRL by Gen-Verse · ⭐ 545

    RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios

  • llmio by atopos31 · ⭐ 307

    Unified LLM gateway with weighted load balancing, observability & cost tracking. 统一的 LLM 网关,提供权重负载均衡、可观测性与费用追踪

More Claude Skill Tools

Explore other popular claude skill tools:

View all Claude Skill tools →

Popular Python Agent Tools

Frequently Asked Questions

What is unsloth-buddy?

unsloth-buddy is Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc. It is categorized as a Claude Skill with 235 GitHub stars.

What programming language is unsloth-buddy written in?

unsloth-buddy is primarily written in Python. It covers topics such as apple-silicon, claude-code, dpo.

How do I install or use unsloth-buddy?

You can find installation instructions and usage details in the unsloth-buddy GitHub repository at github.com/TYH-labs/unsloth-buddy. The project has 235 stars and 13 forks, indicating an active community.

What license does unsloth-buddy use?

unsloth-buddy is released under the MIT license, making it free to use and modify according to the license terms.

What are the best alternatives to unsloth-buddy?

The top alternatives to unsloth-buddy on Agent Skills Hub include ToolBrain, Travel-Agent-based-on-Qwen2-RLHF, LLM-Finetuning-Toolkit. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Claude Skill tools