InsTag — Agent Tool by OFA-Sys

by OFA-Sys · Agent Tool · ★ 285

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning We introduce a tool named InsTag for analyzing supervised fine-tuning (SFT) data in LLM aligning with human preference. For local tagging deployment, we release InsTagger, fine-tuned on InsTag results, to tag the queries in SFT data. Through the scope of tags, we sample a 6K subset of open-resourced SFT data to fine-tune LLaMA and LLaMA-2 and the fine-tuned models TagLM-13B-v1.0 and TagLM-13B-v2.0 outperform many open-resourced LLMs on MT-Bench. 🤗 InsTagger Checkpoint • 👉 Online LocalTagger Demo • 📖 Paper 🤖️ TagLM-13B-v1.0 Checkpoint 🤖️ TagLM-13B-v2.0 Checkpoint What is InsTag? Foundation language models obtain the instruction-following ability through supervised fine-tuning (SFT). Diversity and complexity are considered critical factors of a successful SFT dataset, while their definitions remain obscure and lack quantitative analyses. In this work, we propose InsTag, an open-set fine-grained tagger, to tag samples within SF

alignmentlarge-language-modelsllamallama2natural-language-processingnlptagging

Quick Facts

Stars285
Forks8
CategoryAgent Tool
Quality Score33.75/100
Open Issues9
Last Updated2023-08-20
Created2023-08-14
Est. Tokens~158k

InsTag alternative? Top 6 similar tools

Looking for a InsTag alternative? If you're comparing InsTag with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • spacy-llm by explosion · ⭐ 1.4k

    🦙 Integrating LLMs into structured NLP pipelines

  • LLM-Finetuning-Toolkit by georgian-io · ⭐ 870

    Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

  • autollm by viddexa · ⭐ 1.0k

    Ship RAG based LLM web apps in seconds.

  • awesome-japanese-nlp-resources by taishi-i · ⭐ 977

    A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese

  • langkit by whylabs · ⭐ 976

    🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from promp

  • ontogpt by monarch-initiative · ⭐ 846

    LLM-based ontological extraction tools, including SPIRES

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Frequently Asked Questions

What is InsTag?

InsTag is InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning. It is categorized as a Agent Tool with 285 GitHub stars.

How do I install or use InsTag?

You can find installation instructions and usage details in the InsTag GitHub repository at github.com/OFA-Sys/InsTag. The project has 285 stars and 8 forks, indicating an active community.

What are the best alternatives to InsTag?

The top alternatives to InsTag on Agent Skills Hub include spacy-llm, LLM-Finetuning-Toolkit, autollm. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools