InsTag — Agent Tool by OFA-Sys

Last updated: 2023-08-20 · Indexed by AgentSkillsHub · Auto-synced every 8h

About InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning We introduce a tool named InsTag for analyzing supervised fine-tuning (SFT) data in LLM aligning with human preference. For local tagging deployment, we release InsTagger, fine-tuned on InsTag results, to tag the queries in SFT data. Through the scope of tags, we sample a 6K subset of open-resourced SFT data to fine-tune LLaMA and LLaMA-2 and the fine-tuned models TagLM-13B-v1.0 and TagLM-13B-v2.0 outperform many open-resourced LLMs on MT-Bench. 🤗 InsTagger Checkpoint • 👉 Online LocalTagger Demo • 📖 Paper 🤖️ TagLM-13B-v1.0 Checkpoint 🤖️ TagLM-13B-v2.0 Checkpoint What is InsTag? Foundation language models obtain the instruction-following ability through supervised fine-tuning (SFT). Diversity and complexity are considered critical factors of a successful SFT dataset, while their definitions remain obscure and lack quantitative analyses. In this work, we propose InsTag, an open-set fine-grained tagger, to tag samples within SF

alignment large-language-models llama llama2 natural-language-processing nlp tagging

Quick Facts

Stars	285
Forks	8
Category	Agent Tool
Quality Score	33.75/100
Open Issues	9
Last Updated	2023-08-20
Created	2023-08-14
Est. Tokens	~158k

InsTag alternative? Top 6 similar tools

Looking for a InsTag alternative? If you're comparing InsTag with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

spacy-llm by explosion · ⭐ 1.4k
🦙 Integrating LLMs into structured NLP pipelines
LLM-Finetuning-Toolkit by georgian-io · ⭐ 870
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
autollm by viddexa · ⭐ 1.0k
Ship RAG based LLM web apps in seconds.
awesome-japanese-nlp-resources by taishi-i · ⭐ 977
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
langkit by whylabs · ⭐ 976
🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from promp
ontogpt by monarch-initiative · ⭐ 846
LLM-based ontological extraction tools, including SPIRES

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Frequently Asked Questions

What is InsTag?

InsTag is InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning. It is categorized as a Agent Tool with 285 GitHub stars.

How do I install or use InsTag?

You can find installation instructions and usage details in the InsTag GitHub repository at github.com/OFA-Sys/InsTag. The project has 285 stars and 8 forks, indicating an active community.

What are the best alternatives to InsTag?

The top alternatives to InsTag on Agent Skills Hub include spacy-llm, LLM-Finetuning-Toolkit, autollm. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools