Travel-Agent-based-on-Qwen2-RLHF — Agent Tool by NJUxlj

by NJUxlj · Agent Tool · ★ 69

Last updated: · Indexed by AgentSkillsHub · Auto-synced every 8h

About Travel-Agent-based-on-Qwen2-RLHF

基于Qwen2.5+LoRA微调+RLHF+RAG的旅游路径规划智能体 注: 如果各位运行项目出现问题无法解决的话,可以直接给我提issue,我会抽空fix的。 最好附上报错截图。 近期修复: 2026-04-15 代码全面审查与修复:共修复26个问题 训练器修复:ppo/dpo/grpo/sft初始化、参数误用、语法错误、混合精度 RAG系统:correctiverag实现、rag()方法、导入路径、数据库初始化逻辑 安全修复:移除toolexecutor中的eval安全风险 代码质量:环境变量配置、循环导入、代码重复、梯度检查点 导入路径:修复main.py、rag.py、ragwebdemo.py等处的所有导入错误 项目的组成部分 本项目希望借助轻量级大模型帮助用户在本地更好地规划旅行路径,提高旅行体验。本项目是基于Qwen2.5+SFT微调+RLHF+RAG的旅游路径规划智能体。 项目的整体框架如下: RAG系统 辅助工具: 工具调用系统 (Google Search + Weather API + Hotel Booking API + Plane Ticket API + Shortest Path API) 自定义的Prompt模板, 继承了 query+context+文档库匹配段落+工具列表+工具格式。它可以让大模型返回用户query命中的工具的函数API字符串. ToolDispatcher: 工具调度器,它可以解析用户query命中的工具的函数API字符串,随后使用ToolExecutor对象调用对应的工具函数。 ToolExecutor: 实际调用各种工具API的执行器。 ChatPDF 传统RAG: 基于传统文档匹配+BM25实现的RAG 完全基于Langchain实现的RAG MemWalker [一种RAG方法,可以自己去搜paper]: 记忆树构建过程 将长文本分割成适合LLM上下文窗口的小块 使用LLM为每个段落生成摘要节点 (summary node / leaf node) 递归地将这些摘要进一步汇总形成高层次摘要节点 (parent node) 最终构建完整的树形记忆结构 交互式导航机制 从树的根节点开始导航 LLM检查不同文本部分,确定与查询相关的路径, 最后LLM会通过相似度比较,一直走到某个叶节点。 最后通过自我纠错能力,判断该叶节点是不是需要的答案,如果不是,则回溯到父节点,继续搜寻下一个叶节点。 在遍历过程中维护工作记忆 能够从早期导航步骤中的错误中恢复, 通过让模型在叶节点输出 action=-1 / -2 / 0 这样的标识,来决定是否回溯

agentdpogrpolangchainlorappoqwen2ragrlhftool-use

Quick Facts

Stars69
Forks6
LanguagePython
CategoryAgent Tool
Quality Score35.75/100
Open Issues5
Last Updated2026-04-16
Created2024-04-19
Platformspython
Est. Tokens~9840k

Compatible Skills

These tools work well together with Travel-Agent-based-on-Qwen2-RLHF for enhanced workflows:

  • unsloth-buddy — semantic(0.30)+complementary+rare_topics+same_lang+similar_pop+shared_platform (76%)
  • llm-agents-interview — semantic(0.23)+complementary+shared_fw(langchain)+same_lang+similar_pop+shared_platform (61%)
  • Open-AgentRL — semantic(0.31)+rare_topics+same_lang+similar_pop+shared_platform (54%)
  • AnyTool — semantic(0.24)+complementary+same_lang+similar_pop+shared_platform (53%)

Travel-Agent-based-on-Qwen2-RLHF alternative? Top 6 similar tools

Looking for a Travel-Agent-based-on-Qwen2-RLHF alternative? If you're comparing Travel-Agent-based-on-Qwen2-RLHF with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

  • unsloth-buddy by TYH-labs · ⭐ 235

    Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/M

  • ToolBrain by ToolBrain · ⭐ 165

    A framework for agentic tool use training with reinforcement learning

  • agent_learning by Haozhe-Xing · ⭐ 248

    A systematic AI Agent development tutorial covering LLM agents, RAG, tool use, memory systems, multi-agent sys

  • oreilly-ai-agents by sinanuozdemir · ⭐ 269

    An introduction to the world of AI Agents

  • weam by weam-ai · ⭐ 216

    Web app for teams of 20+ members. In-built connections to major LLMs via API. Share chats, prompts, and agents

  • Autono by vortezwohl · ⭐ 209

    A ReAct-Based Highly Robust Autonomous Agent Framework.

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

Frequently Asked Questions

What is Travel-Agent-based-on-Qwen2-RLHF?

Travel-Agent-based-on-Qwen2-RLHF is A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using. It is categorized as a Agent Tool with 69 GitHub stars.

What programming language is Travel-Agent-based-on-Qwen2-RLHF written in?

Travel-Agent-based-on-Qwen2-RLHF is primarily written in Python. It covers topics such as agent, dpo, grpo.

How do I install or use Travel-Agent-based-on-Qwen2-RLHF?

You can find installation instructions and usage details in the Travel-Agent-based-on-Qwen2-RLHF GitHub repository at github.com/NJUxlj/Travel-Agent-based-on-Qwen2-RLHF. The project has 69 stars and 6 forks, indicating an active community.

What are the best alternatives to Travel-Agent-based-on-Qwen2-RLHF?

The top alternatives to Travel-Agent-based-on-Qwen2-RLHF on Agent Skills Hub include unsloth-buddy, ToolBrain, agent_learning. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools