Travel-Agent-based-on-Qwen2-RLHF — Agent Tool by NJUxlj

Last updated: 2026-04-16 · Indexed by AgentSkillsHub · Auto-synced every 8h

About Travel-Agent-based-on-Qwen2-RLHF

基于Qwen2.5+LoRA微调+RLHF+RAG的旅游路径规划智能体注：如果各位运行项目出现问题无法解决的话，可以直接给我提issue，我会抽空fix的。最好附上报错截图。近期修复： 2026-04-15 代码全面审查与修复：共修复26个问题训练器修复：ppo/dpo/grpo/sft初始化、参数误用、语法错误、混合精度 RAG系统：correctiverag实现、rag()方法、导入路径、数据库初始化逻辑安全修复：移除toolexecutor中的eval安全风险代码质量：环境变量配置、循环导入、代码重复、梯度检查点导入路径：修复main.py、rag.py、ragwebdemo.py等处的所有导入错误项目的组成部分本项目希望借助轻量级大模型帮助用户在本地更好地规划旅行路径，提高旅行体验。本项目是基于Qwen2.5+SFT微调+RLHF+RAG的旅游路径规划智能体。项目的整体框架如下： RAG系统辅助工具：工具调用系统 (Google Search + Weather API + Hotel Booking API + Plane Ticket API + Shortest Path API) 自定义的Prompt模板, 继承了 query+context+文档库匹配段落+工具列表+工具格式。它可以让大模型返回用户query命中的工具的函数API字符串. ToolDispatcher: 工具调度器，它可以解析用户query命中的工具的函数API字符串，随后使用ToolExecutor对象调用对应的工具函数。 ToolExecutor: 实际调用各种工具API的执行器。 ChatPDF 传统RAG：基于传统文档匹配+BM25实现的RAG 完全基于Langchain实现的RAG MemWalker [一种RAG方法，可以自己去搜paper]: 记忆树构建过程将长文本分割成适合LLM上下文窗口的小块使用LLM为每个段落生成摘要节点 (summary node / leaf node) 递归地将这些摘要进一步汇总形成高层次摘要节点 (parent node) 最终构建完整的树形记忆结构交互式导航机制从树的根节点开始导航 LLM检查不同文本部分，确定与查询相关的路径, 最后LLM会通过相似度比较，一直走到某个叶节点。最后通过自我纠错能力，判断该叶节点是不是需要的答案，如果不是，则回溯到父节点，继续搜寻下一个叶节点。在遍历过程中维护工作记忆能够从早期导航步骤中的错误中恢复, 通过让模型在叶节点输出 action=-1 / -2 / 0 这样的标识，来决定是否回溯

agent dpo grpo langchain lora ppo qwen2 rag rlhf tool-use

Quick Facts

Stars	69
Forks	6
Language	Python
Category	Agent Tool
Quality Score	35.75/100
Open Issues	5
Last Updated	2026-04-16
Created	2024-04-19
Platforms	python
Est. Tokens	~9840k

Compatible Skills

These tools work well together with Travel-Agent-based-on-Qwen2-RLHF for enhanced workflows:

unsloth-buddy — semantic(0.30)+complementary+rare_topics+same_lang+similar_pop+shared_platform (76%)
llm-agents-interview — semantic(0.23)+complementary+shared_fw(langchain)+same_lang+similar_pop+shared_platform (61%)
Open-AgentRL — semantic(0.31)+rare_topics+same_lang+similar_pop+shared_platform (54%)
AnyTool — semantic(0.24)+complementary+same_lang+similar_pop+shared_platform (53%)

Travel-Agent-based-on-Qwen2-RLHF alternative? Top 6 similar tools

Looking for a Travel-Agent-based-on-Qwen2-RLHF alternative? If you're comparing Travel-Agent-based-on-Qwen2-RLHF with other agent tool tools, these 6 projects are the closest alternatives on Agent Skills Hub — ranked by topic overlap, star count, and community traction.

unsloth-buddy by TYH-labs · ⭐ 235
Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/M
ToolBrain by ToolBrain · ⭐ 165
A framework for agentic tool use training with reinforcement learning
agent_learning by Haozhe-Xing · ⭐ 248
A systematic AI Agent development tutorial covering LLM agents, RAG, tool use, memory systems, multi-agent sys
oreilly-ai-agents by sinanuozdemir · ⭐ 269
An introduction to the world of AI Agents
weam by weam-ai · ⭐ 216
Web app for teams of 20+ members. In-built connections to major LLMs via API. Share chats, prompts, and agents
Autono by vortezwohl · ⭐ 209
A ReAct-Based Highly Robust Autonomous Agent Framework.

More Agent Tool Tools

Explore other popular agent tool tools:

View all Agent Tool tools →

Popular Python Agent Tools

TrendRadar ⭐ 59.7k · MCP Server
gpt-researcher ⭐ 27.4k · MCP Server
Scrapling ⭐ 64.6k · MCP Server
serena ⭐ 25.5k · MCP Server
MaxKB ⭐ 21.4k · MCP Server

Frequently Asked Questions

What is Travel-Agent-based-on-Qwen2-RLHF?

Travel-Agent-based-on-Qwen2-RLHF is A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using. It is categorized as a Agent Tool with 69 GitHub stars.

What programming language is Travel-Agent-based-on-Qwen2-RLHF written in?

Travel-Agent-based-on-Qwen2-RLHF is primarily written in Python. It covers topics such as agent, dpo, grpo.

How do I install or use Travel-Agent-based-on-Qwen2-RLHF?

You can find installation instructions and usage details in the Travel-Agent-based-on-Qwen2-RLHF GitHub repository at github.com/NJUxlj/Travel-Agent-based-on-Qwen2-RLHF. The project has 69 stars and 6 forks, indicating an active community.

What are the best alternatives to Travel-Agent-based-on-Qwen2-RLHF?

The top alternatives to Travel-Agent-based-on-Qwen2-RLHF on Agent Skills Hub include unsloth-buddy, ToolBrain, agent_learning. Each offers a different approach to the same problem space — compare them side-by-side by stars, quality score, and community activity.

View on GitHub → Browse Agent Tool tools