QMath 功能增强 | Gintmr's Home

Work Record

🔎QMath 功能增强

字数 189阅读时长≈ 1 分钟

2025-3-6

password

comment

type

status

date

slug

summary

tags

category

icon

AI custom autofill

remaining_eval.py

remaining_eval.py

apply_RL_prompt.py

long_short_remaining_RL.sh

核心代码以及运行脚本👆

主要改动：

功能

涉及到两个环境变量：tip和stage

tip包括：

Ahead：

将\n<remaining>[{budget} token]</remaining>\n加入进问题的末尾

remaining：

包含Ahead功能，且同时在推理时递归地插入倒计时budget token

prompt-based：

在问题末尾加上基于自然语言的prompt提示，例如You should finish thinking with in {budget} tokens.\n

default：

对问题不做处理

stage包括：

1：一次性输出完成

2：在第一轮输出完成后，在文本末尾处添加</think>\n\n**Final Answer**\n\\boxed

template

Ⅰ增加了deepseek3的提示模板

路径：/data05/wuxinrui/Qwen2.5-Math/evaluation/utils.py

Ⅱ同时在终止符列表中加入了deepseek3模板对应的终止符

路径：/data05/wuxinrui/Qwen2.5-Math/evaluation/remaining_eval.py

两JSON指定键值替换

COCO格式数据集合并脚本

COCO格式数据集合并脚本

作者:Gintmr
链接:https://gintmr.20250130.xyz//article/1aeaf1ce-0c90-80dd-89a7-c857c8b813b2
声明:本文采用 CC BY-NC-SA 4.0 许可协议，转载请注明出处。

相关文章

Lazy loaded image

Lazy loaded image

合并SafeTensors

Lazy loaded image

统计数据集分布-MetaMath

Lazy loaded image

配置免密登录Linux服务器

Lazy loaded image

比对两个文件夹

Lazy loaded image

评论

Loading...

目录

Hello! 👇This is a Bulletin board.

Gintmr

Welcome to My Home🥰

Notion本身还不能当图床~好烦😵‍💫

另外~没有经历过把mv写成rm 的人生是不完整的🥲

to be continued···

目录

最新发布

Lazy loaded image

Lazy loaded image

Lazy loaded image

Lazy loaded image

Lazy loaded image

Lazy loaded image

²²

¹³

¹⁰

⁶

³

³

¹

Number of Articles:

36

Historical Duration:

92 天