Add script to sync posts

llx-08 · llx-08 · commit 2b44bc99a2d7 · 2026-04-01T11:17:23.000+08:00
diff --git a/hexo-site/package.json b/hexo-site/package.json
@@ -3,7 +3,8 @@
   "version": "0.0.0",
   "private": true,
   "scripts": {
-    "build": "hexo generate",
+    "sync-posts": "python ../scripts/sync_root_to_hexo_posts.py",
+    "build": "npm run sync-posts && hexo generate",
     "clean": "hexo clean",
     "deploy": "hexo deploy",
     "server": "hexo server"
diff --git a/hexo-site/source/_posts/2026-03-31-kl-divergence.md b/hexo-site/source/_posts/2026-03-31-kl-divergence.md
@@ -3,6 +3,7 @@ title: Introduction to KL Divergence and Its Use in PPO
 date: 2026-03-31
 tags: [RL, 数学]
 ---
+
 # Introduction
 KL divergence (Kullback-Leibler Divergence) measures how "similar" two distributions are; using MSE does not give the desired result.
 
@@ -16,18 +17,18 @@ $D_{KL}(P||Q) = \sum_{i=1}^{N}[p(x_i)\log p(x_i) - p(x_i)\log q(x_i)]$
 
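 So whose predicted distribution is closer to the true distribution, Xiao Ming's or Xiao Hong's? KL divergence can measure the similarity between P1 and Q and between P2 and Q, and comparing the two values shows which is closer.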
 
-![KL divergence illustration](/imgs/kl_example.png)
+![KL divergence illustration](imgs/kl_example.png)
 
 $KL1$ is smaller than $KL2$, which means P1 is closer to Q.
 
 # Use in PPO
 To keep the Reward Model from changing the weights too much, a constraint term is added to the loss function; it can be understood as a KL divergence.
 
-![PPO loss illustration](/imgs/ppo_loss.png)
+![PPO loss illustration](imgs/ppo_loss.png)
 
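 Here $\pi^{RL}_{\phi}$ denotes the probability distribution of the model weights after RL, and $\pi^{SFT}$ denotes the probability distribution of the model weights obtained after SFT.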
 
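 If $\gamma$ equals 0, this is the plain PPO update; if $\gamma$ is nonzero, the pretraining loss function is also applied, which keeps the model from drifting too far toward the changes driven by the Reward Model.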
 
 # Ref
-https://zhuanlan.zhihu.com/p/339613080
+https://zhuanlan.zhihu.com/p/339613080
diff --git a/hexo-site/source/_posts/2026-03-31-ppo.md b/hexo-site/source/_posts/2026-03-31-ppo.md
@@ -3,6 +3,7 @@ title: PPO (Proximal Policy Optimization) Study Notes
 date: 2026-03-31
 tags: [RL]
 ---
+
 # PPO (Proximal Policy Optimization) Study Notes
 
 ## 1. On-Policy and Off-Policy
diff --git a/hexo-site/source/_posts/2026-03-31-vllm-gdn-computation.md b/hexo-site/source/_posts/2026-03-31-vllm-gdn-computation.md
@@ -3,6 +3,7 @@ title: GDN (GatedDeltaNet) Computation Flow in vLLM
 date: 2026-03-31
 tags: [vLLM]
 ---
+
 # GDN (GatedDeltaNet) Computation Flow in vLLM
 
 ## 1. Overall Architecture Overview
diff --git a/hexo-site/source/_posts/2026-03-31-vllm-qsa-computation.md b/hexo-site/source/_posts/2026-03-31-vllm-qsa-computation.md
@@ -3,6 +3,7 @@ title: QSA (Query-Side Aggregation) Computation Flow in vLLM
 date: 2026-03-31
 tags: [vLLM]
 ---
+
 # QSA (Query-Side Aggregation) Computation Flow in vLLM
 
 ## 1. QSA Overview
diff --git a/hexo-site/source/_posts/2026-03-31-vllm-quantization-rotate.md b/hexo-site/source/_posts/2026-03-31-vllm-quantization-rotate.md
@@ -3,6 +3,7 @@ title: The Rotate (Rotation Transform) Technique in Quantization and Its vLLM Implementation
 date: 2026-03-31
 tags: [vLLM, 量化]
 ---
+
 # The Rotate (Rotation Transform) Technique in Quantization and Its vLLM Implementation
 
 ## 1. Background: Why Rotation Transforms Are Needed
diff --git a/hexo-site/source/_posts/2026-03-31-vllm-speculative-decoding.md b/hexo-site/source/_posts/2026-03-31-vllm-speculative-decoding.md
@@ -3,6 +3,7 @@ title: vLLM Speculative Decoding (Eagle / MTP): Implementation and Execution Flow
 date: 2026-03-31
 tags: [vLLM]
 ---
+
 # vLLM Speculative Decoding (Eagle / MTP): Implementation and Execution Flow
 
 ## Table of Contents
diff --git a/hexo-site/source/_posts/2026-04-01-turboquant.md b/hexo-site/source/_posts/2026-04-01-turboquant.md
diff --git a/scripts/sync_root_to_hexo_posts.py b/scripts/sync_root_to_hexo_posts.py