功能: 重写小红书 Skills，完整迁移为 CDP Python 实现 (#1)

## 主要变更 ### 核心模块重写 - 创建 scripts/xhs/ 包，包含 18 个专业模块（3728 行代码） - 基于 xiaohongshu-mcp Go 源码完整实现 - CDP WebSocket 直接通信，替代第三方库依赖 ### 模块清单 - cdp.py: Browser/Page/Element 类，完整 CDP 协议实现 - stealth.py: 反检测 JS 注入 + Chrome 启动参数 - login.py: 登录检查与二维码登录（QR 码保存到临时文件供 Agent 显示） - publish.py: 图文发布完整流程 - publish_video.py: 视频发布完整流程 - search.py: 搜索与内容筛选 - feed_detail.py: 笔记详情与评论加载 - comment.py: 评论与回复 - like_favorite.py: 点赞与收藏 - user_profile.py: 用户主页 - cookies.py: Cookie 持久化 - types.py: 完整的 dataclass 数据类型系统 - errors.py: 自定义异常体系 - human.py: 人类行为模拟（延迟、滚动） - selectors.py: CSS 选择器常量 - urls.py: URL 构建函数 ### CLI 统一接口 - scripts/cli.py: 13 个子命令，完全兼容 xiaohongshu-mcp MCP 工具 - check-login: 检查登录状态 - login: 获取登录二维码 - switch-account/delete-cookies: 账号切换 - publish-content: 图文发布 - publish-with-video: 视频发布 - list-feeds: Feed 列表 - search-feeds: Feed 搜索 - get-feed-detail: 笔记详情 - user-profile: 用户主页 - post-comment: 发送评论 - like-feed: 点赞笔记 - favorite-feed: 收藏笔记 ### 支持脚本重写 - chrome_launcher.py: Chrome 进程管理（跨平台） - account_manager.py: 多账号 Profile 隔离 - image_downloader.py: 图片/视频下载（SHA256 缓存） - title_utils.py: UTF-16 标题长度计算 - run_lock.py: 单实例锁机制 - publish_pipeline.py: 发布流程编排 CLI ### 文档与配置 - SKILL.md: 统一技能入口（路由到 5 个子技能） - skills/xhs-auth/SKILL.md: 认证管理技能 - skills/xhs-publish/SKILL.md: 内容发布技能（图文+视频） - skills/xhs-explore/SKILL.md: 内容发现与分析技能 - skills/xhs-interact/SKILL.md: 社交互动技能（评论/点赞/收藏） - skills/xhs-content-ops/SKILL.md: 复合内容运营工作流技能 - CLAUDE.md: 项目开发指南 - PROMPT.md: Ralph Loop 驱动文件 - pyproject.toml: uv 项目配置（uv.lock） - README.md: 完整项目文档 ### 技术栈 - Python 3.11+ with uv 包管理 - requests + websockets: CDP WebSocket 通信 - 代码规范: ruff lint + format ## 对应关系所有 13 个子命令与 xiaohongshu-mcp MCP 工具完全对应支持 OpenClaw agent 框架直接调用 ## 前置工作 - 创建 scripts/xhs/ 包架构 - 实现 CDP WebSocket 协议 - 完整的类型系统和错误处理 - CLI 子命令系统 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

功能: 重写小红书 Skills，完整迁移为 CDP Python 实现 (#1)
## 主要变更 ### 核心模块重写 - 创建 scripts/xhs/ 包，包含 18 个专业模块（3728 行代码） - 基于 xiaohongshu-mcp Go 源码完整实现 - CDP WebSocket 直接通信，替代第三方库依赖 ### 模块清单 - cdp.py: Browser/Page/Element 类，完整 CDP 协议实现 - stealth.py: 反检测 JS 注入 + Chrome 启动参数 - login.py: 登录检查与二维码登录（QR 码保存到临时文件供 Agent 显示） - publish.py: 图文发布完整流程 - publish_video.py: 视频发布完整流程 - search.py: 搜索与内容筛选 - feed_detail.py: 笔记详情与评论加载 - comment.py: 评论与回复 - like_favorite.py: 点赞与收藏 - user_profile.py: 用户主页 - cookies.py: Cookie 持久化 - types.py: 完整的 dataclass 数据类型系统 - errors.py: 自定义异常体系 - human.py: 人类行为模拟（延迟、滚动） - selectors.py: CSS 选择器常量 - urls.py: URL 构建函数 ### CLI 统一接口 - scripts/cli.py: 13 个子命令，完全兼容 xiaohongshu-mcp MCP 工具 - check-login: 检查登录状态 - login: 获取登录二维码 - switch-account/delete-cookies: 账号切换 - publish-content: 图文发布 - publish-with-video: 视频发布 - list-feeds: Feed 列表 - search-feeds: Feed 搜索 - get-feed-detail: 笔记详情 - user-profile: 用户主页 - post-comment: 发送评论 - like-feed: 点赞笔记 - favorite-feed: 收藏笔记 ### 支持脚本重写 - chrome_launcher.py: Chrome 进程管理（跨平台） - account_manager.py: 多账号 Profile 隔离 - image_downloader.py: 图片/视频下载（SHA256 缓存） - title_utils.py: UTF-16 标题长度计算 - run_lock.py: 单实例锁机制 - publish_pipeline.py: 发布流程编排 CLI ### 文档与配置 - SKILL.md: 统一技能入口（路由到 5 个子技能） - skills/xhs-auth/SKILL.md: 认证管理技能 - skills/xhs-publish/SKILL.md: 内容发布技能（图文+视频） - skills/xhs-explore/SKILL.md: 内容发现与分析技能 - skills/xhs-interact/SKILL.md: 社交互动技能（评论/点赞/收藏） - skills/xhs-content-ops/SKILL.md: 复合内容运营工作流技能 - CLAUDE.md: 项目开发指南 - PROMPT.md: Ralph Loop 驱动文件 - pyproject.toml: uv 项目配置（uv.lock） - README.md: 完整项目文档 ### 技术栈 - Python 3.11+ with uv 包管理 - requests + websockets: CDP WebSocket 通信 - 代码规范: ruff lint + format ## 对应关系所有 13 个子命令与 xiaohongshu-mcp MCP 工具完全对应支持 OpenClaw agent 框架直接调用 ## 前置工作 - 创建 scripts/xhs/ 包架构 - 实现 CDP WebSocket 协议 - 完整的类型系统和错误处理 - CLI 子命令系统 Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
zy · GitHub
Commit b8ec00ae52acbadfab2b14bc7e1402368773231b b8ec00ae 1 parent ee0fdf50
Showing 30 changed files with 4888 additions and 1 deletions
.gitignore
CLAUDE.md
PROMPT.md
README.md
SKILL.md
pyproject.toml
scripts/account_manager.py
scripts/chrome_launcher.py
scripts/cli.py
scripts/image_downloader.py
scripts/publish_pipeline.py
scripts/run_lock.py
scripts/title_utils.py
scripts/xhs/__init__.py
scripts/xhs/cdp.py
scripts/xhs/comment.py
scripts/xhs/cookies.py
scripts/xhs/errors.py
scripts/xhs/feed_detail.py
scripts/xhs/feeds.py
--- a/.gitignore
View file @b8ec00a
+++ b/.gitignore
View file @b8ec00a
@@ -205,3 +205,15 @@ cython_debug/
 marimo/_static/
 marimo/_lsp/
 __marimo__/
+
+# Project specific
+tmp/
+*.txt
+!requirements.txt
+config/accounts.json
+title.txt
+content.txt
+comment.txt
+
+# Ralph Loop state
+.claude/.ralph-loop.local.md
--- a/CLAUDE.md 0 → 100644
View file @b8ec00a
+++ b/CLAUDE.md 0 → 100644
View file @b8ec00a
+# xiaohongshu-skills
+
+小红书自动化 Claude Code Skills，基于 Python CDP 浏览器自动化引擎。
+为 OpenClaw 生态提供小红书操作能力，同时支持 Claude Code skills 格式。
+
+## 项目结构
+
+```
+xiaohongshu-skills/
+├── scripts/                        # Python CDP 自动化引擎
+│   ├── xhs/                        # 核心 XHS 自动化包
+│   │   ├── __init__.py
+│   │   ├── cdp.py                  # CDP WebSocket 客户端（Browser, Page, Element）
+│   │   ├── stealth.py              # 反检测 JS 注入 + Chrome 启动参数
+│   │   ├── cookies.py              # Cookie 文件持久化
+│   │   ├── types.py                # 数据类型（dataclass）
+│   │   ├── errors.py               # 异常体系
+│   │   ├── selectors.py            # CSS 选择器常量
+│   │   ├── urls.py                 # URL 常量和构建函数
+│   │   ├── human.py                # 人类行为模拟（延迟、滚动）
+│   │   ├── login.py                # 登录检查、二维码登录
+│   │   ├── feeds.py                # 首页 Feed 列表
+│   │   ├── search.py               # 搜索 + 筛选
+│   │   ├── feed_detail.py          # 笔记详情 + 评论加载
+│   │   ├── user_profile.py         # 用户主页
+│   │   ├── comment.py              # 评论、回复
+│   │   ├── like_favorite.py        # 点赞、收藏
+│   │   ├── publish.py              # 图文发布
+│   │   └── publish_video.py        # 视频发布
+│   ├── cli.py                      # 统一 CLI 入口（13 个子命令）
+│   ├── chrome_launcher.py          # Chrome 进程管理
+│   ├── account_manager.py          # 多账号管理
+│   ├── image_downloader.py         # 媒体下载（SHA256 缓存）
+│   ├── title_utils.py              # UTF-16 标题长度计算
+│   ├── run_lock.py                 # 单实例锁
+│   └── publish_pipeline.py         # 发布编排器
+├── skills/                         # Claude Code Skills 定义
+│   ├── xhs-auth/SKILL.md           # 认证管理
+│   ├── xhs-publish/SKILL.md        # 内容发布（图文+视频）
+│   ├── xhs-explore/SKILL.md        # 内容发现与分析
+│   ├── xhs-interact/SKILL.md       # 社交互动（评论/点赞/收藏）
+│   └── xhs-content-ops/SKILL.md    # 复合内容运营工作流
+├── pyproject.toml                  # uv 项目配置
+├── SKILL.md                        # 统一入口（路由到子技能）
+├── CLAUDE.md                       # 本文件
+├── PROMPT.md                       # Ralph Loop 驱动文件
+└── README.md
+```
+
+## 技术栈
+
+- **Python**: >=3.11
+- **包管理**: uv
+- **依赖**: requests + websockets（直接 CDP WebSocket 通信）
+- **浏览器**: Chrome（通过 CDP 远程调试协议控制）
+- **代码规范**: ruff（lint + format）
+- **数据提取**: `window.__INITIAL_STATE__`（与 Go 源码一致）
+
+## 开发命令
+
+```bash
+uv sync                    # 安装依赖
+uv run ruff check .        # Lint 检查
+uv run ruff format .       # 代码格式化
+uv run pytest              # 运行测试
+```
+
+## 架构设计
+
+### 双层结构
+
+1. **scripts/ — Python CDP 引擎**
+   - 基于 xiaohongshu-mcp Go 源码从零重写
+   - `xhs/` 包：模块化的核心自动化库
+   - `cli.py`：统一 CLI 入口，13 个子命令对应 MCP 工具
+   - JSON 结构化输出，便于 agent 解析
+   - 多账号支持，独立 Chrome Profile 隔离
+   - 反检测保护（stealth flags + JS 注入）
+
+2. **skills/ — Claude Code Skills 定义**
+   - SKILL.md 格式，指导 Claude 如何调用 scripts/
+   - 包含输入判断、约束规则、工作流程、失败处理
+
+### 调用方式
+
+```bash
+# 统一 CLI 入口
+python scripts/cli.py check-login
+python scripts/cli.py search-feeds --keyword "关键词"
+python scripts/cli.py publish --title-file t.txt --content-file c.txt --images pic.jpg
+
+# 发布流水线（含图片下载和登录检查）
+python scripts/publish_pipeline.py --title-file t.txt --content-file c.txt --images URL1
+```
+
+## 代码规范
+
+### Python 风格
+- 遵循 PEP 8，使用 ruff 强制执行
+- 完整的 type hints（PEP 484），使用 `str | None` 语法
+- 公共函数和类必须有 docstring
+- 行长度上限 100 字符
+- 使用 `from __future__ import annotations` 启用延迟注解
+
+### 命名约定
+- 文件名：snake_case
+- 类名：PascalCase
+- 函数/变量：snake_case
+- 常量：UPPER_SNAKE_CASE
+
+### 错误处理
+- 自定义异常类继承自 `XHSError` 基类（`xhs/errors.py`）
+- CLI 命令使用结构化 exit code：0=成功，1=未登录，2=错误
+- 所有用户可见的错误信息使用中文
+
+### 安全约束
+- 发布类操作必须有用户确认机制
+- 文件路径必须使用绝对路径
+- 不在命令行参数中内联敏感内容（使用文件传递）
+- Chrome Profile 目录隔离账号 cookies
+
+## 参考资源
+
+- **xiaohongshu-mcp Go 源码**: /Users/zy/src/zy/xiaohongshu-mcp/
+
+## MCP 工具对照表
+
+scripts/cli.py 的 13 个子命令对应 xiaohongshu-mcp 的 MCP 工具：
+
+| CLI 子命令 | MCP 工具 | 分类 |
+|--|--|--|
+| `check-login` | check_login_status | 认证 |
+| `login` | get_login_qrcode | 认证 |
+| `delete-cookies` | delete_cookies | 认证 |
+| `list-feeds` | list_feeds | 浏览 |
+| `search-feeds` | search_feeds | 浏览 |
+| `get-feed-detail` | get_feed_detail | 浏览 |
+| `user-profile` | user_profile | 浏览 |
+| `post-comment` | post_comment_to_feed | 互动 |
+| `reply-comment` | reply_comment_in_feed | 互动 |
+| `like-feed` | like_feed | 互动 |
+| `favorite-feed` | favorite_feed | 互动 |
+| `publish` | publish_content | 发布 |
+| `publish-video` | publish_with_video | 发布 |
--- a/PROMPT.md 0 → 100644
View file @b8ec00a
+++ b/PROMPT.md 0 → 100644
View file @b8ec00a
+# 小红书 Skills 开发任务
+
+## 目标
+
+基于 xiaohongshu-mcp Go 源码，从零重写 Python CDP 引擎，为 OpenClaw 生态构建完整的小红书自动化 Skills。
+
+## 参考资料
+
+- **xiaohongshu-mcp Go 源码**: `/Users/zy/src/zy/xiaohongshu-mcp/` — 10k stars，13 个 MCP 工具
+- **xiaohongshu-mcp 数据结构**: `/Users/zy/src/zy/xiaohongshu-mcp/xiaohongshu/types.go`
+- **xiaohongshu-mcp 工具定义**: `/Users/zy/src/zy/xiaohongshu-mcp/mcp_server.go`
+
+## 架构
+
+### 模块结构
+
+```
+scripts/
+├── xhs/                        # 核心 XHS 自动化包
+│   ├── cdp.py                  # CDP WebSocket 客户端
+│   ├── stealth.py              # 反检测 JS 注入 + Chrome 启动参数
+│   ├── cookies.py              # Cookie 文件持久化
+│   ├── types.py                # 数据类型（dataclass）
+│   ├── errors.py               # 异常体系
+│   ├── selectors.py            # CSS 选择器常量
+│   ├── urls.py                 # URL 常量
+│   ├── human.py                # 人类行为模拟
+│   ├── login.py                # 登录
+│   ├── feeds.py                # 首页 Feed
+│   ├── search.py               # 搜索 + 筛选
+│   ├── feed_detail.py          # 笔记详情 + 评论加载
+│   ├── user_profile.py         # 用户主页
+│   ├── comment.py              # 评论、回复
+│   ├── like_favorite.py        # 点赞、收藏
+│   ├── publish.py              # 图文发布
+│   └── publish_video.py        # 视频发布
+├── cli.py                      # 统一 CLI 入口（13 个子命令）
+├── chrome_launcher.py          # Chrome 进程管理
+├── account_manager.py          # 多账号管理
+├── image_downloader.py         # 媒体下载（SHA256 缓存）
+├── title_utils.py              # UTF-16 标题长度计算
+├── run_lock.py                 # 单实例锁
+└── publish_pipeline.py         # 发布编排器
+```
+
+### CLI 接口（对应 Go 的 13 个 MCP 工具）
+
+```bash
+python scripts/cli.py check-login
+python scripts/cli.py login
+python scripts/cli.py delete-cookies
+python scripts/cli.py list-feeds
+python scripts/cli.py search-feeds --keyword "关键词" [--sort-by --note-type ...]
+python scripts/cli.py get-feed-detail --feed-id ID --xsec-token TOKEN [--load-all-comments]
+python scripts/cli.py user-profile --user-id ID --xsec-token TOKEN
+python scripts/cli.py post-comment --feed-id ID --xsec-token TOKEN --content "内容"
+python scripts/cli.py reply-comment --feed-id ID --xsec-token TOKEN --content "内容" [--comment-id | --user-id]
+python scripts/cli.py like-feed --feed-id ID --xsec-token TOKEN [--unlike]
+python scripts/cli.py favorite-feed --feed-id ID --xsec-token TOKEN [--unfavorite]
+python scripts/cli.py publish --title-file T --content-file C --images P1 P2 [--tags --schedule-at --visibility]
+python scripts/cli.py publish-video --title-file T --content-file C --video P [--tags --schedule-at]
+```
+
+全局选项：`--host`, `--port`, `--account`
+输出：JSON（`ensure_ascii=False`）
+退出码：0=成功，1=未登录，2=错误
+
+## 代码规范要求
+
+- Python 代码必须通过 `ruff check` 和 `ruff format`
+- 完整的 type hints（PEP 484），使用 `str | None` 而非 `Optional[str]`
+- 公共函数和类必须有 docstring
+- 行长度上限 100 字符
+- 使用 `from __future__ import annotations` 启用延迟注解
+- 异常类统一继承自 `XHSError`
+- CLI 使用 argparse，exit code: 0=成功，1=未登录，2=错误
+- JSON 输出使用 `ensure_ascii=False` 保留中文
+
+## 完成标志
+
+当以下条件全部满足时，输出完成标志：
+1. `xhs/` 包 17 个模块已全部创建
+2. `cli.py` 13 个子命令已实现
+3. 5 个支撑脚本已重写
+4. 5 个 `skills/*/SKILL.md` 已更新
+5. 根目录 `SKILL.md`、`CLAUDE.md`、`README.md` 已更新
+6. `uv run ruff check .` 无错误
+7. `uv run ruff format --check .` 无差异
+
+<promise>ALL SKILLS COMPLETE</promise>
--- a/README.md
View file @b8ec00a
+++ b/README.md
View file @b8ec00a
 # xiaohongshu-skills
-xiaohongshu-skills
+
+小红书自动化 Claude Code Skills，基于 Python CDP 浏览器自动化引擎。
+
+为 OpenClaw 生态提供小红书操作能力，同时兼容 Claude Code Skills 格式。
+
+## 功能概览
+
+| 技能 | 说明 | 核心命令 |
+|------|------|----------|
+| **xhs-auth** | 认证管理 | `check-login`, `login`, `delete-cookies` |
+| **xhs-publish** | 内容发布 | `publish`, `publish-video` |
+| **xhs-explore** | 内容发现 | `list-feeds`, `search-feeds`, `get-feed-detail`, `user-profile` |
+| **xhs-interact** | 社交互动 | `post-comment`, `reply-comment`, `like-feed`, `favorite-feed` |
+| **xhs-content-ops** | 复合运营 | 竞品分析、热点追踪、内容创作、互动管理 |
+
+## 安装
+
+```bash
+# 克隆项目
+git clone https://github.com/autoclaw-cc/xiaohongshu-skills.git
+cd xiaohongshu-skills
+
+# 安装依赖（需要 uv）
+uv sync
+```
+
+### 前置条件
+
+- Python >= 3.11
+- [uv](https://docs.astral.sh/uv/) 包管理器
+- Google Chrome 浏览器
+
+## 快速开始
+
+### 1. 启动 Chrome
+
+```bash
+# 有窗口模式（推荐首次登录）
+python scripts/chrome_launcher.py
+
+# 无头模式
+python scripts/chrome_launcher.py --headless
+```
+
+### 2. 登录小红书
+
+```bash
+# 检查登录状态
+python scripts/cli.py check-login
+
+# 登录（扫码）
+python scripts/cli.py login
+```
+
+### 3. 搜索笔记
+
+```bash
+python scripts/cli.py search-feeds --keyword "关键词"
+
+# 带筛选
+python scripts/cli.py search-feeds \
+  --keyword "关键词" --sort-by 最新 --note-type 图文
+```
+
+### 4. 查看笔记详情
+
+```bash
+python scripts/cli.py get-feed-detail \
+  --feed-id FEED_ID --xsec-token XSEC_TOKEN
+```
+
+### 5. 发布内容
+
+```bash
+# 图文发布
+python scripts/cli.py publish \
+  --title-file title.txt \
+  --content-file content.txt \
+  --images "/abs/path/pic1.jpg" "/abs/path/pic2.jpg"
+
+# 视频发布
+python scripts/cli.py publish-video \
+  --title-file title.txt \
+  --content-file content.txt \
+  --video "/abs/path/video.mp4"
+```
+
+### 6. 社交互动
+
+```bash
+# 发表评论
+python scripts/cli.py post-comment \
+  --feed-id FEED_ID \
+  --xsec-token XSEC_TOKEN \
+  --content "评论内容"
+
+# 点赞
+python scripts/cli.py like-feed \
+  --feed-id FEED_ID --xsec-token XSEC_TOKEN
+
+# 收藏
+python scripts/cli.py favorite-feed \
+  --feed-id FEED_ID --xsec-token XSEC_TOKEN
+```
+
+## CLI 命令参考
+
+所有命令通过 `scripts/cli.py` 统一入口调用，输出 JSON 格式。
+
+全局选项：
+- `--host HOST` — Chrome 调试主机（默认 127.0.0.1）
+- `--port PORT` — Chrome 调试端口（默认 9222）
+- `--account NAME` — 指定账号
+
+| 子命令 | 说明 |
+|--------|------|
+| `check-login` | 检查登录状态 |
+| `login` | 获取登录二维码，等待扫码 |
+| `delete-cookies` | 清除 cookies |
+| `list-feeds` | 获取首页推荐 Feed |
+| `search-feeds` | 关键词搜索笔记 |
+| `get-feed-detail` | 获取笔记详情和评论 |
+| `user-profile` | 获取用户主页信息 |
+| `post-comment` | 对笔记发表评论 |
+| `reply-comment` | 回复指定评论 |
+| `like-feed` | 点赞 / 取消点赞 |
+| `favorite-feed` | 收藏 / 取消收藏 |
+| `publish` | 发布图文内容 |
+| `publish-video` | 发布视频内容 |
+
+退出码：0=成功，1=未登录，2=错误
+
+## 项目结构
+
+```
+xiaohongshu-skills/
+├── scripts/                        # Python CDP 自动化引擎
+│   ├── xhs/                        # 核心自动化包（模块化）
+│   │   ├── cdp.py                  # CDP WebSocket 客户端
+│   │   ├── stealth.py              # 反检测保护
+│   │   ├── cookies.py              # Cookie 持久化
+│   │   ├── types.py                # 数据类型
+│   │   ├── errors.py               # 异常体系
+│   │   ├── selectors.py            # CSS 选择器
+│   │   ├── urls.py                 # URL 常量
+│   │   ├── human.py                # 人类行为模拟
+│   │   ├── login.py                # 登录
+│   │   ├── feeds.py                # 首页 Feed
+│   │   ├── search.py               # 搜索
+│   │   ├── feed_detail.py          # 笔记详情
+│   │   ├── user_profile.py         # 用户主页
+│   │   ├── comment.py              # 评论
+│   │   ├── like_favorite.py        # 点赞/收藏
+│   │   ├── publish.py              # 图文发布
+│   │   └── publish_video.py        # 视频发布
+│   ├── cli.py                      # 统一 CLI（13 个子命令）
+│   ├── chrome_launcher.py          # Chrome 进程管理
+│   ├── account_manager.py          # 多账号管理
+│   ├── image_downloader.py         # 媒体下载
+│   ├── title_utils.py              # 标题长度计算
+│   ├── run_lock.py                 # 单实例锁
+│   └── publish_pipeline.py         # 发布编排器
+├── skills/                         # Claude Code Skills 定义
+│   ├── xhs-auth/SKILL.md           # 认证管理
+│   ├── xhs-publish/SKILL.md        # 内容发布
+│   ├── xhs-explore/SKILL.md        # 内容发现
+│   ├── xhs-interact/SKILL.md       # 社交互动
+│   └── xhs-content-ops/SKILL.md    # 复合运营
+├── SKILL.md                        # 统一入口
+├── CLAUDE.md                       # 项目开发指南
+├── pyproject.toml                  # uv 项目配置
+└── README.md
+```
+
+## 技术架构
+
+### 双层结构
+
+1. **scripts/ — Python CDP 引擎**
+   - 基于 xiaohongshu-mcp Go 源码从零重写
+   - 通过 Chrome DevTools Protocol (CDP) 直接控制浏览器
+   - 数据提取使用 `window.__INITIAL_STATE__` 模式
+   - 内置反检测保护（stealth flags + JS 注入）
+   - JSON 结构化输出
+
+2. **skills/ — Claude Code Skills 定义**
+   - SKILL.md 格式，指导 AI agent 如何调用 scripts/
+   - 包含输入判断、约束规则、工作流程、失败处理
+
+## 开发
+
+```bash
+uv sync                    # 安装依赖
+uv run ruff check .        # Lint 检查
+uv run ruff format .       # 代码格式化
+uv run pytest              # 运行测试
+```
--- a/SKILL.md 0 → 100644
View file @b8ec00a
+++ b/SKILL.md 0 → 100644
View file @b8ec00a
+---
+name: xiaohongshu-skills
+description: |
+  小红书自动化技能集合。支持认证登录、内容发布、搜索发现、社交互动、复合运营。
+  当用户要求操作小红书（发布、搜索、评论、登录、分析、点赞、收藏）时触发。
+---
+
+# 小红书自动化 Skills
+
+你是"小红书自动化助手"。根据用户意图路由到对应的子技能完成任务。
+
+## 输入判断
+
+按优先级判断用户意图，路由到对应子技能：
+
+1. **认证相关**（"登录 / 检查登录 / 切换账号"）→ 执行 `xhs-auth` 技能。
+2. **内容发布**（"发布 / 发帖 / 上传图文 / 上传视频"）→ 执行 `xhs-publish` 技能。
+3. **搜索发现**（"搜索笔记 / 查看详情 / 浏览首页 / 查看用户"）→ 执行 `xhs-explore` 技能。
+4. **社交互动**（"评论 / 回复 / 点赞 / 收藏"）→ 执行 `xhs-interact` 技能。
+5. **复合运营**（"竞品分析 / 热点追踪 / 批量互动 / 一键创作"）→ 执行 `xhs-content-ops` 技能。
+
+## 全局约束
+
+- 所有操作前应确认登录状态（通过 `check-login`）。
+- 发布和评论操作必须经过用户确认后才能执行。
+- 文件路径必须使用绝对路径。
+- CLI 输出为 JSON 格式，结构化呈现给用户。
+- 操作频率不宜过高，保持合理间隔。
+
+## 子技能概览
+
+### xhs-auth — 认证管理
+
+管理小红书登录状态和多账号切换。
+
+| 命令 | 功能 |
+|------|------|
+| `cli.py check-login` | 检查登录状态 |
+| `cli.py login` | 获取登录二维码，等待扫码 |
+| `cli.py delete-cookies` | 清除 cookies（退出/切换账号） |
+
+### xhs-publish — 内容发布
+
+发布图文或视频内容到小红书。
+
+| 命令 | 功能 |
+|------|------|
+| `cli.py publish` | 图文发布（本地图片或 URL） |
+| `cli.py publish-video` | 视频发布 |
+| `publish_pipeline.py` | 发布流水线（含图片下载和登录检查） |
+
+### xhs-explore — 内容发现
+
+搜索笔记、查看详情、获取用户资料。
+
+| 命令 | 功能 |
+|------|------|
+| `cli.py list-feeds` | 获取首页推荐 Feed |
+| `cli.py search-feeds` | 关键词搜索笔记 |
+| `cli.py get-feed-detail` | 获取笔记完整内容和评论 |
+| `cli.py user-profile` | 获取用户主页信息 |
+
+### xhs-interact — 社交互动
+
+发表评论、回复、点赞、收藏。
+
+| 命令 | 功能 |
+|------|------|
+| `cli.py post-comment` | 对笔记发表评论 |
+| `cli.py reply-comment` | 回复指定评论 |
+| `cli.py like-feed` | 点赞 / 取消点赞 |
+| `cli.py favorite-feed` | 收藏 / 取消收藏 |
+
+### xhs-content-ops — 复合运营
+
+组合多步骤完成运营工作流：竞品分析、热点追踪、内容创作、互动管理。
+
+## 快速开始
+
+```bash
+# 1. 启动 Chrome
+python scripts/chrome_launcher.py
+
+# 2. 检查登录状态
+python scripts/cli.py check-login
+
+# 3. 登录（如需要）
+python scripts/cli.py login
+
+# 4. 搜索笔记
+python scripts/cli.py search-feeds --keyword "关键词"
+
+# 5. 查看笔记详情
+python scripts/cli.py get-feed-detail \
+  --feed-id FEED_ID --xsec-token XSEC_TOKEN
+
+# 6. 发布图文
+python scripts/cli.py publish \
+  --title-file title.txt \
+  --content-file content.txt \
+  --images "/abs/path/pic1.jpg"
+
+# 7. 发表评论
+python scripts/cli.py post-comment \
+  --feed-id FEED_ID \
+  --xsec-token XSEC_TOKEN \
+  --content "评论内容"
+
+# 8. 点赞
+python scripts/cli.py like-feed \
+  --feed-id FEED_ID --xsec-token XSEC_TOKEN
+```
+
+## 失败处理
+
+- **未登录**：提示用户执行登录流程（xhs-auth）。
+- **Chrome 未启动**：使用 `chrome_launcher.py` 启动浏览器。
+- **操作超时**：检查网络连接，适当增加等待时间。
+- **频率限制**：降低操作频率，增大间隔。
--- a/pyproject.toml 0 → 100644
View file @b8ec00a
+++ b/pyproject.toml 0 → 100644
View file @b8ec00a
+[project]
+name = "xiaohongshu-skills"
+version = "0.1.0"
+description = "小红书自动化 Skills，基于 CDP 浏览器自动化"
+readme = "README.md"
+license = { text = "MIT" }
+requires-python = ">=3.11"
+dependencies = [
+    "requests>=2.28.0",
+    "websockets>=12.0",
+]
+
+[project.optional-dependencies]
+dev = [
+    "ruff>=0.9.0",
+    "pytest>=8.0",
+]
+
+[tool.ruff]
+target-version = "py311"
+line-length = 100
+
+[tool.ruff.lint]
+select = [
+    "E",    # pycodestyle errors
+    "W",    # pycodestyle warnings
+    "F",    # pyflakes
+    "I",    # isort
+    "N",    # pep8-naming
+    "UP",   # pyupgrade
+    "B",    # flake8-bugbear
+    "SIM",  # flake8-simplify
+    "RUF",  # ruff-specific rules
+]
+ignore = [
+    "E402",   # module-level imports not at top (needed for sys.path manipulation)
+    "RUF001", # ambiguous unicode characters (Chinese punctuation is intentional)
+    "RUF002", # ambiguous unicode in docstrings (Chinese punctuation is intentional)
+    "RUF003", # ambiguous unicode in comments (Chinese punctuation is intentional)
+]
+
+[tool.ruff.lint.per-file-ignores]
+
+[tool.ruff.lint.isort]
+known-first-party = ["xiaohongshu_skills"]
+
+[tool.pytest.ini_options]
+testpaths = ["tests"]
--- a/scripts/account_manager.py 0 → 100644
View file @b8ec00a
+++ b/scripts/account_manager.py 0 → 100644
View file @b8ec00a
+"""多账号管理，对应独立的账号配置管理。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import os
+from pathlib import Path
+
+logger = logging.getLogger(__name__)
+
+# 账号配置文件路径
+_CONFIG_DIR = Path.home() / ".xhs"
+_ACCOUNTS_FILE = _CONFIG_DIR / "accounts.json"
+
+
+def _load_config() -> dict:
+    """加载账号配置。"""
+    if not _ACCOUNTS_FILE.exists():
+        return {"default": "", "accounts": {}}
+    with open(_ACCOUNTS_FILE, encoding="utf-8") as f:
+        return json.load(f)
+
+
+def _save_config(config: dict) -> None:
+    """保存账号配置。"""
+    _CONFIG_DIR.mkdir(parents=True, exist_ok=True)
+    with open(_ACCOUNTS_FILE, "w", encoding="utf-8") as f:
+        json.dump(config, f, ensure_ascii=False, indent=2)
+
+
+def list_accounts() -> list[dict]:
+    """列出所有账号。"""
+    config = _load_config()
+    default = config.get("default", "")
+    accounts = config.get("accounts", {})
+    result = []
+    for name, info in accounts.items():
+        result.append(
+            {
+                "name": name,
+                "description": info.get("description", ""),
+                "is_default": name == default,
+                "profile_dir": _get_profile_dir(name),
+            }
+        )
+    return result
+
+
+def add_account(name: str, description: str = "") -> None:
+    """添加账号。"""
+    config = _load_config()
+    accounts = config.setdefault("accounts", {})
+    if name in accounts:
+        raise ValueError(f"账号 '{name}' 已存在")
+
+    accounts[name] = {"description": description}
+
+    # 如果是第一个账号，设为默认
+    if not config.get("default"):
+        config["default"] = name
+
+    _save_config(config)
+
+    # 创建 Profile 目录
+    profile_dir = _get_profile_dir(name)
+    os.makedirs(profile_dir, exist_ok=True)
+
+    logger.info("添加账号: %s", name)
+
+
+def remove_account(name: str) -> None:
+    """删除账号。"""
+    config = _load_config()
+    accounts = config.get("accounts", {})
+    if name not in accounts:
+        raise ValueError(f"账号 '{name}' 不存在")
+
+    del accounts[name]
+
+    # 如果删除的是默认账号，清除默认
+    if config.get("default") == name:
+        config["default"] = next(iter(accounts), "")
+
+    _save_config(config)
+    logger.info("删除账号: %s", name)
+
+
+def set_default_account(name: str) -> None:
+    """设置默认账号。"""
+    config = _load_config()
+    accounts = config.get("accounts", {})
+    if name not in accounts:
+        raise ValueError(f"账号 '{name}' 不存在")
+
+    config["default"] = name
+    _save_config(config)
+    logger.info("默认账号设置为: %s", name)
+
+
+def get_default_account() -> str:
+    """获取默认账号名称。"""
+    config = _load_config()
+    return config.get("default", "")
+
+
+def _get_profile_dir(account: str) -> str:
+    """获取账号的 Chrome Profile 目录。"""
+    return str(_CONFIG_DIR / "accounts" / account / "chrome-profile")
--- a/scripts/chrome_launcher.py 0 → 100644
View file @b8ec00a
+++ b/scripts/chrome_launcher.py 0 → 100644
View file @b8ec00a
+"""Chrome 进程管理（跨平台），对应 Go browser/browser.go 的进程管理部分。"""
+
+from __future__ import annotations
+
+import logging
+import os
+import platform
+import shutil
+import signal
+import subprocess
+import time
+
+from xhs.stealth import STEALTH_ARGS
+
+logger = logging.getLogger(__name__)
+
+# 默认远程调试端口
+DEFAULT_PORT = 9222
+
+# 各平台 Chrome 默认路径
+_CHROME_PATHS: dict[str, list[str]] = {
+    "Darwin": [
+        "/Applications/Google Chrome.app/Contents/MacOS/Google Chrome",
+        "/Applications/Chromium.app/Contents/MacOS/Chromium",
+    ],
+    "Linux": [
+        "/usr/bin/google-chrome",
+        "/usr/bin/google-chrome-stable",
+        "/usr/bin/chromium",
+        "/usr/bin/chromium-browser",
+        "/snap/bin/chromium",
+    ],
+    "Windows": [
+        r"C:\Program Files\Google\Chrome\Application\chrome.exe",
+        r"C:\Program Files (x86)\Google\Chrome\Application\chrome.exe",
+    ],
+}
+
+
+def find_chrome() -> str | None:
+    """查找 Chrome 可执行文件路径。"""
+    # 环境变量优先
+    env_path = os.getenv("CHROME_BIN")
+    if env_path and os.path.isfile(env_path):
+        return env_path
+
+    # which/where 查找
+    chrome = shutil.which("google-chrome") or shutil.which("chromium")
+    if chrome:
+        return chrome
+
+    # 平台默认路径
+    system = platform.system()
+    for path in _CHROME_PATHS.get(system, []):
+        if os.path.isfile(path):
+            return path
+
+    return None
+
+
+def launch_chrome(
+    port: int = DEFAULT_PORT,
+    headless: bool = False,
+    user_data_dir: str | None = None,
+    chrome_bin: str | None = None,
+) -> subprocess.Popen:
+    """启动 Chrome 进程（带远程调试端口）。
+
+    Args:
+        port: 远程调试端口。
+        headless: 是否无头模式。
+        user_data_dir: 用户数据目录（Profile 隔离）。
+        chrome_bin: Chrome 可执行文件路径。
+
+    Returns:
+        Chrome 子进程。
+
+    Raises:
+        FileNotFoundError: 未找到 Chrome。
+    """
+    if not chrome_bin:
+        chrome_bin = find_chrome()
+    if not chrome_bin:
+        raise FileNotFoundError("未找到 Chrome，请设置 CHROME_BIN 环境变量或安装 Chrome")
+
+    args = [
+        chrome_bin,
+        f"--remote-debugging-port={port}",
+        *STEALTH_ARGS,
+    ]
+
+    if headless:
+        args.append("--headless=new")
+
+    if user_data_dir:
+        args.append(f"--user-data-dir={user_data_dir}")
+
+    # 代理
+    proxy = os.getenv("XHS_PROXY")
+    if proxy:
+        args.append(f"--proxy-server={proxy}")
+        logger.info("使用代理: %s", _mask_proxy(proxy))
+
+    logger.info("启动 Chrome: port=%d, headless=%s", port, headless)
+    process = subprocess.Popen(
+        args,
+        stdout=subprocess.DEVNULL,
+        stderr=subprocess.DEVNULL,
+    )
+
+    # 等待 Chrome 准备就绪
+    _wait_for_chrome(port)
+    return process
+
+
+def close_chrome(process: subprocess.Popen) -> None:
+    """关闭 Chrome 进程。"""
+    if process.poll() is not None:
+        return
+
+    try:
+        process.send_signal(signal.SIGTERM)
+        process.wait(timeout=5)
+    except (subprocess.TimeoutExpired, OSError):
+        process.kill()
+        process.wait(timeout=3)
+
+    logger.info("Chrome 进程已关闭")
+
+
+def is_chrome_running(port: int = DEFAULT_PORT) -> bool:
+    """检查指定端口的 Chrome 是否在运行。"""
+    import requests
+
+    try:
+        resp = requests.get(f"http://127.0.0.1:{port}/json/version", timeout=2)
+        return resp.status_code == 200
+    except (requests.ConnectionError, requests.Timeout):
+        return False
+
+
+def _wait_for_chrome(port: int, timeout: float = 15.0) -> None:
+    """等待 Chrome 调试端口就绪。"""
+    deadline = time.monotonic() + timeout
+    while time.monotonic() < deadline:
+        if is_chrome_running(port):
+            logger.info("Chrome 已就绪 (port=%d)", port)
+            return
+        time.sleep(0.5)
+    logger.warning("等待 Chrome 就绪超时 (port=%d)", port)
+
+
+def _mask_proxy(proxy_url: str) -> str:
+    """隐藏代理 URL 中的敏感信息。"""
+    from urllib.parse import urlparse
+
+    try:
+        parsed = urlparse(proxy_url)
+        if parsed.username:
+            return proxy_url.replace(parsed.username, "***").replace(parsed.password or "", "***")
+    except Exception:
+        pass
+    return proxy_url
--- a/scripts/cli.py 0 → 100644
View file @b8ec00a
+++ b/scripts/cli.py 0 → 100644
View file @b8ec00a
+"""统一 CLI 入口，对应 Go MCP 工具的 13 个子命令。
+
+全局选项: --host, --port, --account
+输出: JSON（ensure_ascii=False）
+退出码: 0=成功, 1=未登录, 2=错误
+"""
+
+from __future__ import annotations
+
+import argparse
+import json
+import logging
+import sys
+
+logging.basicConfig(
+    level=logging.INFO,
+    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
+)
+logger = logging.getLogger("xhs-cli")
+
+
+def _output(data: dict, exit_code: int = 0) -> None:
+    """输出 JSON 并退出。"""
+    print(json.dumps(data, ensure_ascii=False, indent=2))
+    sys.exit(exit_code)
+
+
+def _connect(args: argparse.Namespace):
+    """连接到 Chrome 并返回 (browser, page)。"""
+    from xhs.cdp import Browser
+
+    browser = Browser(host=args.host, port=args.port)
+    browser.connect()
+    page = browser.new_page()
+    return browser, page
+
+
+# ========== 子命令实现 ==========
+
+
+def cmd_check_login(args: argparse.Namespace) -> None:
+    """检查登录状态。"""
+    from xhs.login import check_login_status
+
+    browser, page = _connect(args)
+    try:
+        logged_in = check_login_status(page)
+        _output({"logged_in": logged_in}, exit_code=0 if logged_in else 1)
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_login(args: argparse.Namespace) -> None:
+    """获取登录二维码并等待扫码。"""
+    from xhs.login import fetch_qrcode, save_qrcode_to_file, wait_for_login
+
+    browser, page = _connect(args)
+    try:
+        src, already = fetch_qrcode(page)
+        if already:
+            _output({"logged_in": True, "message": "已登录"})
+        else:
+            # 保存二维码到临时文件
+            qrcode_path = save_qrcode_to_file(src)
+            print(
+                json.dumps(
+                    {
+                        "qrcode_path": qrcode_path,
+                        "message": "请扫码登录，二维码已保存到文件",
+                    },
+                    ensure_ascii=False,
+                )
+            )
+            success = wait_for_login(page, timeout=120)
+            _output(
+                {"logged_in": success, "message": "登录成功" if success else "登录超时"},
+                exit_code=0 if success else 2,
+            )
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_delete_cookies(args: argparse.Namespace) -> None:
+    """删除 cookies。"""
+    from xhs.cookies import delete_cookies, get_cookies_file_path
+
+    path = get_cookies_file_path(args.account)
+    delete_cookies(path)
+    _output({"success": True, "message": f"已删除 cookies: {path}"})
+
+
+def cmd_list_feeds(args: argparse.Namespace) -> None:
+    """获取首页 Feed 列表。"""
+    from xhs.feeds import list_feeds
+
+    browser, page = _connect(args)
+    try:
+        feeds = list_feeds(page)
+        _output({"feeds": [f.to_dict() for f in feeds], "count": len(feeds)})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_search_feeds(args: argparse.Namespace) -> None:
+    """搜索 Feeds。"""
+    from xhs.search import search_feeds
+    from xhs.types import FilterOption
+
+    filter_opt = FilterOption(
+        sort_by=args.sort_by or "",
+        note_type=args.note_type or "",
+        publish_time=args.publish_time or "",
+        search_scope=args.search_scope or "",
+        location=args.location or "",
+    )
+
+    browser, page = _connect(args)
+    try:
+        feeds = search_feeds(page, args.keyword, filter_opt)
+        _output({"feeds": [f.to_dict() for f in feeds], "count": len(feeds)})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_get_feed_detail(args: argparse.Namespace) -> None:
+    """获取 Feed 详情。"""
+    from xhs.feed_detail import get_feed_detail
+    from xhs.types import CommentLoadConfig
+
+    config = CommentLoadConfig(
+        click_more_replies=args.click_more_replies,
+        max_replies_threshold=args.max_replies_threshold,
+        max_comment_items=args.max_comment_items,
+        scroll_speed=args.scroll_speed,
+    )
+
+    browser, page = _connect(args)
+    try:
+        detail = get_feed_detail(
+            page,
+            args.feed_id,
+            args.xsec_token,
+            load_all_comments=args.load_all_comments,
+            config=config,
+        )
+        _output(detail.to_dict())
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_user_profile(args: argparse.Namespace) -> None:
+    """获取用户主页。"""
+    from xhs.user_profile import get_user_profile
+
+    browser, page = _connect(args)
+    try:
+        profile = get_user_profile(page, args.user_id, args.xsec_token)
+        _output(profile.to_dict())
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_post_comment(args: argparse.Namespace) -> None:
+    """发表评论。"""
+    from xhs.comment import post_comment
+
+    browser, page = _connect(args)
+    try:
+        post_comment(page, args.feed_id, args.xsec_token, args.content)
+        _output({"success": True, "message": "评论发送成功"})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_reply_comment(args: argparse.Namespace) -> None:
+    """回复评论。"""
+    from xhs.comment import reply_comment
+
+    browser, page = _connect(args)
+    try:
+        reply_comment(
+            page,
+            args.feed_id,
+            args.xsec_token,
+            args.content,
+            comment_id=args.comment_id or "",
+            user_id=args.user_id or "",
+        )
+        _output({"success": True, "message": "回复成功"})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_like_feed(args: argparse.Namespace) -> None:
+    """点赞/取消点赞。"""
+    from xhs.like_favorite import like_feed, unlike_feed
+
+    browser, page = _connect(args)
+    try:
+        if args.unlike:
+            result = unlike_feed(page, args.feed_id, args.xsec_token)
+        else:
+            result = like_feed(page, args.feed_id, args.xsec_token)
+        _output(result.to_dict())
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_favorite_feed(args: argparse.Namespace) -> None:
+    """收藏/取消收藏。"""
+    from xhs.like_favorite import favorite_feed, unfavorite_feed
+
+    browser, page = _connect(args)
+    try:
+        if args.unfavorite:
+            result = unfavorite_feed(page, args.feed_id, args.xsec_token)
+        else:
+            result = favorite_feed(page, args.feed_id, args.xsec_token)
+        _output(result.to_dict())
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_publish(args: argparse.Namespace) -> None:
+    """发布图文内容。"""
+    from image_downloader import process_images
+    from xhs.publish import publish_image_content
+    from xhs.types import PublishImageContent
+
+    # 读取标题和正文
+    with open(args.title_file, encoding="utf-8") as f:
+        title = f.read().strip()
+    with open(args.content_file, encoding="utf-8") as f:
+        content = f.read().strip()
+
+    # 处理图片
+    image_paths = process_images(args.images) if args.images else []
+    if not image_paths:
+        _output({"success": False, "error": "没有有效的图片"}, exit_code=2)
+
+    browser, page = _connect(args)
+    try:
+        publish_image_content(
+            page,
+            PublishImageContent(
+                title=title,
+                content=content,
+                tags=args.tags or [],
+                image_paths=image_paths,
+                schedule_time=args.schedule_at,
+                is_original=args.original,
+                visibility=args.visibility or "",
+            ),
+        )
+        _output({"success": True, "title": title, "images": len(image_paths), "status": "发布完成"})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+def cmd_publish_video(args: argparse.Namespace) -> None:
+    """发布视频内容。"""
+    from xhs.publish_video import publish_video_content
+    from xhs.types import PublishVideoContent
+
+    with open(args.title_file, encoding="utf-8") as f:
+        title = f.read().strip()
+    with open(args.content_file, encoding="utf-8") as f:
+        content = f.read().strip()
+
+    browser, page = _connect(args)
+    try:
+        publish_video_content(
+            page,
+            PublishVideoContent(
+                title=title,
+                content=content,
+                tags=args.tags or [],
+                video_path=args.video,
+                schedule_time=args.schedule_at,
+                visibility=args.visibility or "",
+            ),
+        )
+        _output({"success": True, "title": title, "video": args.video, "status": "发布完成"})
+    finally:
+        browser.close_page(page)
+        browser.close()
+
+
+# ========== 参数解析 ==========
+
+
+def build_parser() -> argparse.ArgumentParser:
+    """构建 CLI 参数解析器。"""
+    parser = argparse.ArgumentParser(
+        prog="xhs-cli",
+        description="小红书自动化 CLI",
+    )
+
+    # 全局选项
+    parser.add_argument("--host", default="127.0.0.1", help="Chrome 调试主机 (default: 127.0.0.1)")
+    parser.add_argument("--port", type=int, default=9222, help="Chrome 调试端口 (default: 9222)")
+    parser.add_argument("--account", default="", help="账号名称")
+
+    subparsers = parser.add_subparsers(dest="command", required=True)
+
+    # check-login
+    sub = subparsers.add_parser("check-login", help="检查登录状态")
+    sub.set_defaults(func=cmd_check_login)
+
+    # login
+    sub = subparsers.add_parser("login", help="登录（扫码）")
+    sub.set_defaults(func=cmd_login)
+
+    # delete-cookies
+    sub = subparsers.add_parser("delete-cookies", help="删除 cookies")
+    sub.set_defaults(func=cmd_delete_cookies)
+
+    # list-feeds
+    sub = subparsers.add_parser("list-feeds", help="获取首页 Feed 列表")
+    sub.set_defaults(func=cmd_list_feeds)
+
+    # search-feeds
+    sub = subparsers.add_parser("search-feeds", help="搜索 Feeds")
+    sub.add_argument("--keyword", required=True, help="搜索关键词")
+    sub.add_argument("--sort-by", help="排序: 综合|最新|最多点赞|最多评论|最多收藏")
+    sub.add_argument("--note-type", help="类型: 不限|视频|图文")
+    sub.add_argument("--publish-time", help="时间: 不限|一天内|一周内|半年内")
+    sub.add_argument("--search-scope", help="范围: 不限|已看过|未看过|已关注")
+    sub.add_argument("--location", help="位置: 不限|同城|附近")
+    sub.set_defaults(func=cmd_search_feeds)
+
+    # get-feed-detail
+    sub = subparsers.add_parser("get-feed-detail", help="获取 Feed 详情")
+    sub.add_argument("--feed-id", required=True, help="Feed ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.add_argument("--load-all-comments", action="store_true", help="加载全部评论")
+    sub.add_argument("--click-more-replies", action="store_true", help="点击展开更多回复")
+    sub.add_argument("--max-replies-threshold", type=int, default=10, help="展开回复数阈值")
+    sub.add_argument("--max-comment-items", type=int, default=0, help="最大评论数 (0=不限)")
+    sub.add_argument("--scroll-speed", default="normal", help="滚动速度: slow|normal|fast")
+    sub.set_defaults(func=cmd_get_feed_detail)
+
+    # user-profile
+    sub = subparsers.add_parser("user-profile", help="获取用户主页")
+    sub.add_argument("--user-id", required=True, help="用户 ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.set_defaults(func=cmd_user_profile)
+
+    # post-comment
+    sub = subparsers.add_parser("post-comment", help="发表评论")
+    sub.add_argument("--feed-id", required=True, help="Feed ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.add_argument("--content", required=True, help="评论内容")
+    sub.set_defaults(func=cmd_post_comment)
+
+    # reply-comment
+    sub = subparsers.add_parser("reply-comment", help="回复评论")
+    sub.add_argument("--feed-id", required=True, help="Feed ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.add_argument("--content", required=True, help="回复内容")
+    sub.add_argument("--comment-id", help="目标评论 ID")
+    sub.add_argument("--user-id", help="目标用户 ID")
+    sub.set_defaults(func=cmd_reply_comment)
+
+    # like-feed
+    sub = subparsers.add_parser("like-feed", help="点赞")
+    sub.add_argument("--feed-id", required=True, help="Feed ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.add_argument("--unlike", action="store_true", help="取消点赞")
+    sub.set_defaults(func=cmd_like_feed)
+
+    # favorite-feed
+    sub = subparsers.add_parser("favorite-feed", help="收藏")
+    sub.add_argument("--feed-id", required=True, help="Feed ID")
+    sub.add_argument("--xsec-token", required=True, help="xsec_token")
+    sub.add_argument("--unfavorite", action="store_true", help="取消收藏")
+    sub.set_defaults(func=cmd_favorite_feed)
+
+    # publish
+    sub = subparsers.add_parser("publish", help="发布图文")
+    sub.add_argument("--title-file", required=True, help="标题文件路径")
+    sub.add_argument("--content-file", required=True, help="正文文件路径")
+    sub.add_argument("--images", nargs="+", required=True, help="图片路径/URL")
+    sub.add_argument("--tags", nargs="*", help="标签")
+    sub.add_argument("--schedule-at", help="定时发布 (ISO8601)")
+    sub.add_argument("--original", action="store_true", help="声明原创")
+    sub.add_argument("--visibility", help="可见范围")
+    sub.set_defaults(func=cmd_publish)
+
+    # publish-video
+    sub = subparsers.add_parser("publish-video", help="发布视频")
+    sub.add_argument("--title-file", required=True, help="标题文件路径")
+    sub.add_argument("--content-file", required=True, help="正文文件路径")
+    sub.add_argument("--video", required=True, help="视频文件路径")
+    sub.add_argument("--tags", nargs="*", help="标签")
+    sub.add_argument("--schedule-at", help="定时发布 (ISO8601)")
+    sub.add_argument("--visibility", help="可见范围")
+    sub.set_defaults(func=cmd_publish_video)
+
+    return parser
+
+
+def main() -> None:
+    """CLI 入口。"""
+    parser = build_parser()
+    args = parser.parse_args()
+
+    try:
+        args.func(args)
+    except Exception as e:
+        logger.error("执行失败: %s", e, exc_info=True)
+        _output({"success": False, "error": str(e)}, exit_code=2)
+
+
+if __name__ == "__main__":
+    main()
--- a/scripts/image_downloader.py 0 → 100644
View file @b8ec00a
+++ b/scripts/image_downloader.py 0 → 100644
View file @b8ec00a
+"""媒体下载（SHA256 缓存），对应 Go pkg/downloader/images.go。"""
+
+from __future__ import annotations
+
+import hashlib
+import logging
+import os
+import time
+from urllib.parse import urlparse
+
+import requests
+
+logger = logging.getLogger(__name__)
+
+_USER_AGENT = (
+    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
+    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36"
+)
+
+# 已知图片扩展名
+_IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".gif", ".webp", ".bmp", ".svg"}
+
+
+def is_image_url(path: str) -> bool:
+    """判断字符串是否为图片/媒体 URL。"""
+    return path.lower().startswith(("http://", "https://"))
+
+
+class ImageDownloader:
+    """图片下载器（带 SHA256 缓存）。"""
+
+    def __init__(self, save_path: str) -> None:
+        self.save_path = save_path
+        os.makedirs(save_path, exist_ok=True)
+        self._session = requests.Session()
+        self._session.timeout = 30
+
+    def download_image(self, image_url: str) -> str:
+        """下载单张图片，返回本地文件路径。
+
+        如果文件已存在（通过 URL hash 判断），直接返回路径。
+
+        Raises:
+            ValueError: URL 格式无效。
+            RuntimeError: 下载失败。
+        """
+        if not is_image_url(image_url):
+            raise ValueError(f"无效的图片 URL: {image_url}")
+
+        # 生成文件名
+        url_hash = hashlib.sha256(image_url.encode()).hexdigest()[:16]
+        ext = self._detect_extension(image_url)
+        filename = f"img_{url_hash}_{int(time.time())}{ext}"
+        filepath = os.path.join(self.save_path, filename)
+
+        # 检查是否已有同 hash 的文件
+        existing = self._find_existing(url_hash)
+        if existing:
+            return existing
+
+        # 下载
+        parsed = urlparse(image_url)
+        headers = {
+            "User-Agent": _USER_AGENT,
+            "Referer": f"{parsed.scheme}://{parsed.hostname}/",
+        }
+
+        resp = self._session.get(image_url, headers=headers)
+        if resp.status_code != 200:
+            raise RuntimeError(f"下载失败 (status={resp.status_code}): {image_url}")
+
+        # 保存
+        with open(filepath, "wb") as f:
+            f.write(resp.content)
+
+        logger.info("下载完成: %s -> %s", image_url, filepath)
+        return filepath
+
+    def download_images(self, image_urls: list[str]) -> list[str]:
+        """批量下载图片。"""
+        paths = []
+        for url in image_urls:
+            try:
+                path = self.download_image(url)
+                paths.append(path)
+            except Exception as e:
+                logger.error("下载失败 %s: %s", url, e)
+        return paths
+
+    def _detect_extension(self, url: str) -> str:
+        """从 URL 推断文件扩展名。"""
+        parsed = urlparse(url)
+        path = parsed.path.lower()
+        for ext in _IMAGE_EXTENSIONS:
+            if path.endswith(ext):
+                return ext
+        return ".jpg"  # 默认
+
+    def _find_existing(self, url_hash: str) -> str | None:
+        """查找已有同 hash 的文件。"""
+        prefix = f"img_{url_hash}_"
+        for filename in os.listdir(self.save_path):
+            if filename.startswith(prefix):
+                return os.path.join(self.save_path, filename)
+        return None
+
+
+def process_images(images: list[str], save_dir: str | None = None) -> list[str]:
+    """处理图片列表（URL 下载，本地路径直接返回）。"""
+    if not save_dir:
+        save_dir = os.path.join(os.path.expanduser("~"), ".xhs", "images")
+
+    downloader = ImageDownloader(save_dir)
+    result = []
+
+    for img in images:
+        if is_image_url(img):
+            path = downloader.download_image(img)
+            result.append(path)
+        else:
+            # 本地路径
+            if os.path.exists(img):
+                result.append(os.path.abspath(img))
+            else:
+                logger.warning("文件不存在: %s", img)
+
+    return result
--- a/scripts/publish_pipeline.py 0 → 100644
View file @b8ec00a
+++ b/scripts/publish_pipeline.py 0 → 100644
View file @b8ec00a
+"""发布编排器：下载 → 登录检查 → 发布 → 报告。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import sys
+
+from image_downloader import process_images
+from title_utils import calc_title_length
+from xhs.cdp import Browser
+from xhs.login import check_login_status
+from xhs.publish import publish_image_content
+from xhs.publish_video import publish_video_content
+from xhs.types import PublishImageContent, PublishVideoContent
+
+logger = logging.getLogger(__name__)
+
+
+def run_publish_pipeline(
+    title: str,
+    content: str,
+    images: list[str] | None = None,
+    video: str | None = None,
+    tags: list[str] | None = None,
+    schedule_time: str | None = None,
+    is_original: bool = False,
+    visibility: str = "",
+    host: str = "127.0.0.1",
+    port: int = 9222,
+    account: str = "",
+) -> dict:
+    """执行完整发布流水线。
+
+    Returns:
+        发布结果字典。
+    """
+    # 标题长度校验
+    title_len = calc_title_length(title)
+    if title_len > 20:
+        return {"success": False, "error": f"标题长度超限: {title_len}/20"}
+
+    # 处理图片（下载 URL / 验证本地路径）
+    local_images: list[str] = []
+    if images:
+        local_images = process_images(images)
+        if not local_images:
+            return {"success": False, "error": "没有有效的图片"}
+
+    # 连接浏览器
+    browser = Browser(host=host, port=port)
+    browser.connect()
+
+    try:
+        page = browser.new_page()
+        try:
+            # 登录检查
+            if not check_login_status(page):
+                return {"success": False, "error": "未登录", "exit_code": 1}
+
+            # 发布
+            if video:
+                publish_video_content(
+                    page,
+                    PublishVideoContent(
+                        title=title,
+                        content=content,
+                        tags=tags or [],
+                        video_path=video,
+                        schedule_time=schedule_time,
+                        visibility=visibility,
+                    ),
+                )
+            else:
+                publish_image_content(
+                    page,
+                    PublishImageContent(
+                        title=title,
+                        content=content,
+                        tags=tags or [],
+                        image_paths=local_images,
+                        schedule_time=schedule_time,
+                        is_original=is_original,
+                        visibility=visibility,
+                    ),
+                )
+
+            return {
+                "success": True,
+                "title": title,
+                "content_length": len(content),
+                "images": len(local_images),
+                "video": video or "",
+                "status": "发布完成",
+            }
+
+        finally:
+            browser.close_page(page)
+    finally:
+        browser.close()
+
+
+def main() -> None:
+    """CLI 入口（被 cli.py 的 publish/publish-video 子命令调用时使用）。"""
+    import argparse
+
+    parser = argparse.ArgumentParser(description="小红书发布流水线")
+    parser.add_argument("--title-file", required=True, help="标题文件路径")
+    parser.add_argument("--content-file", required=True, help="正文文件路径")
+    parser.add_argument("--images", nargs="*", help="图片路径或 URL 列表")
+    parser.add_argument("--video", help="视频文件路径")
+    parser.add_argument("--tags", nargs="*", help="标签列表")
+    parser.add_argument("--schedule-at", help="定时发布时间 (ISO8601)")
+    parser.add_argument("--original", action="store_true", help="声明原创")
+    parser.add_argument("--visibility", default="", help="可见范围")
+    parser.add_argument("--host", default="127.0.0.1")
+    parser.add_argument("--port", type=int, default=9222)
+    parser.add_argument("--account", default="")
+    args = parser.parse_args()
+
+    # 读取标题和正文
+    with open(args.title_file, encoding="utf-8") as f:
+        title = f.read().strip()
+    with open(args.content_file, encoding="utf-8") as f:
+        content = f.read().strip()
+
+    result = run_publish_pipeline(
+        title=title,
+        content=content,
+        images=args.images,
+        video=args.video,
+        tags=args.tags,
+        schedule_time=args.schedule_at,
+        is_original=args.original,
+        visibility=args.visibility,
+        host=args.host,
+        port=args.port,
+        account=args.account,
+    )
+
+    print(json.dumps(result, ensure_ascii=False, indent=2))
+    sys.exit(0 if result["success"] else 2)
+
+
+if __name__ == "__main__":
+    main()
--- a/scripts/run_lock.py 0 → 100644
View file @b8ec00a
+++ b/scripts/run_lock.py 0 → 100644
View file @b8ec00a
+"""单实例锁，防止多个进程同时操作浏览器。"""
+
+from __future__ import annotations
+
+import contextlib
+import logging
+import os
+import time
+
+logger = logging.getLogger(__name__)
+
+_DEFAULT_LOCK_FILE = os.path.join(os.path.expanduser("~"), ".xhs", "run.lock")
+
+
+class RunLock:
+    """文件锁，确保同一时间只有一个进程在操作。"""
+
+    def __init__(self, lock_file: str = _DEFAULT_LOCK_FILE) -> None:
+        self.lock_file = lock_file
+        self._fd: int | None = None
+
+    def acquire(self, timeout: float = 30.0) -> bool:
+        """获取锁。
+
+        Args:
+            timeout: 超时时间（秒）。
+
+        Returns:
+            True 获取成功，False 超时。
+        """
+        os.makedirs(os.path.dirname(self.lock_file), exist_ok=True)
+        deadline = time.monotonic() + timeout
+
+        while time.monotonic() < deadline:
+            try:
+                self._fd = os.open(
+                    self.lock_file,
+                    os.O_CREAT | os.O_EXCL | os.O_WRONLY,
+                )
+                # 写入 PID
+                os.write(self._fd, str(os.getpid()).encode())
+                logger.debug("获取锁成功: %s", self.lock_file)
+                return True
+            except FileExistsError:
+                # 检查持有者是否还活着
+                if self._is_stale():
+                    self._force_release()
+                    continue
+                time.sleep(1)
+
+        logger.warning("获取锁超时: %s", self.lock_file)
+        return False
+
+    def release(self) -> None:
+        """释放锁。"""
+        if self._fd is not None:
+            with contextlib.suppress(OSError):
+                os.close(self._fd)
+            self._fd = None
+
+        with contextlib.suppress(FileNotFoundError):
+            os.remove(self.lock_file)
+
+        logger.debug("释放锁: %s", self.lock_file)
+
+    def _is_stale(self) -> bool:
+        """检查锁文件是否已过时（持有进程已退出）。"""
+        try:
+            with open(self.lock_file) as f:
+                pid = int(f.read().strip())
+            # 检查进程是否存在
+            os.kill(pid, 0)
+            return False
+        except (FileNotFoundError, ValueError, ProcessLookupError, PermissionError):
+            return True
+
+    def _force_release(self) -> None:
+        """强制释放过时的锁。"""
+        with contextlib.suppress(FileNotFoundError):
+            os.remove(self.lock_file)
+        logger.info("强制释放过时锁: %s", self.lock_file)
+
+    def __enter__(self) -> RunLock:
+        if not self.acquire():
+            raise TimeoutError(f"无法获取锁: {self.lock_file}")
+        return self
+
+    def __exit__(self, *args: object) -> None:
+        self.release()
--- a/scripts/title_utils.py 0 → 100644
View file @b8ec00a
+++ b/scripts/title_utils.py 0 → 100644
View file @b8ec00a
+"""UTF-16 标题长度计算，对应 Go pkg/xhsutil/title.go。"""
+
+
+def calc_title_length(s: str) -> int:
+    """计算小红书标题长度。
+
+    规则：非 ASCII 字符（中文、全角符号等）算 2 字节，
+    ASCII 字符算 1 字节，最终结果向上取整除以 2。
+
+    Examples:
+        >>> calc_title_length("你好世界")
+        4
+        >>> calc_title_length("hello")
+        3
+        >>> calc_title_length("OOTD穿搭分享")
+        6
+    """
+    byte_len = 0
+    # 用 UTF-16 编码来处理（包括 surrogate pairs）
+    encoded = s.encode("utf-16-le")
+    for i in range(0, len(encoded), 2):
+        code_unit = int.from_bytes(encoded[i : i + 2], "little")
+        if code_unit > 127:
+            byte_len += 2
+        else:
+            byte_len += 1
+    return (byte_len + 1) // 2
--- a/scripts/xhs/__init__.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/__init__.py 0 → 100644
View file @b8ec00a
+"""小红书 CDP 自动化核心包。"""
--- a/scripts/xhs/cdp.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/cdp.py 0 → 100644
View file @b8ec00a
+"""CDP WebSocket 客户端（Browser, Page, Element），对应 Go browser/browser.go + go-rod API。
+
+通过原生 WebSocket 与 Chrome DevTools Protocol 通信，实现浏览器自动化控制。
+"""
+
+from __future__ import annotations
+
+import json
+import logging
+import time
+from typing import Any
+
+import requests
+import websockets.sync.client as ws_client
+
+from .errors import CDPError, ElementNotFoundError
+from .stealth import STEALTH_JS
+
+logger = logging.getLogger(__name__)
+
+
+class CDPClient:
+    """底层 CDP WebSocket 通信客户端。"""
+
+    def __init__(self, ws_url: str) -> None:
+        self._ws = ws_client.connect(ws_url, max_size=50 * 1024 * 1024)
+        self._id = 0
+        self._callbacks: dict[int, Any] = {}
+
+    def send(self, method: str, params: dict | None = None) -> dict:
+        """发送 CDP 命令并等待结果。"""
+        self._id += 1
+        msg: dict[str, Any] = {"id": self._id, "method": method}
+        if params:
+            msg["params"] = params
+        self._ws.send(json.dumps(msg))
+        return self._wait_for(self._id)
+
+    def _wait_for(self, msg_id: int, timeout: float = 30.0) -> dict:
+        """等待指定 id 的响应。"""
+        deadline = time.monotonic() + timeout
+        while time.monotonic() < deadline:
+            try:
+                raw = self._ws.recv(timeout=max(0.1, deadline - time.monotonic()))
+            except TimeoutError:
+                break
+            data = json.loads(raw)
+            if data.get("id") == msg_id:
+                if "error" in data:
+                    raise CDPError(f"CDP 错误: {data['error']}")
+                return data.get("result", {})
+        raise CDPError(f"等待 CDP 响应超时 (id={msg_id})")
+
+    def close(self) -> None:
+        import contextlib
+
+        with contextlib.suppress(Exception):
+            self._ws.close()
+
+
+class Page:
+    """CDP 页面对象，封装常用操作。"""
+
+    def __init__(self, cdp: CDPClient, target_id: str, session_id: str) -> None:
+        self._cdp = cdp
+        self.target_id = target_id
+        self.session_id = session_id
+        self._ws = cdp._ws
+        self._id_counter = 1000
+
+    def _send_session(self, method: str, params: dict | None = None) -> dict:
+        """向 session 发送命令。"""
+        self._id_counter += 1
+        msg: dict[str, Any] = {
+            "id": self._id_counter,
+            "method": method,
+            "sessionId": self.session_id,
+        }
+        if params:
+            msg["params"] = params
+        self._ws.send(json.dumps(msg))
+        return self._wait_session(self._id_counter)
+
+    def _wait_session(self, msg_id: int, timeout: float = 60.0) -> dict:
+        """等待 session 响应。"""
+        deadline = time.monotonic() + timeout
+        while time.monotonic() < deadline:
+            try:
+                raw = self._ws.recv(timeout=max(0.1, deadline - time.monotonic()))
+            except TimeoutError:
+                break
+            data = json.loads(raw)
+            if data.get("id") == msg_id:
+                if "error" in data:
+                    raise CDPError(f"CDP 错误: {data['error']}")
+                return data.get("result", {})
+        raise CDPError(f"等待 session 响应超时 (id={msg_id})")
+
+    def navigate(self, url: str) -> None:
+        """导航到指定 URL。"""
+        logger.info("导航到: %s", url)
+        self._send_session("Page.navigate", {"url": url})
+
+    def wait_for_load(self, timeout: float = 60.0) -> None:
+        """等待页面加载完成（通过轮询 document.readyState）。"""
+        deadline = time.monotonic() + timeout
+        while time.monotonic() < deadline:
+            try:
+                state = self.evaluate("document.readyState")
+                if state == "complete":
+                    return
+            except CDPError:
+                pass
+            time.sleep(0.5)
+        logger.warning("等待页面加载超时")
+
+    def wait_dom_stable(self, timeout: float = 10.0, interval: float = 0.5) -> None:
+        """等待 DOM 稳定（连续两次 DOM 快照一致）。"""
+        last_html = ""
+        deadline = time.monotonic() + timeout
+        while time.monotonic() < deadline:
+            try:
+                html = self.evaluate("document.body ? document.body.innerHTML.length : 0")
+                if html == last_html and html != "":
+                    return
+                last_html = html
+            except CDPError:
+                pass
+            time.sleep(interval)
+
+    def evaluate(self, expression: str, timeout: float = 30.0) -> Any:
+        """执行 JavaScript 表达式并返回结果。"""
+        result = self._send_session(
+            "Runtime.evaluate",
+            {
+                "expression": expression,
+                "returnByValue": True,
+                "awaitPromise": False,
+            },
+        )
+        if "exceptionDetails" in result:
+            raise CDPError(f"JS 执行异常: {result['exceptionDetails']}")
+        remote_obj = result.get("result", {})
+        return remote_obj.get("value")
+
+    def evaluate_function(self, function_body: str, *args: Any) -> Any:
+        """执行 JavaScript 函数并返回结果。
+
+        function_body 是一个完整的函数体，如 `() => { return 1; }`
+        """
+        result = self._send_session(
+            "Runtime.evaluate",
+            {
+                "expression": f"({function_body})()",
+                "returnByValue": True,
+                "awaitPromise": False,
+            },
+        )
+        if "exceptionDetails" in result:
+            raise CDPError(f"JS 函数执行异常: {result['exceptionDetails']}")
+        remote_obj = result.get("result", {})
+        return remote_obj.get("value")
+
+    def query_selector(self, selector: str) -> str | None:
+        """查找单个元素，返回 objectId 或 None。"""
+        result = self._send_session(
+            "Runtime.evaluate",
+            {
+                "expression": f"document.querySelector({json.dumps(selector)})",
+                "returnByValue": False,
+            },
+        )
+        remote_obj = result.get("result", {})
+        if remote_obj.get("subtype") == "null" or remote_obj.get("type") == "undefined":
+            return None
+        return remote_obj.get("objectId")
+
+    def query_selector_all(self, selector: str) -> list[str]:
+        """查找多个元素，返回 objectId 列表。"""
+        # 通过 JS 返回元素数量，然后逐个获取
+        count = self.evaluate(f"document.querySelectorAll({json.dumps(selector)}).length")
+        if not count:
+            return []
+        object_ids = []
+        for i in range(count):
+            result = self._send_session(
+                "Runtime.evaluate",
+                {
+                    "expression": (f"document.querySelectorAll({json.dumps(selector)})[{i}]"),
+                    "returnByValue": False,
+                },
+            )
+            obj = result.get("result", {})
+            oid = obj.get("objectId")
+            if oid:
+                object_ids.append(oid)
+        return object_ids
+
+    def has_element(self, selector: str) -> bool:
+        """检查元素是否存在。"""
+        return self.evaluate(f"document.querySelector({json.dumps(selector)}) !== null") is True
+
+    def wait_for_element(self, selector: str, timeout: float = 30.0) -> str:
+        """等待元素出现，返回 objectId。"""
+        deadline = time.monotonic() + timeout
+        while time.monotonic() < deadline:
+            oid = self.query_selector(selector)
+            if oid:
+                return oid
+            time.sleep(0.5)
+        raise ElementNotFoundError(selector)
+
+    def click_element(self, selector: str) -> None:
+        """点击指定选择器的元素。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (el) el.click();
+            }})()
+            """
+        )
+
+    def input_text(self, selector: str, text: str) -> None:
+        """向指定选择器的元素输入文本。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (!el) return;
+                el.focus();
+                el.value = {json.dumps(text)};
+                el.dispatchEvent(new Event('input', {{bubbles: true}}));
+                el.dispatchEvent(new Event('change', {{bubbles: true}}));
+            }})()
+            """
+        )
+
+    def input_content_editable(self, selector: str, text: str) -> None:
+        """向 contentEditable 元素输入文本（如 div.ql-editor）。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (!el) return;
+                el.focus();
+                el.textContent = {json.dumps(text)};
+                el.dispatchEvent(new Event('input', {{bubbles: true}}));
+            }})()
+            """
+        )
+
+    def get_element_text(self, selector: str) -> str | None:
+        """获取元素文本内容。"""
+        return self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                return el ? el.textContent : null;
+            }})()
+            """
+        )
+
+    def get_element_attribute(self, selector: str, attr: str) -> str | None:
+        """获取元素属性值。"""
+        return self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                return el ? el.getAttribute({json.dumps(attr)}) : null;
+            }})()
+            """
+        )
+
+    def get_elements_count(self, selector: str) -> int:
+        """获取匹配元素数量。"""
+        result = self.evaluate(f"document.querySelectorAll({json.dumps(selector)}).length")
+        return result if isinstance(result, int) else 0
+
+    def scroll_by(self, x: int, y: int) -> None:
+        """滚动页面。"""
+        self.evaluate(f"window.scrollBy({x}, {y})")
+
+    def scroll_to(self, x: int, y: int) -> None:
+        """滚动到指定位置。"""
+        self.evaluate(f"window.scrollTo({x}, {y})")
+
+    def scroll_to_bottom(self) -> None:
+        """滚动到页面底部。"""
+        self.evaluate("window.scrollTo(0, document.body.scrollHeight)")
+
+    def scroll_element_into_view(self, selector: str) -> None:
+        """将元素滚动到可视区域。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (el) el.scrollIntoView({{behavior: 'smooth', block: 'center'}});
+            }})()
+            """
+        )
+
+    def scroll_nth_element_into_view(self, selector: str, index: int) -> None:
+        """将第 N 个匹配元素滚动到可视区域。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const els = document.querySelectorAll({json.dumps(selector)});
+                if (els[{index}]) els[{index}].scrollIntoView(
+                    {{behavior: 'smooth', block: 'center'}}
+                );
+            }})()
+            """
+        )
+
+    def get_scroll_top(self) -> int:
+        """获取当前滚动位置。"""
+        result = self.evaluate(
+            "window.pageYOffset || document.documentElement.scrollTop"
+            " || document.body.scrollTop || 0"
+        )
+        return int(result) if result else 0
+
+    def get_viewport_height(self) -> int:
+        """获取视口高度。"""
+        result = self.evaluate("window.innerHeight")
+        return int(result) if result else 768
+
+    def set_file_input(self, selector: str, files: list[str]) -> None:
+        """设置文件输入框的文件（通过 CDP DOM.setFileInputFiles）。"""
+        # 先获取 nodeId
+        doc = self._send_session("DOM.getDocument", {"depth": 0})
+        root_node_id = doc["root"]["nodeId"]
+        result = self._send_session(
+            "DOM.querySelector",
+            {"nodeId": root_node_id, "selector": selector},
+        )
+        node_id = result.get("nodeId", 0)
+        if node_id == 0:
+            raise ElementNotFoundError(selector)
+        self._send_session(
+            "DOM.setFileInputFiles",
+            {"nodeId": node_id, "files": files},
+        )
+
+    def dispatch_wheel_event(self, delta_y: float) -> None:
+        """触发滚轮事件以激活懒加载。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                let target = document.querySelector('.note-scroller')
+                    || document.querySelector('.interaction-container')
+                    || document.documentElement;
+                const event = new WheelEvent('wheel', {{
+                    deltaY: {delta_y},
+                    deltaMode: 0,
+                    bubbles: true,
+                    cancelable: true,
+                    view: window,
+                }});
+                target.dispatchEvent(event);
+            }})()
+            """
+        )
+
+    def mouse_move(self, x: float, y: float) -> None:
+        """移动鼠标。"""
+        self._send_session(
+            "Input.dispatchMouseEvent",
+            {"type": "mouseMoved", "x": x, "y": y},
+        )
+
+    def mouse_click(self, x: float, y: float, button: str = "left") -> None:
+        """在指定坐标点击。"""
+        self._send_session(
+            "Input.dispatchMouseEvent",
+            {"type": "mousePressed", "x": x, "y": y, "button": button, "clickCount": 1},
+        )
+        self._send_session(
+            "Input.dispatchMouseEvent",
+            {"type": "mouseReleased", "x": x, "y": y, "button": button, "clickCount": 1},
+        )
+
+    def type_text(self, text: str, delay_ms: int = 50) -> None:
+        """逐字符输入文本。"""
+        for char in text:
+            self._send_session(
+                "Input.dispatchKeyEvent",
+                {"type": "keyDown", "text": char},
+            )
+            self._send_session(
+                "Input.dispatchKeyEvent",
+                {"type": "keyUp", "text": char},
+            )
+            if delay_ms > 0:
+                time.sleep(delay_ms / 1000.0)
+
+    def press_key(self, key: str) -> None:
+        """按下并释放指定键。"""
+        key_map = {
+            "Enter": {"key": "Enter", "code": "Enter", "windowsVirtualKeyCode": 13},
+            "ArrowDown": {
+                "key": "ArrowDown",
+                "code": "ArrowDown",
+                "windowsVirtualKeyCode": 40,
+            },
+            "Tab": {"key": "Tab", "code": "Tab", "windowsVirtualKeyCode": 9},
+        }
+        info = key_map.get(key, {"key": key, "code": key})
+        self._send_session(
+            "Input.dispatchKeyEvent",
+            {"type": "keyDown", **info},
+        )
+        self._send_session(
+            "Input.dispatchKeyEvent",
+            {"type": "keyUp", **info},
+        )
+
+    def inject_stealth(self) -> None:
+        """注入反检测脚本。"""
+        self._send_session(
+            "Page.addScriptToEvaluateOnNewDocument",
+            {"source": STEALTH_JS},
+        )
+
+    def remove_element(self, selector: str) -> None:
+        """移除 DOM 元素。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (el) el.remove();
+            }})()
+            """
+        )
+
+    def hover_element(self, selector: str) -> None:
+        """悬停到元素中心。"""
+        box = self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (!el) return null;
+                const rect = el.getBoundingClientRect();
+                return {{x: rect.left + rect.width / 2, y: rect.top + rect.height / 2}};
+            }})()
+            """
+        )
+        if box:
+            self.mouse_move(box["x"], box["y"])
+
+    def select_all_text(self, selector: str) -> None:
+        """选中输入框内所有文本。"""
+        self.evaluate(
+            f"""
+            (() => {{
+                const el = document.querySelector({json.dumps(selector)});
+                if (!el) return;
+                el.focus();
+                el.select ? el.select() : document.execCommand('selectAll');
+            }})()
+            """
+        )
+
+
+class Browser:
+    """Chrome 浏览器 CDP 控制器。"""
+
+    def __init__(self, host: str = "127.0.0.1", port: int = 9222) -> None:
+        self.host = host
+        self.port = port
+        self.base_url = f"http://{host}:{port}"
+        self._cdp: CDPClient | None = None
+
+    def connect(self) -> None:
+        """连接到 Chrome DevTools。"""
+        resp = requests.get(f"{self.base_url}/json/version", timeout=5)
+        resp.raise_for_status()
+        info = resp.json()
+        ws_url = info["webSocketDebuggerUrl"]
+        logger.info("连接到 Chrome: %s", ws_url)
+        self._cdp = CDPClient(ws_url)
+
+    def new_page(self, url: str = "about:blank") -> Page:
+        """创建新页面。"""
+        if not self._cdp:
+            self.connect()
+        assert self._cdp is not None
+
+        # 创建 target
+        result = self._cdp.send("Target.createTarget", {"url": url})
+        target_id = result["targetId"]
+
+        # 附加到 target
+        result = self._cdp.send(
+            "Target.attachToTarget",
+            {"targetId": target_id, "flatten": True},
+        )
+        session_id = result["sessionId"]
+
+        page = Page(self._cdp, target_id, session_id)
+
+        # 启用必要的 domain
+        page._send_session("Page.enable")
+        page._send_session("DOM.enable")
+        page._send_session("Runtime.enable")
+
+        # 注入反检测
+        page.inject_stealth()
+
+        return page
+
+    def get_existing_page(self) -> Page | None:
+        """获取已有页面（取第一个非 about:blank 的 page target）。"""
+        if not self._cdp:
+            self.connect()
+        assert self._cdp is not None
+
+        resp = requests.get(f"{self.base_url}/json", timeout=5)
+        targets = resp.json()
+
+        for target in targets:
+            if target.get("type") == "page" and target.get("url") != "about:blank":
+                target_id = target["id"]
+                result = self._cdp.send(
+                    "Target.attachToTarget",
+                    {"targetId": target_id, "flatten": True},
+                )
+                session_id = result["sessionId"]
+                page = Page(self._cdp, target_id, session_id)
+                page._send_session("Page.enable")
+                page._send_session("DOM.enable")
+                page._send_session("Runtime.enable")
+                page.inject_stealth()
+                return page
+        return None
+
+    def close_page(self, page: Page) -> None:
+        """关闭页面。"""
+        import contextlib
+
+        if self._cdp:
+            with contextlib.suppress(CDPError):
+                self._cdp.send("Target.closeTarget", {"targetId": page.target_id})
+
+    def close(self) -> None:
+        """关闭连接。"""
+        if self._cdp:
+            self._cdp.close()
+            self._cdp = None
--- a/scripts/xhs/comment.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/comment.py 0 → 100644
View file @b8ec00a
+"""评论操作，对应 Go xiaohongshu/comment_feed.go。"""
+
+from __future__ import annotations
+
+import logging
+import time
+
+from .cdp import Page
+from .feed_detail import _check_end_container, _check_page_accessible, _get_comment_count
+from .selectors import (
+    COMMENT_INPUT_FIELD,
+    COMMENT_INPUT_TRIGGER,
+    COMMENT_SUBMIT_BUTTON,
+    PARENT_COMMENT,
+    REPLY_BUTTON,
+)
+from .urls import make_feed_detail_url
+
+logger = logging.getLogger(__name__)
+
+
+def post_comment(page: Page, feed_id: str, xsec_token: str, content: str) -> None:
+    """发表评论到 Feed。
+
+    Args:
+        page: CDP 页面对象。
+        feed_id: Feed ID。
+        xsec_token: xsec_token。
+        content: 评论内容。
+
+    Raises:
+        RuntimeError: 评论失败。
+    """
+    url = make_feed_detail_url(feed_id, xsec_token)
+    logger.info("打开 feed 详情页: %s", url)
+
+    page.navigate(url)
+    page.wait_for_load()
+    page.wait_dom_stable()
+    time.sleep(1)
+
+    _check_page_accessible(page)
+
+    # 点击评论输入触发区域
+    if not page.has_element(COMMENT_INPUT_TRIGGER):
+        raise RuntimeError("未找到评论输入框，该帖子可能不支持评论或网页端不可访问")
+
+    page.click_element(COMMENT_INPUT_TRIGGER)
+    time.sleep(0.5)
+
+    # 输入评论内容
+    page.wait_for_element(COMMENT_INPUT_FIELD, timeout=5)
+    page.evaluate(
+        f"""
+        (() => {{
+            const el = document.querySelector({_js_str(COMMENT_INPUT_FIELD)});
+            if (el) {{
+                el.focus();
+                el.textContent = {_js_str(content)};
+                el.dispatchEvent(new Event('input', {{bubbles: true}}));
+            }}
+        }})()
+        """
+    )
+    time.sleep(1)
+
+    # 点击提交
+    page.click_element(COMMENT_SUBMIT_BUTTON)
+    time.sleep(1)
+
+    logger.info("评论发送成功: feed=%s", feed_id)
+
+
+def reply_comment(
+    page: Page,
+    feed_id: str,
+    xsec_token: str,
+    content: str,
+    comment_id: str = "",
+    user_id: str = "",
+) -> None:
+    """回复指定评论。
+
+    通过 comment_id 或 user_id 定位评论，然后回复。
+
+    Args:
+        page: CDP 页面对象。
+        feed_id: Feed ID。
+        xsec_token: xsec_token。
+        content: 回复内容。
+        comment_id: 评论 ID（优先使用）。
+        user_id: 用户 ID（备选）。
+
+    Raises:
+        RuntimeError: 回复失败。
+    """
+    if not comment_id and not user_id:
+        raise ValueError("comment_id 和 user_id 至少提供一个")
+
+    url = make_feed_detail_url(feed_id, xsec_token)
+    logger.info("打开 feed 详情页进行回复: %s", url)
+
+    page.navigate(url)
+    page.wait_for_load()
+    page.wait_dom_stable()
+    time.sleep(1)
+
+    _check_page_accessible(page)
+    time.sleep(2)
+
+    # 查找目标评论
+    comment_found = _find_and_scroll_to_comment(page, comment_id, user_id)
+    if not comment_found:
+        raise RuntimeError(f"未找到评论 (commentID: {comment_id}, userID: {user_id})")
+
+    time.sleep(1)
+
+    # 点击回复按钮
+    reply_selector = f"#comment-{comment_id} {REPLY_BUTTON}" if comment_id else REPLY_BUTTON
+    page.click_element(reply_selector)
+    time.sleep(1)
+
+    # 输入回复内容
+    page.wait_for_element(COMMENT_INPUT_FIELD, timeout=5)
+    page.evaluate(
+        f"""
+        (() => {{
+            const el = document.querySelector({_js_str(COMMENT_INPUT_FIELD)});
+            if (el) {{
+                el.focus();
+                el.textContent = {_js_str(content)};
+                el.dispatchEvent(new Event('input', {{bubbles: true}}));
+            }}
+        }})()
+        """
+    )
+    time.sleep(0.5)
+
+    # 点击提交
+    page.click_element(COMMENT_SUBMIT_BUTTON)
+    time.sleep(2)
+
+    logger.info("回复评论成功")
+
+
+def _find_and_scroll_to_comment(
+    page: Page,
+    comment_id: str,
+    user_id: str,
+    max_attempts: int = 100,
+) -> bool:
+    """查找并滚动到目标评论。"""
+    logger.info("开始查找评论 - commentID: %s, userID: %s", comment_id, user_id)
+
+    # 先滚动到评论区
+    page.scroll_element_into_view(".comments-container")
+    time.sleep(1)
+
+    last_count = 0
+    stagnant = 0
+
+    for attempt in range(max_attempts):
+        # 检查是否到底
+        if _check_end_container(page):
+            logger.info("已到达评论底部，未找到目标评论")
+            break
+
+        # 停滞检测
+        current_count = _get_comment_count(page)
+        if current_count != last_count:
+            last_count = current_count
+            stagnant = 0
+        else:
+            stagnant += 1
+        if stagnant >= 10:
+            logger.info("评论数量停滞超过10次")
+            break
+
+        # 滚动到最后一条评论
+        if current_count > 0:
+            page.scroll_nth_element_into_view(PARENT_COMMENT, current_count - 1)
+            time.sleep(0.3)
+
+        # 继续滚动
+        page.evaluate("window.scrollBy(0, window.innerHeight * 0.8)")
+        time.sleep(0.5)
+
+        # 通过 commentID 查找
+        if comment_id:
+            selector = f"#comment-{comment_id}"
+            if page.has_element(selector):
+                logger.info("通过 commentID 找到评论 (尝试 %d 次)", attempt + 1)
+                page.scroll_element_into_view(selector)
+                return True
+
+        # 通过 userID 查找
+        if user_id:
+            found = page.evaluate(
+                f"""
+                (() => {{
+                    const els = document.querySelectorAll(
+                        '.parent-comment, .comment-item, .comment'
+                    );
+                    for (const el of els) {{
+                        if (el.querySelector('[data-user-id="{user_id}"]')) {{
+                            el.scrollIntoView({{behavior: 'smooth', block: 'center'}});
+                            return true;
+                        }}
+                    }}
+                    return false;
+                }})()
+                """
+            )
+            if found:
+                logger.info("通过 userID 找到评论 (尝试 %d 次)", attempt + 1)
+                return True
+
+        time.sleep(0.8)
+
+    return False
+
+
+def _js_str(s: str) -> str:
+    """将 Python 字符串转为 JS 字面量（含引号）。"""
+    import json
+
+    return json.dumps(s)
--- a/scripts/xhs/cookies.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/cookies.py 0 → 100644
View file @b8ec00a
+"""Cookie 文件持久化，对应 Go cookies/cookies.go。"""
+
+from __future__ import annotations
+
+import os
+from pathlib import Path
+
+
+def get_cookies_file_path(account: str = "") -> str:
+    """获取 cookies 文件路径。
+
+    优先级：
+    1. /tmp/cookies.json（向后兼容）
+    2. COOKIES_PATH 环境变量
+    3. 多账号模式：~/.xhs/accounts/{account}/cookies.json
+    4. ./cookies.json（本地调试）
+    """
+    if account:
+        account_dir = Path.home() / ".xhs" / "accounts" / account
+        account_dir.mkdir(parents=True, exist_ok=True)
+        return str(account_dir / "cookies.json")
+
+    # 旧路径
+    import tempfile
+
+    old_path = os.path.join(tempfile.gettempdir(), "cookies.json")
+    if os.path.exists(old_path):
+        return old_path
+
+    # 环境变量
+    env_path = os.getenv("COOKIES_PATH")
+    if env_path:
+        return env_path
+
+    return "cookies.json"
+
+
+def load_cookies(path: str) -> bytes | None:
+    """从文件加载 cookies。"""
+    try:
+        with open(path, "rb") as f:
+            return f.read()
+    except FileNotFoundError:
+        return None
+
+
+def save_cookies(path: str, data: bytes) -> None:
+    """保存 cookies 到文件。"""
+    os.makedirs(os.path.dirname(path) or ".", exist_ok=True)
+    with open(path, "wb") as f:
+        f.write(data)
+
+
+def delete_cookies(path: str) -> None:
+    """删除 cookies 文件。"""
+    import contextlib
+
+    with contextlib.suppress(FileNotFoundError):
+        os.remove(path)
--- a/scripts/xhs/errors.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/errors.py 0 → 100644
View file @b8ec00a
+"""小红书自动化异常体系。"""
+
+
+class XHSError(Exception):
+    """小红书自动化基础异常。"""
+
+
+class NoFeedsError(XHSError):
+    """没有捕获到 feeds 数据。"""
+
+    def __init__(self) -> None:
+        super().__init__("没有捕获到 feeds 数据")
+
+
+class NoFeedDetailError(XHSError):
+    """没有捕获到 feed 详情数据。"""
+
+    def __init__(self) -> None:
+        super().__init__("没有捕获到 feed 详情数据")
+
+
+class NotLoggedInError(XHSError):
+    """未登录。"""
+
+    def __init__(self) -> None:
+        super().__init__("未登录，请先扫码登录")
+
+
+class PageNotAccessibleError(XHSError):
+    """页面不可访问。"""
+
+    def __init__(self, reason: str) -> None:
+        self.reason = reason
+        super().__init__(f"笔记不可访问: {reason}")
+
+
+class UploadTimeoutError(XHSError):
+    """上传超时。"""
+
+
+class PublishError(XHSError):
+    """发布失败。"""
+
+
+class TitleTooLongError(PublishError):
+    """标题超过长度限制。"""
+
+    def __init__(self, current: str, maximum: str) -> None:
+        self.current = current
+        self.maximum = maximum
+        super().__init__(f"当前输入长度为{current}，最大长度为{maximum}")
+
+
+class ContentTooLongError(PublishError):
+    """正文超过长度限制。"""
+
+    def __init__(self, current: str, maximum: str) -> None:
+        self.current = current
+        self.maximum = maximum
+        super().__init__(f"当前输入长度为{current}，最大长度为{maximum}")
+
+
+class CDPError(XHSError):
+    """CDP 通信异常。"""
+
+
+class ElementNotFoundError(XHSError):
+    """页面元素未找到。"""
+
+    def __init__(self, selector: str) -> None:
+        self.selector = selector
+        super().__init__(f"未找到元素: {selector}")
--- a/scripts/xhs/feed_detail.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/feed_detail.py 0 → 100644
View file @b8ec00a
+"""Feed 详情 + 评论加载，对应 Go xiaohongshu/feed_detail.go（867 行）。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import random
+import re
+import time
+
+from .cdp import Page
+from .errors import NoFeedDetailError, PageNotAccessibleError
+from .human import (
+    BUTTON_CLICK_INTERVAL,
+    DEFAULT_MAX_ATTEMPTS,
+    FINAL_SPRINT_PUSH_COUNT,
+    HUMAN_DELAY,
+    LARGE_SCROLL_TRIGGER,
+    MAX_CLICK_PER_ROUND,
+    MIN_SCROLL_DELTA,
+    POST_SCROLL,
+    REACTION_TIME,
+    READ_TIME,
+    SCROLL_WAIT,
+    SHORT_READ,
+    STAGNANT_LIMIT,
+    calculate_scroll_delta,
+    get_scroll_interval,
+    get_scroll_ratio,
+    sleep_random,
+)
+from .selectors import (
+    ACCESS_ERROR_WRAPPER,
+    END_CONTAINER,
+    NO_COMMENTS_TEXT,
+    PARENT_COMMENT,
+    SHOW_MORE_BUTTON,
+)
+from .types import (
+    CommentList,
+    CommentLoadConfig,
+    FeedDetail,
+    FeedDetailResponse,
+)
+from .urls import make_feed_detail_url
+
+logger = logging.getLogger(__name__)
+
+# 页面不可访问关键词
+_INACCESSIBLE_KEYWORDS = [
+    "当前笔记暂时无法浏览",
+    "该内容因违规已被删除",
+    "该笔记已被删除",
+    "内容不存在",
+    "笔记不存在",
+    "已失效",
+    "私密笔记",
+    "仅作者可见",
+    "因用户设置，你无法查看",
+    "因违规无法查看",
+]
+
+_REPLY_COUNT_RE = re.compile(r"展开\s*(\d+)\s*条回复")
+_TOTAL_COMMENT_RE = re.compile(r"共(\d+)条评论")
+
+
+def get_feed_detail(
+    page: Page,
+    feed_id: str,
+    xsec_token: str,
+    load_all_comments: bool = False,
+    config: CommentLoadConfig | None = None,
+) -> FeedDetailResponse:
+    """获取 Feed 详情（含评论）。
+
+    Args:
+        page: CDP 页面对象。
+        feed_id: Feed ID。
+        xsec_token: xsec_token。
+        load_all_comments: 是否加载全部评论。
+        config: 评论加载配置。
+
+    Raises:
+        PageNotAccessibleError: 页面不可访问。
+        NoFeedDetailError: 未获取到详情数据。
+    """
+    if config is None:
+        config = CommentLoadConfig()
+
+    url = make_feed_detail_url(feed_id, xsec_token)
+    logger.info("打开 feed 详情页: %s", url)
+    logger.info(
+        "配置: 点击更多=%s, 回复阈值=%d, 最大评论数=%d, 滚动速度=%s",
+        config.click_more_replies,
+        config.max_replies_threshold,
+        config.max_comment_items,
+        config.scroll_speed,
+    )
+
+    # 导航（含重试）
+    for attempt in range(3):
+        try:
+            page.navigate(url)
+            page.wait_for_load()
+            page.wait_dom_stable()
+            break
+        except Exception as e:
+            logger.debug("页面导航重试 #%d: %s", attempt, e)
+            time.sleep(0.5 + random.random())
+    else:
+        raise RuntimeError("页面导航失败")
+
+    sleep_random(1000, 1000)
+
+    # 检查页面可访问性
+    _check_page_accessible(page)
+
+    # 加载全部评论
+    if load_all_comments:
+        try:
+            _load_all_comments(page, config)
+        except Exception as e:
+            logger.warning("加载全部评论失败: %s", e)
+
+    return _extract_feed_detail(page, feed_id)
+
+
+# ========== 页面检查 ==========
+
+
+def _check_page_accessible(page: Page) -> None:
+    """检查页面是否可访问。"""
+    time.sleep(0.5)
+
+    text = page.get_element_text(ACCESS_ERROR_WRAPPER)
+    if not text:
+        return
+
+    text = text.strip()
+    for kw in _INACCESSIBLE_KEYWORDS:
+        if kw in text:
+            raise PageNotAccessibleError(kw)
+
+    if text:
+        raise PageNotAccessibleError(text)
+
+
+# ========== 数据提取 ==========
+
+
+_EXTRACT_DETAIL_JS = """
+(() => {
+    if (window.__INITIAL_STATE__ &&
+        window.__INITIAL_STATE__.note &&
+        window.__INITIAL_STATE__.note.noteDetailMap) {
+        return JSON.stringify(window.__INITIAL_STATE__.note.noteDetailMap);
+    }
+    return "";
+})()
+"""
+
+
+def _extract_feed_detail(page: Page, feed_id: str) -> FeedDetailResponse:
+    """从 __INITIAL_STATE__ 提取 Feed 详情。"""
+    result = None
+    for _ in range(3):
+        result = page.evaluate(_EXTRACT_DETAIL_JS)
+        if result:
+            break
+        time.sleep(0.2)
+
+    if not result:
+        raise NoFeedDetailError()
+
+    note_detail_map = json.loads(result)
+    note_data = note_detail_map.get(feed_id)
+    if not note_data:
+        raise NoFeedDetailError()
+
+    return FeedDetailResponse(
+        note=FeedDetail.from_dict(note_data.get("note", {})),
+        comments=CommentList.from_dict(note_data.get("comments", {})),
+    )
+
+
+# ========== 评论加载状态机 ==========
+
+
+def _load_all_comments(page: Page, config: CommentLoadConfig) -> None:
+    """加载全部评论的状态机。"""
+    max_attempts = (
+        config.max_comment_items * 3 if config.max_comment_items > 0 else DEFAULT_MAX_ATTEMPTS
+    )
+    scroll_interval = get_scroll_interval(config.scroll_speed)
+
+    logger.info("开始加载评论...")
+    _scroll_to_comments_area(page)
+    sleep_random(*HUMAN_DELAY)
+
+    # 检查是否无评论
+    if _check_no_comments(page):
+        logger.info("检测到无评论区域，跳过加载")
+        return
+
+    # 状态
+    last_count = 0
+    last_scroll_top = 0
+    stagnant_checks = 0
+    total_clicked = 0
+    total_skipped = 0
+
+    for attempt in range(max_attempts):
+        logger.debug("=== 尝试 %d/%d ===", attempt + 1, max_attempts)
+
+        # 检查是否到达底部
+        if _check_end_container(page):
+            count = _get_comment_count(page)
+            logger.info(
+                "检测到 THE END，加载完成: %d 条评论, 点击: %d, 跳过: %d",
+                count,
+                total_clicked,
+                total_skipped,
+            )
+            return
+
+        # 定期点击展开按钮
+        if config.click_more_replies and attempt % BUTTON_CLICK_INTERVAL == 0:
+            clicked, skipped = _click_show_more_buttons(page, config.max_replies_threshold)
+            total_clicked += clicked
+            total_skipped += skipped
+            if clicked > 0 or skipped > 0:
+                sleep_random(*READ_TIME)
+                # 第二轮
+                c2, s2 = _click_show_more_buttons(page, config.max_replies_threshold)
+                total_clicked += c2
+                total_skipped += s2
+                if c2 > 0 or s2 > 0:
+                    sleep_random(*SHORT_READ)
+
+        # 获取当前评论数
+        current_count = _get_comment_count(page)
+        if current_count != last_count:
+            logger.info("评论增加: %d -> %d", last_count, current_count)
+            last_count = current_count
+            stagnant_checks = 0
+        else:
+            stagnant_checks += 1
+
+        # 检查是否达到目标
+        if config.max_comment_items > 0 and current_count >= config.max_comment_items:
+            logger.info("已达到目标评论数: %d/%d", current_count, config.max_comment_items)
+            return
+
+        # 滚动
+        if current_count > 0:
+            _scroll_to_last_comment(page)
+            sleep_random(*POST_SCROLL)
+
+        large_mode = stagnant_checks >= LARGE_SCROLL_TRIGGER
+        push_count = 1
+        if large_mode:
+            push_count = 3 + random.randint(0, 2)
+
+        scroll_delta, current_scroll_top = _human_scroll(
+            page, config.scroll_speed, large_mode, push_count
+        )
+
+        if scroll_delta < MIN_SCROLL_DELTA or current_scroll_top == last_scroll_top:
+            stagnant_checks += 1
+        else:
+            stagnant_checks = 0
+            last_scroll_top = current_scroll_top
+
+        # 停滞处理
+        if stagnant_checks >= STAGNANT_LIMIT:
+            logger.info("停滞过多，尝试大冲刺...")
+            _human_scroll(page, config.scroll_speed, True, 10)
+            stagnant_checks = 0
+
+        time.sleep(scroll_interval)
+
+    # 最终冲刺
+    logger.info("达到最大尝试次数，最后冲刺...")
+    _human_scroll(page, config.scroll_speed, True, FINAL_SPRINT_PUSH_COUNT)
+    count = _get_comment_count(page)
+    logger.info("加载结束: %d 条评论, 点击: %d, 跳过: %d", count, total_clicked, total_skipped)
+
+
+# ========== 滚动 ==========
+
+
+def _human_scroll(
+    page: Page,
+    speed: str,
+    large_mode: bool,
+    push_count: int,
+) -> tuple[int, int]:
+    """人类化滚动。
+
+    Returns:
+        (actual_delta, current_scroll_top)
+    """
+    before_top = page.get_scroll_top()
+    viewport_height = page.get_viewport_height()
+
+    base_ratio = get_scroll_ratio(speed)
+    if large_mode:
+        base_ratio *= 2.0
+
+    actual_delta = 0
+    current_scroll_top = before_top
+
+    for i in range(max(1, push_count)):
+        scroll_delta = calculate_scroll_delta(viewport_height, base_ratio)
+        page.scroll_by(0, int(scroll_delta))
+        sleep_random(*SCROLL_WAIT)
+
+        current_scroll_top = page.get_scroll_top()
+        delta_this = current_scroll_top - before_top
+        actual_delta += delta_this
+        before_top = current_scroll_top
+
+        if i < push_count - 1:
+            sleep_random(*HUMAN_DELAY)
+
+    # 如果没有滚动，强制到底部
+    if actual_delta < MIN_SCROLL_DELTA and push_count > 0:
+        page.scroll_to_bottom()
+        sleep_random(*POST_SCROLL)
+        current_scroll_top = page.get_scroll_top()
+        actual_delta = current_scroll_top - (before_top - actual_delta)
+
+    return actual_delta, current_scroll_top
+
+
+def _scroll_to_comments_area(page: Page) -> None:
+    """滚动到评论区。"""
+    logger.info("滚动到评论区...")
+    page.scroll_element_into_view(".comments-container")
+    time.sleep(0.5)
+    # 触发懒加载
+    page.dispatch_wheel_event(100)
+
+
+def _scroll_to_last_comment(page: Page) -> None:
+    """滚动到最后一条评论。"""
+    count = page.get_elements_count(PARENT_COMMENT)
+    if count > 0:
+        page.scroll_nth_element_into_view(PARENT_COMMENT, count - 1)
+
+
+# ========== DOM 查询 ==========
+
+
+def _get_comment_count(page: Page) -> int:
+    """获取当前评论数量。"""
+    return page.get_elements_count(PARENT_COMMENT)
+
+
+def _get_total_comment_count(page: Page) -> int:
+    """获取总评论数（从 "共N条评论" 提取）。"""
+    text = page.get_element_text(".comments-container .total")
+    if not text:
+        return 0
+    match = _TOTAL_COMMENT_RE.search(text)
+    if match:
+        return int(match.group(1))
+    return 0
+
+
+def _check_no_comments(page: Page) -> bool:
+    """检查是否无评论区域。"""
+    text = page.get_element_text(NO_COMMENTS_TEXT)
+    if not text:
+        return False
+    return "这是一片荒地" in text.strip()
+
+
+def _check_end_container(page: Page) -> bool:
+    """检查是否到达底部 THE END。"""
+    text = page.get_element_text(END_CONTAINER)
+    if not text:
+        return False
+    upper = text.strip().upper()
+    return "THE END" in upper or "THEEND" in upper
+
+
+# ========== 按钮点击 ==========
+
+
+def _click_show_more_buttons(page: Page, max_threshold: int) -> tuple[int, int]:
+    """点击"展开N条回复"按钮。
+
+    Returns:
+        (clicked, skipped)
+    """
+    count = page.get_elements_count(SHOW_MORE_BUTTON)
+    if count == 0:
+        return 0, 0
+
+    max_click = MAX_CLICK_PER_ROUND + random.randint(0, MAX_CLICK_PER_ROUND - 1)
+    clicked = 0
+    skipped = 0
+
+    for i in range(count):
+        if clicked >= max_click:
+            break
+
+        # 获取按钮文本
+        text = page.evaluate(
+            f"document.querySelectorAll({json.dumps(SHOW_MORE_BUTTON)})[{i}]?.textContent || ''"
+        )
+        if not text:
+            continue
+
+        # 检查是否应该跳过
+        if max_threshold > 0:
+            match = _REPLY_COUNT_RE.search(text)
+            if match:
+                reply_count = int(match.group(1))
+                if reply_count > max_threshold:
+                    logger.debug(
+                        "跳过 '%s'（回复数 %d > 阈值 %d）", text, reply_count, max_threshold
+                    )
+                    skipped += 1
+                    continue
+
+        # 滚动到按钮并点击
+        page.scroll_nth_element_into_view(SHOW_MORE_BUTTON, i)
+        sleep_random(*REACTION_TIME)
+        page.evaluate(f"document.querySelectorAll({json.dumps(SHOW_MORE_BUTTON)})[{i}]?.click()")
+        sleep_random(*READ_TIME)
+        clicked += 1
+
+    return clicked, skipped
--- a/scripts/xhs/feeds.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/feeds.py 0 → 100644
View file @b8ec00a
+"""首页 Feed 列表，对应 Go xiaohongshu/feeds.go。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import time
+
+from .cdp import Page
+from .errors import NoFeedsError
+from .types import Feed
+from .urls import HOME_URL
+
+logger = logging.getLogger(__name__)
+
+# 从 __INITIAL_STATE__ 提取 feeds 的 JS
+_EXTRACT_FEEDS_JS = """
+(() => {
+    if (window.__INITIAL_STATE__ &&
+        window.__INITIAL_STATE__.feed &&
+        window.__INITIAL_STATE__.feed.feeds) {
+        const feeds = window.__INITIAL_STATE__.feed.feeds;
+        const feedsData = feeds.value !== undefined ? feeds.value : feeds._value;
+        if (feedsData) {
+            return JSON.stringify(feedsData);
+        }
+    }
+    return "";
+})()
+"""
+
+
+def list_feeds(page: Page) -> list[Feed]:
+    """获取首页 Feed 列表。
+
+    Raises:
+        NoFeedsError: 没有捕获到 feeds 数据。
+    """
+    page.navigate(HOME_URL)
+    page.wait_for_load()
+    page.wait_dom_stable()
+    time.sleep(1)
+
+    result = page.evaluate(_EXTRACT_FEEDS_JS)
+    if not result:
+        raise NoFeedsError()
+
+    feeds_data = json.loads(result)
+    return [Feed.from_dict(f) for f in feeds_data]
--- a/scripts/xhs/human.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/human.py 0 → 100644
View file @b8ec00a
+"""人类行为模拟参数（延迟、滚动、悬停），对应 Go feed_detail.go 中的常量。"""
+
+import random
+import time
+
+# ========== 配置常量 ==========
+DEFAULT_MAX_ATTEMPTS = 500
+STAGNANT_LIMIT = 20
+MIN_SCROLL_DELTA = 10
+MAX_CLICK_PER_ROUND = 3
+STAGNANT_CHECK_THRESHOLD = 2
+LARGE_SCROLL_TRIGGER = 5
+BUTTON_CLICK_INTERVAL = 3
+FINAL_SPRINT_PUSH_COUNT = 15
+
+# ========== 延迟范围（毫秒） ==========
+HUMAN_DELAY = (300, 700)
+REACTION_TIME = (300, 800)
+HOVER_TIME = (100, 300)
+READ_TIME = (500, 1200)
+SHORT_READ = (600, 1200)
+SCROLL_WAIT = (100, 200)
+POST_SCROLL = (300, 500)
+
+
+def sleep_random(min_ms: int, max_ms: int) -> None:
+    """随机延迟。"""
+    if max_ms <= min_ms:
+        time.sleep(min_ms / 1000.0)
+        return
+    delay = random.randint(min_ms, max_ms) / 1000.0
+    time.sleep(delay)
+
+
+def get_scroll_interval(speed: str) -> float:
+    """根据速度获取滚动间隔（秒）。"""
+    if speed == "slow":
+        return (1200 + random.randint(0, 300)) / 1000.0
+    if speed == "fast":
+        return (300 + random.randint(0, 100)) / 1000.0
+    # normal
+    return (600 + random.randint(0, 200)) / 1000.0
+
+
+def get_scroll_ratio(speed: str) -> float:
+    """根据速度获取滚动比例。"""
+    if speed == "slow":
+        return 0.5
+    if speed == "fast":
+        return 0.9
+    return 0.7
+
+
+def calculate_scroll_delta(viewport_height: int, base_ratio: float) -> float:
+    """计算滚动距离。"""
+    scroll_delta = viewport_height * (base_ratio + random.random() * 0.2)
+    if scroll_delta < 400:
+        scroll_delta = 400.0
+    return scroll_delta + random.randint(-50, 50)
+
+
+# 页面不可访问关键词
+INACCESSIBLE_KEYWORDS = [
+    "当前笔记暂时无法浏览",
+    "该内容因违规已被删除",
+    "该笔记已被删除",
+    "内容不存在",
+    "笔记不存在",
+    "已失效",
+    "私密笔记",
+    "仅作者可见",
+    "因用户设置，你无法查看",
+    "因违规无法查看",
+]
--- a/scripts/xhs/like_favorite.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/like_favorite.py 0 → 100644
View file @b8ec00a
+"""点赞/收藏操作，对应 Go xiaohongshu/like_favorite.go。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import time
+
+from .cdp import Page
+from .errors import NoFeedDetailError
+from .selectors import COLLECT_BUTTON, LIKE_BUTTON
+from .types import ActionResult
+from .urls import make_feed_detail_url
+
+logger = logging.getLogger(__name__)
+
+# 从 __INITIAL_STATE__ 读取互动状态的 JS
+_GET_INTERACT_STATE_JS = """
+(() => {
+    if (window.__INITIAL_STATE__ &&
+        window.__INITIAL_STATE__.note &&
+        window.__INITIAL_STATE__.note.noteDetailMap) {
+        return JSON.stringify(window.__INITIAL_STATE__.note.noteDetailMap);
+    }
+    return "";
+})()
+"""
+
+
+def _get_interact_state(page: Page, feed_id: str) -> tuple[bool, bool]:
+    """读取笔记的点赞/收藏状态。
+
+    Returns:
+        (liked, collected)
+
+    Raises:
+        NoFeedDetailError: 无法获取状态。
+    """
+    result = page.evaluate(_GET_INTERACT_STATE_JS)
+    if not result:
+        raise NoFeedDetailError()
+
+    note_detail_map = json.loads(result)
+    detail = note_detail_map.get(feed_id)
+    if not detail:
+        raise NoFeedDetailError()
+
+    interact = detail.get("note", {}).get("interactInfo", {})
+    return interact.get("liked", False), interact.get("collected", False)
+
+
+def _prepare_page(page: Page, feed_id: str, xsec_token: str) -> None:
+    """导航到 feed 详情页。"""
+    url = make_feed_detail_url(feed_id, xsec_token)
+    page.navigate(url)
+    page.wait_for_load()
+    page.wait_dom_stable()
+    time.sleep(1)
+
+
+# ========== 点赞 ==========
+
+
+def like_feed(page: Page, feed_id: str, xsec_token: str) -> ActionResult:
+    """点赞笔记（幂等：已点赞则跳过）。"""
+    _prepare_page(page, feed_id, xsec_token)
+    return _toggle_like(page, feed_id, target_liked=True)
+
+
+def unlike_feed(page: Page, feed_id: str, xsec_token: str) -> ActionResult:
+    """取消点赞（幂等：未点赞则跳过）。"""
+    _prepare_page(page, feed_id, xsec_token)
+    return _toggle_like(page, feed_id, target_liked=False)
+
+
+def _toggle_like(page: Page, feed_id: str, target_liked: bool) -> ActionResult:
+    """执行点赞/取消点赞操作。"""
+    action_name = "点赞" if target_liked else "取消点赞"
+
+    try:
+        liked, _ = _get_interact_state(page, feed_id)
+    except NoFeedDetailError:
+        logger.warning("无法读取互动状态，直接点击")
+        liked = not target_liked  # 强制执行点击
+
+    # 幂等检查
+    if liked == target_liked:
+        logger.info("feed %s 已%s，跳过", feed_id, action_name)
+        return ActionResult(feed_id=feed_id, success=True, message=f"已{action_name}")
+
+    # 点击
+    page.click_element(LIKE_BUTTON)
+    time.sleep(3)
+
+    # 验证
+    try:
+        liked, _ = _get_interact_state(page, feed_id)
+        if liked == target_liked:
+            logger.info("feed %s %s成功", feed_id, action_name)
+            return ActionResult(feed_id=feed_id, success=True, message=f"{action_name}成功")
+    except NoFeedDetailError:
+        pass
+
+    # 重试一次
+    logger.warning("feed %s %s可能未成功，重试", feed_id, action_name)
+    page.click_element(LIKE_BUTTON)
+    time.sleep(2)
+
+    return ActionResult(feed_id=feed_id, success=True, message=f"{action_name}已执行")
+
+
+# ========== 收藏 ==========
+
+
+def favorite_feed(page: Page, feed_id: str, xsec_token: str) -> ActionResult:
+    """收藏笔记（幂等：已收藏则跳过）。"""
+    _prepare_page(page, feed_id, xsec_token)
+    return _toggle_favorite(page, feed_id, target_collected=True)
+
+
+def unfavorite_feed(page: Page, feed_id: str, xsec_token: str) -> ActionResult:
+    """取消收藏（幂等：未收藏则跳过）。"""
+    _prepare_page(page, feed_id, xsec_token)
+    return _toggle_favorite(page, feed_id, target_collected=False)
+
+
+def _toggle_favorite(page: Page, feed_id: str, target_collected: bool) -> ActionResult:
+    """执行收藏/取消收藏操作。"""
+    action_name = "收藏" if target_collected else "取消收藏"
+
+    try:
+        _, collected = _get_interact_state(page, feed_id)
+    except NoFeedDetailError:
+        logger.warning("无法读取互动状态，直接点击")
+        collected = not target_collected
+
+    # 幂等检查
+    if collected == target_collected:
+        logger.info("feed %s 已%s，跳过", feed_id, action_name)
+        return ActionResult(feed_id=feed_id, success=True, message=f"已{action_name}")
+
+    # 点击
+    page.click_element(COLLECT_BUTTON)
+    time.sleep(3)
+
+    # 验证
+    try:
+        _, collected = _get_interact_state(page, feed_id)
+        if collected == target_collected:
+            logger.info("feed %s %s成功", feed_id, action_name)
+            return ActionResult(feed_id=feed_id, success=True, message=f"{action_name}成功")
+    except NoFeedDetailError:
+        pass
+
+    # 重试
+    logger.warning("feed %s %s可能未成功，重试", feed_id, action_name)
+    page.click_element(COLLECT_BUTTON)
+    time.sleep(2)
+
+    return ActionResult(feed_id=feed_id, success=True, message=f"{action_name}已执行")
--- a/scripts/xhs/login.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/login.py 0 → 100644
View file @b8ec00a
+"""登录管理，对应 Go xiaohongshu/login.go。"""
+
+from __future__ import annotations
+
+import base64
+import logging
+import os
+import tempfile
+import time
+
+from .cdp import Page
+from .selectors import LOGIN_STATUS, QRCODE_IMG
+from .urls import EXPLORE_URL
+
+logger = logging.getLogger(__name__)
+
+
+def check_login_status(page: Page) -> bool:
+    """检查登录状态。
+
+    Returns:
+        True 已登录，False 未登录。
+    """
+    page.navigate(EXPLORE_URL)
+    page.wait_for_load()
+    time.sleep(1)
+
+    return page.has_element(LOGIN_STATUS)
+
+
+def fetch_qrcode(page: Page) -> tuple[str, bool]:
+    """获取登录二维码。
+
+    Returns:
+        (qrcode_src, already_logged_in)
+        - 如果已登录，返回 ("", True)
+        - 如果未登录，返回 (qrcode_base64_or_url, False)
+    """
+    page.navigate(EXPLORE_URL)
+    page.wait_for_load()
+    time.sleep(2)
+
+    # 检查是否已登录
+    if page.has_element(LOGIN_STATUS):
+        return "", True
+
+    # 获取二维码图片 src
+    src = page.get_element_attribute(QRCODE_IMG, "src")
+    if not src:
+        raise RuntimeError("二维码图片 src 为空")
+
+    return src, False
+
+
+def save_qrcode_to_file(src: str) -> str:
+    """将二维码 data URL 保存为临时 PNG 文件。
+
+    Args:
+        src: 二维码图片的 data URL（data:image/png;base64,...）或普通 URL。
+
+    Returns:
+        保存的文件绝对路径。
+    """
+    prefix = "data:image/png;base64,"
+    if src.startswith(prefix):
+        img_data = base64.b64decode(src[len(prefix) :])
+    elif src.startswith("data:image/"):
+        # 处理其他 MIME 类型，如 data:image/jpeg;base64,...
+        _, encoded = src.split(",", 1)
+        img_data = base64.b64decode(encoded)
+    else:
+        # 不是 data URL，无法保存
+        raise ValueError(f"不支持的二维码格式，需要 data URL: {src[:50]}...")
+
+    qr_dir = os.path.join(tempfile.gettempdir(), "xhs")
+    os.makedirs(qr_dir, exist_ok=True)
+    filepath = os.path.join(qr_dir, "login_qrcode.png")
+
+    with open(filepath, "wb") as f:
+        f.write(img_data)
+
+    logger.info("二维码已保存: %s", filepath)
+    return filepath
+
+
+def wait_for_login(page: Page, timeout: float = 120.0) -> bool:
+    """等待扫码登录完成。
+
+    Args:
+        page: CDP 页面对象。
+        timeout: 超时时间（秒）。
+
+    Returns:
+        True 登录成功，False 超时。
+    """
+    deadline = time.monotonic() + timeout
+    while time.monotonic() < deadline:
+        if page.has_element(LOGIN_STATUS):
+            logger.info("登录成功")
+            return True
+        time.sleep(0.5)
+    return False
--- a/scripts/xhs/publish.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/publish.py 0 → 100644
View file @b8ec00a
+"""图文发布，对应 Go xiaohongshu/publish.go（837 行）。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import random
+import time
+
+from .cdp import Page
+from .errors import ContentTooLongError, PublishError, TitleTooLongError, UploadTimeoutError
+from .selectors import (
+    CONTENT_EDITOR,
+    CONTENT_LENGTH_ERROR,
+    CREATOR_TAB,
+    DATETIME_INPUT,
+    FILE_INPUT,
+    IMAGE_PREVIEW,
+    ORIGINAL_SWITCH,
+    ORIGINAL_SWITCH_CARD,
+    POPOVER,
+    PUBLISH_BUTTON,
+    SCHEDULE_SWITCH,
+    TAG_FIRST_ITEM,
+    TAG_TOPIC_CONTAINER,
+    TITLE_INPUT,
+    TITLE_MAX_SUFFIX,
+    UPLOAD_CONTENT,
+    UPLOAD_INPUT,
+    VISIBILITY_DROPDOWN,
+    VISIBILITY_OPTIONS,
+)
+from .types import PublishImageContent
+from .urls import PUBLISH_URL
+
+logger = logging.getLogger(__name__)
+
+
+def publish_image_content(page: Page, content: PublishImageContent) -> None:
+    """发布图文内容。
+
+    Args:
+        page: CDP 页面对象。
+        content: 发布内容。
+
+    Raises:
+        PublishError: 发布失败。
+        UploadTimeoutError: 上传超时。
+        TitleTooLongError: 标题超长。
+        ContentTooLongError: 正文超长。
+    """
+    if not content.image_paths:
+        raise PublishError("图片不能为空")
+
+    # 导航到发布页
+    _navigate_to_publish_page(page)
+
+    # 点击"上传图文" TAB
+    _click_publish_tab(page, "上传图文")
+    time.sleep(1)
+
+    # 上传图片
+    _upload_images(page, content.image_paths)
+
+    # 标签截取
+    tags = content.tags[:10] if len(content.tags) > 10 else content.tags
+    if len(content.tags) > 10:
+        logger.warning("标签数量超过10，截取前10个")
+
+    logger.info(
+        "发布内容: title=%s, images=%d, tags=%d, schedule=%s, original=%s, visibility=%s",
+        content.title,
+        len(content.image_paths),
+        len(tags),
+        content.schedule_time,
+        content.is_original,
+        content.visibility,
+    )
+
+    # 提交发布
+    _submit_publish(
+        page,
+        content.title,
+        content.content,
+        tags,
+        content.schedule_time,
+        content.is_original,
+        content.visibility,
+    )
+
+
+# ========== 页面导航 ==========
+
+
+def _navigate_to_publish_page(page: Page) -> None:
+    """导航到发布页面。"""
+    page.navigate(PUBLISH_URL)
+    page.wait_for_load(timeout=300)
+    time.sleep(2)
+    page.wait_dom_stable()
+    time.sleep(1)
+
+
+def _click_publish_tab(page: Page, tab_name: str) -> None:
+    """点击发布页 TAB（上传图文/上传视频）。"""
+    page.wait_for_element(UPLOAD_CONTENT, timeout=15)
+
+    deadline = time.monotonic() + 15
+    while time.monotonic() < deadline:
+        # 查找匹配的 TAB
+        found = page.evaluate(
+            f"""
+            (() => {{
+                const tabs = document.querySelectorAll({json.dumps(CREATOR_TAB)});
+                for (const tab of tabs) {{
+                    if (tab.textContent.trim() === {json.dumps(tab_name)}) {{
+                        // 检查是否被遮挡
+                        const rect = tab.getBoundingClientRect();
+                        if (rect.width === 0 || rect.height === 0) continue;
+                        const x = rect.left + rect.width / 2;
+                        const y = rect.top + rect.height / 2;
+                        const target = document.elementFromPoint(x, y);
+                        if (target === tab || tab.contains(target)) {{
+                            tab.click();
+                            return 'clicked';
+                        }}
+                        return 'blocked';
+                    }}
+                }}
+                return 'not_found';
+            }})()
+            """
+        )
+
+        if found == "clicked":
+            return
+
+        if found == "blocked":
+            # 尝试移除弹窗
+            _remove_pop_cover(page)
+
+        time.sleep(0.2)
+
+    raise PublishError(f"没有找到发布 TAB - {tab_name}")
+
+
+def _remove_pop_cover(page: Page) -> None:
+    """移除弹窗遮挡。"""
+    if page.has_element(POPOVER):
+        page.remove_element(POPOVER)
+    # 点击空位置
+    x = 380 + random.randint(0, 100)
+    y = 20 + random.randint(0, 60)
+    page.mouse_click(float(x), float(y))
+
+
+# ========== 图片上传 ==========
+
+
+def _upload_images(page: Page, image_paths: list[str]) -> None:
+    """逐张上传图片。"""
+    import os
+
+    valid_paths = [p for p in image_paths if os.path.exists(p)]
+    if not valid_paths:
+        raise PublishError("没有有效的图片文件")
+
+    for i, path in enumerate(valid_paths):
+        selector = UPLOAD_INPUT if i == 0 else FILE_INPUT
+        logger.info("上传第 %d 张图片: %s", i + 1, path)
+
+        page.set_file_input(selector, [path])
+        _wait_for_upload_complete(page, i + 1)
+        time.sleep(1)
+
+
+def _wait_for_upload_complete(page: Page, expected_count: int) -> None:
+    """等待图片上传完成。"""
+    max_wait = 60.0
+    start = time.monotonic()
+
+    while time.monotonic() - start < max_wait:
+        count = page.get_elements_count(IMAGE_PREVIEW)
+        if count >= expected_count:
+            logger.info("图片上传完成: %d", count)
+            return
+        time.sleep(0.5)
+
+    raise UploadTimeoutError(f"第{expected_count}张图片上传超时(60s)")
+
+
+# ========== 表单提交 ==========
+
+
+def _submit_publish(
+    page: Page,
+    title: str,
+    content: str,
+    tags: list[str],
+    schedule_time: str | None,
+    is_original: bool,
+    visibility: str,
+) -> None:
+    """填写表单并提交。"""
+    # 标题
+    page.input_text(TITLE_INPUT, title)
+    time.sleep(0.5)
+    _check_title_max_length(page)
+    logger.info("标题长度检查通过")
+    time.sleep(1)
+
+    # 正文
+    content_selector = _find_content_element(page)
+    page.input_content_editable(content_selector, content)
+
+    # 回点标题（增强稳定性）
+    time.sleep(1)
+    page.click_element(TITLE_INPUT)
+    logger.info("已回点标题输入框")
+
+    # 标签
+    if tags:
+        _input_tags(page, content_selector, tags)
+    time.sleep(1)
+    _check_content_max_length(page)
+    logger.info("正文长度检查通过")
+
+    # 定时发布
+    if schedule_time:
+        _set_schedule_publish(page, schedule_time)
+
+    # 可见范围
+    _set_visibility(page, visibility)
+
+    # 原创声明
+    if is_original:
+        try:
+            _set_original(page)
+            logger.info("已声明原创")
+        except Exception as e:
+            logger.warning("设置原创声明失败: %s", e)
+
+    # 点击发布
+    page.click_element(PUBLISH_BUTTON)
+    time.sleep(3)
+    logger.info("发布完成")
+
+
+def _find_content_element(page: Page) -> str:
+    """查找内容输入框（兼容两种 UI）。"""
+    if page.has_element(CONTENT_EDITOR):
+        return CONTENT_EDITOR
+
+    # 查找带 placeholder 的 p 元素的 textbox 父元素
+    found = page.evaluate(
+        """
+        (() => {
+            const ps = document.querySelectorAll('p');
+            for (const p of ps) {
+                const placeholder = p.getAttribute('data-placeholder');
+                if (placeholder && placeholder.includes('输入正文描述')) {
+                    let current = p;
+                    for (let i = 0; i < 5; i++) {
+                        current = current.parentElement;
+                        if (!current) break;
+                        if (current.getAttribute('role') === 'textbox') {
+                            return 'found';
+                        }
+                    }
+                }
+            }
+            return '';
+        })()
+        """
+    )
+    if found == "found":
+        return "[role='textbox']"
+
+    raise PublishError("没有找到内容输入框")
+
+
+def _check_title_max_length(page: Page) -> None:
+    """检查标题长度是否超限。"""
+    text = page.get_element_text(TITLE_MAX_SUFFIX)
+    if text:
+        parts = text.split("/")
+        if len(parts) == 2:
+            raise TitleTooLongError(parts[0], parts[1])
+        raise TitleTooLongError(text, "?")
+
+
+def _check_content_max_length(page: Page) -> None:
+    """检查正文长度是否超限。"""
+    text = page.get_element_text(CONTENT_LENGTH_ERROR)
+    if text:
+        parts = text.split("/")
+        if len(parts) == 2:
+            raise ContentTooLongError(parts[0], parts[1])
+        raise ContentTooLongError(text, "?")
+
+
+# ========== 标签输入 ==========
+
+
+def _input_tags(page: Page, content_selector: str, tags: list[str]) -> None:
+    """输入标签。"""
+    time.sleep(1)
+
+    # 移动光标到正文末尾（20次 ArrowDown）
+    for _ in range(20):
+        page.press_key("ArrowDown")
+        time.sleep(0.01)
+
+    # 按两次回车换行
+    page.press_key("Enter")
+    page.press_key("Enter")
+    time.sleep(1)
+
+    for tag in tags:
+        tag = tag.lstrip("#")
+        _input_single_tag(page, content_selector, tag)
+
+
+def _input_single_tag(page: Page, content_selector: str, tag: str) -> None:
+    """输入单个标签。"""
+    # 输入 #
+    page.type_text("#", delay_ms=0)
+    time.sleep(0.2)
+
+    # 逐字输入标签
+    for char in tag:
+        page.type_text(char, delay_ms=50)
+
+    time.sleep(1)
+
+    # 尝试点击标签联想
+    if page.has_element(TAG_TOPIC_CONTAINER):
+        item_selector = f"{TAG_TOPIC_CONTAINER} {TAG_FIRST_ITEM}"
+        if page.has_element(item_selector):
+            page.click_element(item_selector)
+            logger.info("点击标签联想: %s", tag)
+            time.sleep(0.5)
+            return
+
+    # 没有联想，直接空格
+    logger.warning("未找到标签联想，直接输入空格: %s", tag)
+    page.type_text(" ", delay_ms=0)
+    time.sleep(0.5)
+
+
+# ========== 定时发布 ==========
+
+
+def _set_schedule_publish(page: Page, schedule_time: str) -> None:
+    """设置定时发布。"""
+    from datetime import datetime
+
+    # 解析 ISO8601 时间
+    try:
+        dt = datetime.fromisoformat(schedule_time)
+    except ValueError as e:
+        raise PublishError(f"定时发布时间格式错误: {e}") from e
+
+    # 点击定时发布开关
+    page.click_element(SCHEDULE_SWITCH)
+    time.sleep(0.8)
+
+    # 设置日期时间
+    datetime_str = dt.strftime("%Y-%m-%d %H:%M")
+    page.select_all_text(DATETIME_INPUT)
+    page.input_text(DATETIME_INPUT, datetime_str)
+    time.sleep(0.5)
+
+    logger.info("已设置定时发布: %s", datetime_str)
+
+
+# ========== 可见范围 ==========
+
+
+def _set_visibility(page: Page, visibility: str) -> None:
+    """设置可见范围。"""
+    if not visibility or visibility == "公开可见":
+        logger.info("可见范围: 公开可见（默认）")
+        return
+
+    supported = {"仅自己可见", "仅互关好友可见"}
+    if visibility not in supported:
+        raise PublishError(
+            f"不支持的可见范围: {visibility}，支持: 公开可见、仅自己可见、仅互关好友可见"
+        )
+
+    # 点击下拉框
+    page.click_element(VISIBILITY_DROPDOWN)
+    time.sleep(0.5)
+
+    # 查找并点击目标选项
+    clicked = page.evaluate(
+        f"""
+        (() => {{
+            const opts = document.querySelectorAll({json.dumps(VISIBILITY_OPTIONS)});
+            for (const opt of opts) {{
+                if (opt.textContent.includes({json.dumps(visibility)})) {{
+                    opt.click();
+                    return true;
+                }}
+            }}
+            return false;
+        }})()
+        """
+    )
+
+    if not clicked:
+        raise PublishError(f"未找到可见范围选项: {visibility}")
+
+    logger.info("已设置可见范围: %s", visibility)
+    time.sleep(0.2)
+
+
+# ========== 原创声明 ==========
+
+
+def _set_original(page: Page) -> None:
+    """设置原创声明。"""
+    # 查找原创声明卡片并点击开关
+    result = page.evaluate(
+        f"""
+        (() => {{
+            const cards = document.querySelectorAll({json.dumps(ORIGINAL_SWITCH_CARD)});
+            for (const card of cards) {{
+                if (!card.textContent.includes('原创声明')) continue;
+                const sw = card.querySelector({json.dumps(ORIGINAL_SWITCH)});
+                if (!sw) continue;
+                const input = sw.querySelector('input[type="checkbox"]');
+                if (input && input.checked) return 'already_on';
+                sw.click();
+                return 'clicked';
+            }}
+            return 'not_found';
+        }})()
+        """
+    )
+
+    if result == "already_on":
+        logger.info("原创声明已开启")
+        return
+
+    if result == "not_found":
+        raise PublishError("未找到原创声明选项")
+
+    time.sleep(0.5)
+
+    # 处理确认弹窗
+    _confirm_original_declaration(page)
+
+
+def _confirm_original_declaration(page: Page) -> None:
+    """处理原创声明确认弹窗。"""
+    time.sleep(0.8)
+
+    # 勾选 checkbox
+    page.evaluate(
+        """
+        (() => {
+            const footers = document.querySelectorAll('div.footer');
+            for (const footer of footers) {
+                if (!footer.textContent.includes('原创声明须知')) continue;
+                const cb = footer.querySelector('div.d-checkbox input[type="checkbox"]');
+                if (cb && !cb.checked) cb.click();
+                return;
+            }
+        })()
+        """
+    )
+    time.sleep(0.5)
+
+    # 点击声明原创按钮
+    result = page.evaluate(
+        """
+        (() => {
+            const footers = document.querySelectorAll('div.footer');
+            for (const footer of footers) {
+                if (!footer.textContent.includes('声明原创')) continue;
+                const btn = footer.querySelector('button.custom-button');
+                if (btn) {
+                    if (btn.classList.contains('disabled') || btn.disabled) {
+                        const cb = footer.querySelector('div.d-checkbox input[type="checkbox"]');
+                        if (cb && !cb.checked) cb.click();
+                        return 'button_disabled';
+                    }
+                    btn.click();
+                    return 'clicked';
+                }
+            }
+            return 'button_not_found';
+        })()
+        """
+    )
+
+    if result == "button_not_found":
+        raise PublishError("未找到声明原创按钮")
+    if result == "button_disabled":
+        raise PublishError("声明原创按钮仍处于禁用状态")
+
+    logger.info("已成功点击声明原创按钮")
+    time.sleep(0.3)
--- a/scripts/xhs/publish_video.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/publish_video.py 0 → 100644
View file @b8ec00a
+"""视频发布，对应 Go xiaohongshu/publish_video.go。"""
+
+from __future__ import annotations
+
+import logging
+import os
+import time
+
+from .cdp import Page
+from .errors import PublishError, UploadTimeoutError
+from .publish import (
+    _click_publish_tab,
+    _find_content_element,
+    _input_tags,
+    _navigate_to_publish_page,
+    _set_schedule_publish,
+    _set_visibility,
+)
+from .selectors import (
+    FILE_INPUT,
+    PUBLISH_BUTTON,
+    TITLE_INPUT,
+    UPLOAD_INPUT,
+)
+from .types import PublishVideoContent
+
+logger = logging.getLogger(__name__)
+
+
+def publish_video_content(page: Page, content: PublishVideoContent) -> None:
+    """发布视频内容。
+
+    Args:
+        page: CDP 页面对象。
+        content: 视频发布内容。
+
+    Raises:
+        PublishError: 发布失败。
+        UploadTimeoutError: 上传/处理超时。
+    """
+    if not content.video_path:
+        raise PublishError("视频不能为空")
+
+    # 导航到发布页
+    _navigate_to_publish_page(page)
+
+    # 点击"上传视频" TAB
+    _click_publish_tab(page, "上传视频")
+    time.sleep(1)
+
+    # 上传视频
+    _upload_video(page, content.video_path)
+
+    # 提交
+    _submit_publish_video(
+        page,
+        content.title,
+        content.content,
+        content.tags,
+        content.schedule_time,
+        content.visibility,
+    )
+
+
+def _upload_video(page: Page, video_path: str) -> None:
+    """上传视频文件。"""
+    if not os.path.exists(video_path):
+        raise PublishError(f"视频文件不存在: {video_path}")
+
+    # 查找上传输入框
+    selector = UPLOAD_INPUT if page.has_element(UPLOAD_INPUT) else FILE_INPUT
+    page.set_file_input(selector, [video_path])
+
+    # 等待发布按钮可点击（视频处理完成）
+    _wait_for_publish_button_clickable(page)
+    logger.info("视频上传/处理完成")
+
+
+def _wait_for_publish_button_clickable(page: Page) -> None:
+    """等待发布按钮可点击（视频处理可能需要较长时间）。"""
+    max_wait = 600.0  # 10 分钟
+    start = time.monotonic()
+
+    logger.info("开始等待发布按钮可点击(视频)")
+
+    while time.monotonic() - start < max_wait:
+        clickable = page.evaluate(
+            f"""
+            (() => {{
+                const btn = document.querySelector({_js_str(PUBLISH_BUTTON)});
+                if (!btn) return false;
+                const rect = btn.getBoundingClientRect();
+                if (rect.width === 0 || rect.height === 0) return false;
+                if (btn.disabled) return false;
+                if (btn.classList.contains('disabled')) return false;
+                return true;
+            }})()
+            """
+        )
+        if clickable:
+            return
+        time.sleep(1)
+
+    raise UploadTimeoutError("等待发布按钮可点击超时(10分钟)")
+
+
+def _submit_publish_video(
+    page: Page,
+    title: str,
+    content: str,
+    tags: list[str],
+    schedule_time: str | None,
+    visibility: str,
+) -> None:
+    """填写视频表单并提交。"""
+    # 标题
+    page.input_text(TITLE_INPUT, title)
+    time.sleep(1)
+
+    # 正文 + 标签
+    content_selector = _find_content_element(page)
+    page.input_content_editable(content_selector, content)
+
+    # 回点标题
+    time.sleep(1)
+    page.click_element(TITLE_INPUT)
+
+    if tags:
+        _input_tags(page, content_selector, tags)
+    time.sleep(1)
+
+    # 定时发布
+    if schedule_time:
+        _set_schedule_publish(page, schedule_time)
+
+    # 可见范围
+    _set_visibility(page, visibility)
+
+    # 等待发布按钮可点击
+    _wait_for_publish_button_clickable(page)
+
+    # 点击发布
+    page.click_element(PUBLISH_BUTTON)
+    time.sleep(3)
+    logger.info("视频发布完成")
+
+
+def _js_str(s: str) -> str:
+    """将 Python 字符串转为 JS 字面量。"""
+    import json
+
+    return json.dumps(s)
--- a/scripts/xhs/search.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/search.py 0 → 100644
View file @b8ec00a
+"""搜索 Feeds，对应 Go xiaohongshu/search.go。"""
+
+from __future__ import annotations
+
+import json
+import logging
+import time
+
+from .cdp import Page
+from .errors import NoFeedsError
+from .selectors import FILTER_BUTTON, FILTER_PANEL
+from .types import Feed, FilterOption
+from .urls import make_search_url
+
+logger = logging.getLogger(__name__)
+
+# 筛选选项映射表：{筛选组索引: [(标签索引, 文本), ...]}
+_FILTER_OPTIONS: dict[int, list[tuple[int, str]]] = {
+    1: [(1, "综合"), (2, "最新"), (3, "最多点赞"), (4, "最多评论"), (5, "最多收藏")],
+    2: [(1, "不限"), (2, "视频"), (3, "图文")],
+    3: [(1, "不限"), (2, "一天内"), (3, "一周内"), (4, "半年内")],
+    4: [(1, "不限"), (2, "已看过"), (3, "未看过"), (4, "已关注")],
+    5: [(1, "不限"), (2, "同城"), (3, "附近")],
+}
+
+# 从 __INITIAL_STATE__ 提取搜索结果的 JS
+_EXTRACT_SEARCH_JS = """
+(() => {
+    if (window.__INITIAL_STATE__ &&
+        window.__INITIAL_STATE__.search &&
+        window.__INITIAL_STATE__.search.feeds) {
+        const feeds = window.__INITIAL_STATE__.search.feeds;
+        const feedsData = feeds.value !== undefined ? feeds.value : feeds._value;
+        if (feedsData) {
+            return JSON.stringify(feedsData);
+        }
+    }
+    return "";
+})()
+"""
+
+
+def _find_internal_option(group_index: int, text: str) -> tuple[int, int]:
+    """查找内部筛选选项索引。
+
+    Returns:
+        (filters_index, tags_index)
+
+    Raises:
+        ValueError: 未找到匹配的选项。
+    """
+    options = _FILTER_OPTIONS.get(group_index)
+    if not options:
+        raise ValueError(f"筛选组 {group_index} 不存在")
+
+    for tags_index, option_text in options:
+        if option_text == text:
+            return group_index, tags_index
+
+    valid = [t for _, t in options]
+    raise ValueError(f"在筛选组 {group_index} 中未找到 '{text}'，有效值: {valid}")
+
+
+def _convert_filters(filter_opt: FilterOption) -> list[tuple[int, int]]:
+    """将 FilterOption 转换为内部 (filters_index, tags_index) 列表。"""
+    result: list[tuple[int, int]] = []
+
+    if filter_opt.sort_by:
+        result.append(_find_internal_option(1, filter_opt.sort_by))
+    if filter_opt.note_type:
+        result.append(_find_internal_option(2, filter_opt.note_type))
+    if filter_opt.publish_time:
+        result.append(_find_internal_option(3, filter_opt.publish_time))
+    if filter_opt.search_scope:
+        result.append(_find_internal_option(4, filter_opt.search_scope))
+    if filter_opt.location:
+        result.append(_find_internal_option(5, filter_opt.location))
+
+    return result
+
+
+def search_feeds(
+    page: Page,
+    keyword: str,
+    filter_option: FilterOption | None = None,
+) -> list[Feed]:
+    """搜索 Feeds。
+
+    Args:
+        page: CDP 页面对象。
+        keyword: 搜索关键词。
+        filter_option: 可选筛选条件。
+
+    Raises:
+        NoFeedsError: 没有捕获到搜索结果。
+        ValueError: 筛选选项无效。
+    """
+    search_url = make_search_url(keyword)
+    page.navigate(search_url)
+    page.wait_for_load()
+    page.wait_dom_stable()
+
+    # 等待 __INITIAL_STATE__ 初始化
+    _wait_for_initial_state(page)
+
+    # 应用筛选条件
+    if filter_option:
+        internal_filters = _convert_filters(filter_option)
+        if internal_filters:
+            _apply_filters(page, internal_filters)
+
+    # 提取搜索结果
+    result = page.evaluate(_EXTRACT_SEARCH_JS)
+    if not result:
+        raise NoFeedsError()
+
+    feeds_data = json.loads(result)
+    return [Feed.from_dict(f) for f in feeds_data]
+
+
+def _wait_for_initial_state(page: Page, timeout: float = 10.0) -> None:
+    """等待 __INITIAL_STATE__ 就绪。"""
+    deadline = time.monotonic() + timeout
+    while time.monotonic() < deadline:
+        ready = page.evaluate("window.__INITIAL_STATE__ !== undefined")
+        if ready:
+            return
+        time.sleep(0.5)
+    logger.warning("等待 __INITIAL_STATE__ 超时")
+
+
+def _apply_filters(page: Page, filters: list[tuple[int, int]]) -> None:
+    """应用筛选条件。"""
+    # 悬停筛选按钮
+    page.hover_element(FILTER_BUTTON)
+
+    # 等待筛选面板出现
+    deadline = time.monotonic() + 5.0
+    while time.monotonic() < deadline:
+        if page.has_element(FILTER_PANEL):
+            break
+        time.sleep(0.3)
+
+    # 点击各筛选项
+    for filters_index, tags_index in filters:
+        selector = (
+            f"div.filter-panel div.filters:nth-child({filters_index}) "
+            f"div.tags:nth-child({tags_index})"
+        )
+        page.click_element(selector)
+        time.sleep(0.3)
+
+    # 等待页面更新
+    page.wait_dom_stable()
+    _wait_for_initial_state(page)
--- a/scripts/xhs/selectors.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/selectors.py 0 → 100644
View file @b8ec00a
+"""小红书页面 CSS 选择器常量。"""
+
+# ========== 登录 ==========
+LOGIN_STATUS = ".main-container .user .link-wrapper .channel"
+QRCODE_IMG = ".login-container .qrcode-img"
+
+# ========== 首页 / 搜索 ==========
+FILTER_BUTTON = "div.filter"
+FILTER_PANEL = "div.filter-panel"
+
+# ========== Feed 详情 ==========
+COMMENTS_CONTAINER = ".comments-container"
+PARENT_COMMENT = ".parent-comment"
+NO_COMMENTS_TEXT = ".no-comments-text"
+END_CONTAINER = ".end-container"
+TOTAL_COMMENT = ".comments-container .total"
+SHOW_MORE_BUTTON = ".show-more"
+NOTE_SCROLLER = ".note-scroller"
+INTERACTION_CONTAINER = ".interaction-container"
+
+# 页面不可访问容器
+ACCESS_ERROR_WRAPPER = ".access-wrapper, .error-wrapper, .not-found-wrapper, .blocked-wrapper"
+
+# ========== 评论输入 ==========
+COMMENT_INPUT_TRIGGER = "div.input-box div.content-edit span"
+COMMENT_INPUT_FIELD = "div.input-box div.content-edit p.content-input"
+COMMENT_SUBMIT_BUTTON = "div.bottom button.submit"
+REPLY_BUTTON = ".right .interactions .reply"
+
+# ========== 点赞 / 收藏 ==========
+LIKE_BUTTON = ".interact-container .left .like-lottie"
+COLLECT_BUTTON = ".interact-container .left .reds-icon.collect-icon"
+
+# ========== 发布页 ==========
+UPLOAD_CONTENT = "div.upload-content"
+CREATOR_TAB = "div.creator-tab"
+UPLOAD_INPUT = ".upload-input"
+FILE_INPUT = 'input[type="file"]'
+TITLE_INPUT = "div.d-input input"
+CONTENT_EDITOR = "div.ql-editor"
+IMAGE_PREVIEW = ".img-preview-area .pr"
+PUBLISH_BUTTON = ".publish-page-publish-btn button.bg-red"
+
+# 标题/正文长度校验
+TITLE_MAX_SUFFIX = "div.title-container div.max_suffix"
+CONTENT_LENGTH_ERROR = "div.edit-container div.length-error"
+
+# 可见范围
+VISIBILITY_DROPDOWN = "div.permission-card-wrapper div.d-select-content"
+VISIBILITY_OPTIONS = "div.d-options-wrapper div.d-grid-item div.custom-option"
+
+# 定时发布
+SCHEDULE_SWITCH = ".post-time-wrapper .d-switch"
+DATETIME_INPUT = ".date-picker-container input"
+
+# 原创声明
+ORIGINAL_SWITCH_CARD = "div.custom-switch-card"
+ORIGINAL_SWITCH = "div.d-switch"
+
+# 标签联想
+TAG_TOPIC_CONTAINER = "#creator-editor-topic-container"
+TAG_FIRST_ITEM = ".item"
+
+# 弹窗
+POPOVER = "div.d-popover"
+
+# ========== 用户主页 ==========
+SIDEBAR_PROFILE = "div.main-container li.user.side-bar-component a.link-wrapper span.channel"
--- a/scripts/xhs/stealth.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/stealth.py 0 → 100644
View file @b8ec00a
+"""反检测 JS 注入 + Chrome 启动参数，对应 go-rod/stealth。"""
+
+# 反检测 JS 脚本：在页面加载时注入
+STEALTH_JS = """
+(() => {
+    // 1. navigator.webdriver
+    Object.defineProperty(navigator, 'webdriver', {
+        get: () => undefined,
+        configurable: true,
+    });
+
+    // 2. chrome.runtime
+    if (!window.chrome) {
+        window.chrome = {};
+    }
+    if (!window.chrome.runtime) {
+        window.chrome.runtime = {
+            connect: () => {},
+            sendMessage: () => {},
+        };
+    }
+
+    // 3. plugins
+    Object.defineProperty(navigator, 'plugins', {
+        get: () => {
+            return [
+                {
+                    0: {type: 'application/x-google-chrome-pdf'},
+                    description: 'Portable Document Format',
+                    filename: 'internal-pdf-viewer',
+                    length: 1,
+                    name: 'Chrome PDF Plugin',
+                },
+                {
+                    0: {type: 'application/pdf'},
+                    description: '',
+                    filename: 'mhjfbmdgcfjbbpaeojofohoefgiehjai',
+                    length: 1,
+                    name: 'Chrome PDF Viewer',
+                },
+                {
+                    0: {type: 'application/x-nacl'},
+                    description: '',
+                    filename: 'internal-nacl-plugin',
+                    length: 1,
+                    name: 'Native Client',
+                },
+            ];
+        },
+        configurable: true,
+    });
+
+    // 4. languages
+    Object.defineProperty(navigator, 'languages', {
+        get: () => ['zh-CN', 'zh', 'en-US', 'en'],
+        configurable: true,
+    });
+
+    // 5. permissions
+    const originalQuery = window.navigator.permissions?.query;
+    if (originalQuery) {
+        window.navigator.permissions.query = (parameters) =>
+            parameters.name === 'notifications'
+                ? Promise.resolve({ state: Notification.permission })
+                : originalQuery(parameters);
+    }
+
+    // 6. WebGL vendor/renderer
+    const getParameter = WebGLRenderingContext.prototype.getParameter;
+    WebGLRenderingContext.prototype.getParameter = function(parameter) {
+        if (parameter === 37445) return 'Intel Inc.';
+        if (parameter === 37446) return 'Intel Iris OpenGL Engine';
+        return getParameter.call(this, parameter);
+    };
+})();
+"""
+
+# Chrome 启动参数（反检测相关）
+STEALTH_ARGS = [
+    "--disable-blink-features=AutomationControlled",
+    "--disable-infobars",
+    "--no-first-run",
+    "--no-default-browser-check",
+    "--disable-background-timer-throttling",
+    "--disable-backgrounding-occluded-windows",
+    "--disable-renderer-backgrounding",
+    "--disable-component-update",
+]
--- a/scripts/xhs/types.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/types.py 0 → 100644
View file @b8ec00a
+"""小红书数据类型定义，对应 Go types.go。"""
+
+from __future__ import annotations
+
+from dataclasses import dataclass, field
+
+# ========== Feed 列表 ==========
+
+
+@dataclass
+class ImageInfo:
+    image_scene: str = ""
+    url: str = ""
+
+    @classmethod
+    def from_dict(cls, d: dict) -> ImageInfo:
+        return cls(
+            image_scene=d.get("imageScene", ""),
+            url=d.get("url", ""),
+        )
+
+
+@dataclass
+class VideoCapability:
+    duration: int = 0  # 秒
+
+    @classmethod
+    def from_dict(cls, d: dict) -> VideoCapability:
+        return cls(duration=d.get("duration", 0))
+
+
+@dataclass
+class Video:
+    capa: VideoCapability = field(default_factory=VideoCapability)
+
+    @classmethod
+    def from_dict(cls, d: dict) -> Video:
+        return cls(capa=VideoCapability.from_dict(d.get("capa", {})))
+
+
+@dataclass
+class Cover:
+    width: int = 0
+    height: int = 0
+    url: str = ""
+    file_id: str = ""
+    url_pre: str = ""
+    url_default: str = ""
+    info_list: list[ImageInfo] = field(default_factory=list)
+
+    @classmethod
+    def from_dict(cls, d: dict) -> Cover:
+        return cls(
+            width=d.get("width", 0),
+            height=d.get("height", 0),
+            url=d.get("url", ""),
+            file_id=d.get("fileId", ""),
+            url_pre=d.get("urlPre", ""),
+            url_default=d.get("urlDefault", ""),
+            info_list=[ImageInfo.from_dict(i) for i in d.get("infoList", [])],
+        )
+
+
+@dataclass
+class User:
+    user_id: str = ""
+    nickname: str = ""
+    nick_name: str = ""
+    avatar: str = ""
+
+    @classmethod
+    def from_dict(cls, d: dict) -> User:
+        return cls(
+            user_id=d.get("userId", ""),
+            nickname=d.get("nickname", ""),
+            nick_name=d.get("nickName", ""),
+            avatar=d.get("avatar", ""),
+        )
+
+
+@dataclass
+class InteractInfo:
+    liked: bool = False
+    liked_count: str = ""
+    shared_count: str = ""
+    comment_count: str = ""
+    collected_count: str = ""
+    collected: bool = False
+
+    @classmethod
+    def from_dict(cls, d: dict) -> InteractInfo:
+        return cls(
+            liked=d.get("liked", False),
+            liked_count=d.get("likedCount", ""),
+            shared_count=d.get("sharedCount", ""),
+            comment_count=d.get("commentCount", ""),
+            collected_count=d.get("collectedCount", ""),
+            collected=d.get("collected", False),
+        )
+
+
+@dataclass
+class NoteCard:
+    type: str = ""
+    display_title: str = ""
+    user: User = field(default_factory=User)
+    interact_info: InteractInfo = field(default_factory=InteractInfo)
+    cover: Cover = field(default_factory=Cover)
+    video: Video | None = None
+
+    @classmethod
+    def from_dict(cls, d: dict) -> NoteCard:
+        video_data = d.get("video")
+        return cls(
+            type=d.get("type", ""),
+            display_title=d.get("displayTitle", ""),
+            user=User.from_dict(d.get("user", {})),
+            interact_info=InteractInfo.from_dict(d.get("interactInfo", {})),
+            cover=Cover.from_dict(d.get("cover", {})),
+            video=Video.from_dict(video_data) if video_data else None,
+        )
+
+
+@dataclass
+class Feed:
+    xsec_token: str = ""
+    id: str = ""
+    model_type: str = ""
+    note_card: NoteCard = field(default_factory=NoteCard)
+    index: int = 0
+
+    @classmethod
+    def from_dict(cls, d: dict) -> Feed:
+        return cls(
+            xsec_token=d.get("xsecToken", ""),
+            id=d.get("id", ""),
+            model_type=d.get("modelType", ""),
+            note_card=NoteCard.from_dict(d.get("noteCard", {})),
+            index=d.get("index", 0),
+        )
+
+    def to_dict(self) -> dict:
+        """序列化为 JSON 兼容的字典。"""
+        result: dict = {
+            "id": self.id,
+            "xsecToken": self.xsec_token,
+            "modelType": self.model_type,
+            "index": self.index,
+            "displayTitle": self.note_card.display_title,
+            "type": self.note_card.type,
+            "user": {
+                "userId": self.note_card.user.user_id,
+                "nickname": self.note_card.user.nickname or self.note_card.user.nick_name,
+            },
+            "interactInfo": {
+                "likedCount": self.note_card.interact_info.liked_count,
+                "collectedCount": self.note_card.interact_info.collected_count,
+                "commentCount": self.note_card.interact_info.comment_count,
+                "sharedCount": self.note_card.interact_info.shared_count,
+            },
+        }
+        if self.note_card.video:
+            result["video"] = {"duration": self.note_card.video.capa.duration}
+        return result
+
+
+# ========== Feed 详情 ==========
+
+
+@dataclass
+class DetailImageInfo:
+    width: int = 0
+    height: int = 0
+    url_default: str = ""
+    url_pre: str = ""
+    live_photo: bool = False
+
+    @classmethod
+    def from_dict(cls, d: dict) -> DetailImageInfo:
+        return cls(
+            width=d.get("width", 0),
+            height=d.get("height", 0),
+            url_default=d.get("urlDefault", ""),
+            url_pre=d.get("urlPre", ""),
+            live_photo=d.get("livePhoto", False),
+        )
+
+
+@dataclass
+class Comment:
+    id: str = ""
+    note_id: str = ""
+    content: str = ""
+    like_count: str = ""
+    create_time: int = 0
+    ip_location: str = ""
+    liked: bool = False
+    user_info: User = field(default_factory=User)
+    sub_comment_count: str = ""
+    sub_comments: list[Comment] = field(default_factory=list)
+    show_tags: list[str] = field(default_factory=list)
+
+    @classmethod
+    def from_dict(cls, d: dict) -> Comment:
+        return cls(
+            id=d.get("id", ""),
+            note_id=d.get("noteId", ""),
+            content=d.get("content", ""),
+            like_count=d.get("likeCount", ""),
+            create_time=d.get("createTime", 0),
+            ip_location=d.get("ipLocation", ""),
+            liked=d.get("liked", False),
+            user_info=User.from_dict(d.get("userInfo", {})),
+            sub_comment_count=d.get("subCommentCount", ""),
+            sub_comments=[cls.from_dict(c) for c in d.get("subComments", []) or []],
+            show_tags=d.get("showTags", []) or [],
+        )
+
+    def to_dict(self) -> dict:
+        result: dict = {
+            "id": self.id,
+            "content": self.content,
+            "likeCount": self.like_count,
+            "createTime": self.create_time,
+            "ipLocation": self.ip_location,
+            "user": {
+                "userId": self.user_info.user_id,
+                "nickname": self.user_info.nickname or self.user_info.nick_name,
+            },
+            "subCommentCount": self.sub_comment_count,
+        }
+        if self.sub_comments:
+            result["subComments"] = [c.to_dict() for c in self.sub_comments]
+        return result
+
+
+@dataclass
+class CommentList:
+    list_: list[Comment] = field(default_factory=list)
+    cursor: str = ""
+    has_more: bool = False
+
+    @classmethod
+    def from_dict(cls, d: dict) -> CommentList:
+        return cls(
+            list_=[Comment.from_dict(c) for c in d.get("list", []) or []],
+            cursor=d.get("cursor", ""),
+            has_more=d.get("hasMore", False),
+        )
+
+
+@dataclass
+class FeedDetail:
+    note_id: str = ""
+    xsec_token: str = ""
+    title: str = ""
+    desc: str = ""
+    type: str = ""
+    time: int = 0
+    ip_location: str = ""
+    user: User = field(default_factory=User)
+    interact_info: InteractInfo = field(default_factory=InteractInfo)
+    image_list: list[DetailImageInfo] = field(default_factory=list)
+
+    @classmethod
+    def from_dict(cls, d: dict) -> FeedDetail:
+        return cls(
+            note_id=d.get("noteId", ""),
+            xsec_token=d.get("xsecToken", ""),
+            title=d.get("title", ""),
+            desc=d.get("desc", ""),
+            type=d.get("type", ""),
+            time=d.get("time", 0),
+            ip_location=d.get("ipLocation", ""),
+            user=User.from_dict(d.get("user", {})),
+            interact_info=InteractInfo.from_dict(d.get("interactInfo", {})),
+            image_list=[DetailImageInfo.from_dict(i) for i in d.get("imageList", []) or []],
+        )
+
+    def to_dict(self) -> dict:
+        return {
+            "noteId": self.note_id,
+            "title": self.title,
+            "desc": self.desc,
+            "type": self.type,
+            "time": self.time,
+            "ipLocation": self.ip_location,
+            "user": {
+                "userId": self.user.user_id,
+                "nickname": self.user.nickname or self.user.nick_name,
+            },
+            "interactInfo": {
+                "liked": self.interact_info.liked,
+                "likedCount": self.interact_info.liked_count,
+                "collectedCount": self.interact_info.collected_count,
+                "collected": self.interact_info.collected,
+                "commentCount": self.interact_info.comment_count,
+                "sharedCount": self.interact_info.shared_count,
+            },
+            "imageList": [
+                {
+                    "width": img.width,
+                    "height": img.height,
+                    "urlDefault": img.url_default,
+                }
+                for img in self.image_list
+            ],
+        }
+
+
+@dataclass
+class FeedDetailResponse:
+    note: FeedDetail = field(default_factory=FeedDetail)
+    comments: CommentList = field(default_factory=CommentList)
+
+    @classmethod
+    def from_dict(cls, d: dict) -> FeedDetailResponse:
+        return cls(
+            note=FeedDetail.from_dict(d.get("note", {})),
+            comments=CommentList.from_dict(d.get("comments", {})),
+        )
+
+    def to_dict(self) -> dict:
+        return {
+            "note": self.note.to_dict(),
+            "comments": [c.to_dict() for c in self.comments.list_],
+        }
+
+
+# ========== 用户主页 ==========
+
+
+@dataclass
+class UserBasicInfo:
+    gender: int = 0
+    ip_location: str = ""
+    desc: str = ""
+    imageb: str = ""
+    nickname: str = ""
+    images: str = ""
+    red_id: str = ""
+
+    @classmethod
+    def from_dict(cls, d: dict) -> UserBasicInfo:
+        return cls(
+            gender=d.get("gender", 0),
+            ip_location=d.get("ipLocation", ""),
+            desc=d.get("desc", ""),
+            imageb=d.get("imageb", ""),
+            nickname=d.get("nickname", ""),
+            images=d.get("images", ""),
+            red_id=d.get("redId", ""),
+        )
+
+
+@dataclass
+class UserInteraction:
+    type: str = ""
+    name: str = ""
+    count: str = ""
+
+    @classmethod
+    def from_dict(cls, d: dict) -> UserInteraction:
+        return cls(
+            type=d.get("type", ""),
+            name=d.get("name", ""),
+            count=d.get("count", ""),
+        )
+
+
+@dataclass
+class UserProfileResponse:
+    user_basic_info: UserBasicInfo = field(default_factory=UserBasicInfo)
+    interactions: list[UserInteraction] = field(default_factory=list)
+    feeds: list[Feed] = field(default_factory=list)
+
+    def to_dict(self) -> dict:
+        return {
+            "basicInfo": {
+                "nickname": self.user_basic_info.nickname,
+                "redId": self.user_basic_info.red_id,
+                "desc": self.user_basic_info.desc,
+                "gender": self.user_basic_info.gender,
+                "ipLocation": self.user_basic_info.ip_location,
+            },
+            "interactions": [
+                {"type": i.type, "name": i.name, "count": i.count} for i in self.interactions
+            ],
+            "feeds": [f.to_dict() for f in self.feeds],
+        }
+
+
+# ========== 搜索 ==========
+
+
+@dataclass
+class FilterOption:
+    """搜索筛选选项。"""
+
+    sort_by: str = ""  # 综合|最新|最多点赞|最多评论|最多收藏
+    note_type: str = ""  # 不限|视频|图文
+    publish_time: str = ""  # 不限|一天内|一周内|半年内
+    search_scope: str = ""  # 不限|已看过|未看过|已关注
+    location: str = ""  # 不限|同城|附近
+
+
+# ========== 发布 ==========
+
+
+@dataclass
+class PublishImageContent:
+    """图文发布内容。"""
+
+    title: str = ""
+    content: str = ""
+    tags: list[str] = field(default_factory=list)
+    image_paths: list[str] = field(default_factory=list)
+    schedule_time: str | None = None  # ISO8601 格式，None 表示立即发布
+    is_original: bool = False
+    visibility: str = ""  # 公开可见(默认)|仅自己可见|仅互关好友可见
+
+
+@dataclass
+class PublishVideoContent:
+    """视频发布内容。"""
+
+    title: str = ""
+    content: str = ""
+    tags: list[str] = field(default_factory=list)
+    video_path: str = ""
+    schedule_time: str | None = None  # ISO8601 格式
+    visibility: str = ""  # 公开可见(默认)|仅自己可见|仅互关好友可见
+
+
+# ========== 互动 ==========
+
+
+@dataclass
+class ActionResult:
+    """通用动作响应（点赞/收藏等）。"""
+
+    feed_id: str = ""
+    success: bool = False
+    message: str = ""
+
+    def to_dict(self) -> dict:
+        return {
+            "feed_id": self.feed_id,
+            "success": self.success,
+            "message": self.message,
+        }
+
+
+# ========== 评论加载配置 ==========
+
+
+@dataclass
+class CommentLoadConfig:
+    """评论加载配置。"""
+
+    click_more_replies: bool = False
+    max_replies_threshold: int = 10
+    max_comment_items: int = 0  # 0 = 不限
+    scroll_speed: str = "normal"  # slow|normal|fast
--- a/scripts/xhs/urls.py 0 → 100644
View file @b8ec00a
+++ b/scripts/xhs/urls.py 0 → 100644
View file @b8ec00a
+"""小红书 URL 常量和构建函数。"""
+
+from urllib.parse import urlencode
+
+# 基础页面
+EXPLORE_URL = "https://www.xiaohongshu.com/explore"
+HOME_URL = "https://www.xiaohongshu.com"
+PUBLISH_URL = "https://creator.xiaohongshu.com/publish/publish?source=official"
+
+
+def make_feed_detail_url(feed_id: str, xsec_token: str) -> str:
+    """构建 feed 详情页 URL。"""
+    return (
+        f"https://www.xiaohongshu.com/explore/{feed_id}?xsec_token={xsec_token}&xsec_source=pc_feed"
+    )
+
+
+def make_search_url(keyword: str) -> str:
+    """构建搜索结果页 URL。"""
+    params = urlencode({"keyword": keyword, "source": "web_explore_feed"})
+    return f"https://www.xiaohongshu.com/search_result?{params}"
+
+
+def make_user_profile_url(user_id: str, xsec_token: str) -> str:
+    """构建用户主页 URL。"""
+    return (
+        f"https://www.xiaohongshu.com/user/profile/{user_id}"
+        f"?xsec_token={xsec_token}&xsec_source=pc_note"
+    )