<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>论文 on Peng Tan's AI Blog</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/tags/%E8%AE%BA%E6%96%87/</link><description>一个关注 AI 各领域的专题博客</description><atom:link href="https://c44db530.hobbytp-github-io.pages.dev/zh/tags/%E8%AE%BA%E6%96%87/index.xml" rel="self" type="application/rss+xml"/><item><title>DeepSeek-OCR：重塑AI长文本处理</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_ocr/</link><pubDate>Tue, 21 Oct 2025 20:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_ocr/</guid><description>本文介绍了DeepSeek-OCR，一种革命性的AI模型，能够将长文本处理效率提升数十倍，从而实现对超长文档的快速处理。</description></item><item><title>ERNIE 4.5 技术报告解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/baidu/ernie4.5_open_now/</link><pubDate>Mon, 30 Jun 2025 22:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/baidu/ernie4.5_open_now/</guid><description>本文介绍了百度开源的ERNIE 4.5模型，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>Reinforced Self-play Reasoning with Zero Data 论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/training/reinforced_selfplay_reasoning_w_zero_data/</link><pubDate>Sun, 11 May 2025 20:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/training/reinforced_selfplay_reasoning_w_zero_data/</guid><description>论文介绍了强化自博弈推理的零数据范式，通过自博弈生成任务和验证，实现无需依赖人工标注数据或预设任务的自主学习推理。</description></item><item><title>多智能体强化学习（MARL）在多智能体系统（MAS）中的应用：理论、算法、应用与展望</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/mas/mas_reinforcement/</link><pubDate>Sat, 26 Apr 2025 20:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/mas/mas_reinforcement/</guid><description>本文介绍了多智能体强化学习（MARL）在多智能体系统（MAS）中的应用：理论、算法、应用与展望。</description></item><item><title>Cursor AI 最佳实践：提升编码效率与代码质量的权威指南</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/products/cursor/</link><pubDate>Sat, 12 Apr 2025 21:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/products/cursor/</guid><description>Cursor AI 最佳实践：提升编码效率与代码质量的权威指南</description></item><item><title>Chain of Draft 论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/papers/cod-chain-of-draft/</link><pubDate>Sat, 01 Mar 2025 20:00:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/papers/cod-chain-of-draft/</guid><description>本文介绍了Chain of Draft（CoD）论文，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>Test-Time Scaling 相关论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/training/test_time_scaling/</link><pubDate>Wed, 19 Feb 2025 16:40:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/training/test_time_scaling/</guid><description>本文介绍了Test-Time Scaling（测试时扩展）的概念，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>DeepSeek V3 论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_v3/</link><pubDate>Fri, 14 Feb 2025 18:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_v3/</guid><description>本文介绍了深度求索（DeepSeek）公司推出的新一代推理模型DeepSeek-V3，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>DeepSeek 微调</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek-finetuning/</link><pubDate>Fri, 14 Feb 2025 18:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek-finetuning/</guid><description>本文介绍了如何使用合成推理数据集微调DeepSeek-R1模型.</description></item><item><title>Simple Test-Time Scaling 论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/training/s1_simple_testtimescaling/</link><pubDate>Mon, 10 Feb 2025 21:36:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/training/s1_simple_testtimescaling/</guid><description>本文介绍了来自李飞飞团队的Simple Test-Time Scaling论文，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>DeepSeek R1 论文解读</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_r1/</link><pubDate>Mon, 10 Feb 2025 20:10:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/deepseek/deepseek_r1/</guid><description>本文介绍了深度求索（DeepSeek）公司推出的新一代推理模型DeepSeek-R1，并对其技术原理、主要贡献、论文方法、评估结果和局限性进行了详细解读。</description></item><item><title>Pangu Deep Dive - 论文深度解析</title><link>https://c44db530.hobbytp-github-io.pages.dev/zh/papers/pangu_deepdive/</link><pubDate>Mon, 01 Jan 2024 00:00:00 +0800</pubDate><guid>https://c44db530.hobbytp-github-io.pages.dev/zh/papers/pangu_deepdive/</guid><description>Pangu相关论文的深度解析和资源链接</description></item></channel></rss>