
LongWriter: LLMs Can Now Generate 20,000-Word Outputs

Published: 2024-09-29 18:51:05 Views: 1967 Author: barry的异想世界

In this article we look at LongWriter, which lets an LLM generate much longer sequences: coherent, high-quality text of up to 20,000 words.

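What is the problem with LLMs?

Why can't existing LLMs generate outputs longer than 2K words?

LongWriter, a study published in August 2024, found that existing pretrained LLMs cannot generate long sequences of more than 2K words. The cause is the limited length of the data provided during the SFT (supervised fine-tuning) stage.

Input length, however, is not the problem. An LLM can accept much longer inputs, for example 100,000 tokens.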

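Model experiment

In the experiment, models were asked to write an article on some topic with a specified word count, such as 10,000 words. Here is an example prompt.

Write a 10000-word article on the history of the Roman Empire

This example prompt was sent to eight LLMs in total: four proprietary and four open-source.

None of them could generate a long sequence of more than 2,000 words.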

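Training experiment

To test the hypothesis suggested by this observation, a specific model was trained on datasets of different lengths.

The GLM-4-9B model was trained with three different length settings (500, 1,000 and 2,000). In each case, the model struggled to generate output longer than the data it was given.

Conclusion from the observations

Because the training dataset lacks longer sequences, the model cannot generate longer sequences.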

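How can we generate longer outputs?

A longer output such as an article usually covers multiple topics and subtopics. A human writer would first outline the ideas and then write the content for each topic and subtopic.

We can do the same thing with an LLM.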

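Step 1: Write the article outline (plan)

First, we ask the LLM to lay out the main points and the content length each one needs. The output must have one topic per line.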

## PROMPT FOR GENERATING THE OUTLINE ##

I need you to help me break down the following long-form writing instruction into multiple
subtasks. Each subtask will guide the writing of one paragraph in the essay, and should include
the main points and word count requirements for that paragraph.
The writing instruction is as follows:
{User Instruction}
Please break it down in the following format, with each subtask taking up one line:
Paragraph 1 - Main Point: [Describe the main point of the paragraph, in detail] - Word Count:
[Word count requirement, e.g., 400 words]
Paragraph 2 - Main Point: [Describe the main point of the paragraph, in detail] - Word Count:
[word count requirement, e.g. 1000 words].
...
Make sure that each subtask is clear and specific, and that all subtasks cover the entire content
of the writing instruction. Do not split the subtasks too finely; each subtask’s paragraph should
be no less than 200 words and no more than 1000 words. Do not output any other content.
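
Because the plan comes back as one subtask per line, it can be split into structured entries before the writing stage. The following is a minimal sketch, assuming the model follows the requested line format closely; `PLAN_LINE`, `Subtask` and `parse_plan` are hypothetical names chosen for illustration and are not part of LongWriter itself.

```python
import re
from dataclasses import dataclass
from typing import List

# Assumed pattern for one plan line, modeled on the format requested in the
# outline prompt: "Paragraph N - Main Point: ... - Word Count: ... words"
PLAN_LINE = re.compile(
    r"^Paragraph\s+(\d+)\s*-\s*Main Point:\s*(.+?)\s*-\s*Word Count:\s*"
    r"(\d[\d,]*)\s*words?\.?\s*$",
    re.IGNORECASE,
)


@dataclass
class Subtask:
    index: int       # paragraph number from the plan
    main_point: str  # description of what the paragraph should cover
    word_count: int  # requested length in words
    raw: str         # the original plan line, reused later as the writing step


def parse_plan(plan_text: str) -> List[Subtask]:
    """Parse the outline into subtasks, skipping lines that do not match."""
    subtasks = []
    for line in plan_text.splitlines():
        line = line.strip()
        match = PLAN_LINE.match(line)
        if match is None:
            continue  # ignore blank lines or stray commentary from the model
        subtasks.append(
            Subtask(
                index=int(match.group(1)),
                main_point=match.group(2),
                word_count=int(match.group(3).replace(",", "")),
                raw=line,
            )
        )
    return subtasks
```

Summing `word_count` over the parsed subtasks gives a quick check of whether the plan roughly matches the length requested in the original instruction.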

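Step 2: Write the content for each topic

In the second stage, we generate the content based on the output of Step 1. At each step, the previously generated content is included in the prompt so that the output stays coherent.

This means the paragraphs cannot be generated in parallel: each one depends on the text written before it.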

## PROMPT FOR WRITING BASED ON THE TOPIC ##

You are an excellent writing assistant. I will give you an original writing instruction and my
planned writing steps. I will also provide you with the text I have already written. Please help
me continue writing the next paragraph based on the writing instruction, writing steps, and the
already written text.
Writing instruction:
{User Instruction}
Writing steps:
{The writing plan generated in Step I}
Already written text:
{Previous generated (n-1) paragraphs}
Please integrate the original writing instruction, writing steps, and the already written text, and
now continue writing {The plan for the n-th paragraph, i.e., the n-th line in the writing plan}
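
The sequential nature of this stage can be sketched as a simple loop. This is a minimal illustration rather than the LongWriter implementation: `generate` is a hypothetical callable standing in for whatever LLM client is used, `write_long_text` is a name invented here, and the template is a shortened form of the prompt above.

```python
from typing import Callable, List

# Shortened form of the writing prompt; the field names are assumptions.
WRITE_PROMPT = (
    "You are an excellent writing assistant.\n"
    "Writing instruction:\n{instruction}\n"
    "Writing steps:\n{plan}\n"
    "Already written text:\n{written}\n"
    "Please integrate the original writing instruction, writing steps, and "
    "the already written text, and now continue writing {step}"
)


def write_long_text(
    instruction: str,
    plan_lines: List[str],
    generate: Callable[[str], str],
) -> str:
    """Write one paragraph per plan line, feeding earlier paragraphs back in."""
    plan = "\n".join(plan_lines)
    paragraphs: List[str] = []
    for step in plan_lines:
        prompt = WRITE_PROMPT.format(
            instruction=instruction,
            plan=plan,
            written="\n\n".join(paragraphs),  # the n-1 paragraphs so far
            step=step,
        )
        # Each call needs the text from all previous calls, so the loop
        # has to run in order and cannot be parallelized.
        paragraphs.append(generate(prompt).strip())
    return "\n\n".join(paragraphs)
```

Passing the model call in as a plain function keeps the loop independent of any particular API, and makes it easy to substitute a stub when testing.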