Posted on 2024-04-07 Views: Views: Word count in article: 844 Reading time ≈ 3 mins.

Poster: FrameBlending in LLM

BackGround

Problem

For the current Large Language Model, it can be used to solve the most general problems. But when it comes to the integration of two specific different themes, sometimes the new content generated is a pile of vocabulary.

The Large Language Model is like a huge library that contains books and knowledge on various topics. How to find the needed topics in this library and integrate these contents together, rather than simply piling up information, and ensure their consistency in style and narrative.

The existing problems are as follows:

The generated content is not smooth and natural enough.
The generated text, although smooth and natural, is not true to the source of the content (intrinsic problem) and/or is uncertain (extrinsic problem). This is usually caused by insufficient information given by the context.

Approches

Increase the granularity through fine tunning, and select a more suitable connection word as the fusion point for fusion.

1.1 Find the connected keywords of these two different frames

1.2 Find connected words based on keywords in different frames

1.3 Generate different content based on two frames based on connected words

1.4 Integrate different contents generated based on connected words

Data Sources: AG’s corpus of news articles

Link: http://groups.di.unipi.it/~gulli/AG_corpus_of_news_articles.html

目前建议数据库主要有10主题

分别是

computers, fantasy, kitchen, legal, military, politics, positive_words, religion, science, space.

我们可以输入两个关于上面10个frame topic 里面的两个进入融合，然后对比

利用现有的大语言生成模型，通过插入额外的模型 or 代码来调整生成的内容。

引入额外的基础模型来强化基础语言模型，改善生成文本的质量。

这个引入的基础模型的功能：

改善主题一致性

找到两个不同frame之间关联词，在关联词的基础上进行生成。
提升文本的连贯性

明确输入的大纲，使用连接词和过渡词为LLM提供足够的上下文。
对 LLM的生成内容进行方向性调整

对于LLM来说，即便是输入内容的一点改进，也会导致输出内容变化，我们可以利用这一点，尝试让LLM生成特定方向的内容，并根据生成的内容改进调试LLM的输入参数，从而让LLM的输出内容更接近我们想要的结果。

OutComes

show cases

Figure 1: An example of the Frame Blending System automatically suggesting Frame blends for the inputs of Jurassic Park and Sport Shoes. The system first expands both inputs into associations, then finds connections between the associations.

Conclusions

我的目前通过在大语言模型前面增加新的框架，这个框架允许人类更进一步控制所要生成的故事主题。这个框架会和大语言生成模型一起工作，通过控制生成志向特定的方向，同时保证生成句子的流畅性。

设计一套系统，这套系统可以让人们控制主题的改变范围，或者选择不同的侧重点来生成新的故事。系统的重点在于支持不同颗粒度级别的不同主题。目前的系统仅支持四个主题，虽然可以证明该系统的可行性，但是，如何将其优化为适用于多个主题，并且广泛的应用还需要进一步的研究。

系统例子

未来提升空间

增加反馈机制，根据用户 or 自动系统对生成的文本进行评分，然后基于这些分数进一步调整 LLM的输入策略。
优化数据库，使用更高质量，更丰富的数据库来作为调整LLM的指标。