Tag: artificial-intelligence

  • Unlocking Smarter AI Reasoning with Automatic Chain-of-Thought Prompting

    Recently, I came across a fascinating paper titled Automatic Chain-of-Thought Prompting. You can read it here. The work explores how large language models can be guided to reason step by step automatically, without requiring manually crafted examples. This is an exciting development because it makes structured reasoning more accessible and scalable, moving us closer to AI systems that can solve complex problems with minimal human intervention.

    There are two common CoT paradigms. One uses a short trigger phrase (for example, “Let’s think step by step”) to nudge the model into a stepwise reasoning mode before it answers a question. The other uses a few manual demonstrations, each composed of a question and a reasoning chain that leads to an answer. The former is easy to use, but the latter tends to be more effective, though it requires manual effort to craft high-quality examples. To eliminate this manual effort, the LLM itself can be used to generate the reasoning chains; however, these generated chains often contain mistakes. The paper’s proposed technique, Auto-CoT, mitigates this issue.
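
    To make the distinction concrete, here is a minimal sketch of the two prompt styles. The helper names and the demonstration formatting are illustrative assumptions, not the paper’s exact prompts, and the actual LLM call is left out.

    ```python
    # Minimal sketch of the two CoT prompting paradigms.
    # The helper names and demonstration format are illustrative assumptions.

    def zero_shot_cot_prompt(question: str) -> str:
        # Paradigm 1: a single trigger phrase nudges the model into stepwise reasoning.
        return f"Q: {question}\nA: Let's think step by step."

    def few_shot_cot_prompt(question: str, demos: list[tuple[str, str, str]]) -> str:
        # Paradigm 2: a few demonstrations, each a (question, reasoning chain, answer)
        # triple, followed by the target question the model should answer.
        blocks = [
            f"Q: {q}\nA: {rationale} The answer is {answer}."
            for q, rationale, answer in demos
        ]
        blocks.append(f"Q: {question}\nA:")
        return "\n\n".join(blocks)
    ```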

    Auto-CoT consists of two main stages (both are sketched in code after their descriptions):

    Question Clustering: A question bank is sampled that contains a diverse set of questions, each with a single correct answer. Each question is converted into an encoding using Sentence-BERT. The question representations are then processed by the k-means clustering algorithm to produce k clusters. For each cluster i, the questions are sorted into a list in ascending order of their distance from the cluster center.

    Demonstration Sampling: A representative question is selected from each cluster, and its reasoning chain is generated using Zero-Shot-CoT with simple heuristics. For example, a heuristic might prioritize shorter questions with shorter thought chains. Once this step is finished, there will be k constructed demonstrations. Each demonstration is a tuple consisting of a question, its reasoning chain, and the corresponding answer. These constructed demonstrations are then used for in-context learning and fed to LLMs to obtain reasoning chains with answers for a given question.
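
    Below is a rough end-to-end sketch of both stages, assuming the sentence-transformers and scikit-learn libraries for the Sentence-BERT encoding and k-means clustering. The `llm` callable, the specific encoder checkpoint, and the heuristic thresholds are illustrative assumptions rather than details taken from the paper.

    ```python
    # Sketch of Auto-CoT's two stages: question clustering and demonstration sampling.
    # Assumptions: `llm` is any text-completion callable; the encoder checkpoint and
    # the heuristic thresholds (question length, number of steps) are illustrative.

    import numpy as np
    from sentence_transformers import SentenceTransformer  # Sentence-BERT encoder
    from sklearn.cluster import KMeans

    def build_demonstrations(questions: list[str], llm, k: int = 8):
        # Stage 1: encode every question and cluster the embeddings into k groups.
        encoder = SentenceTransformer("all-MiniLM-L6-v2")
        embeddings = encoder.encode(questions)
        km = KMeans(n_clusters=k, random_state=0).fit(embeddings)

        # Within each cluster, sort questions by ascending distance to the centroid.
        clusters = {i: [] for i in range(k)}
        for idx, label in enumerate(km.labels_):
            dist = np.linalg.norm(embeddings[idx] - km.cluster_centers_[label])
            clusters[label].append((dist, questions[idx]))
        for members in clusters.values():
            members.sort(key=lambda pair: pair[0])

        # Stage 2: walk each cluster from the centroid outward and keep the first
        # question whose Zero-Shot-CoT rationale passes the simple heuristics.
        demos = []
        for label in range(k):
            for _, question in clusters[label]:
                # Generate a reasoning chain with Zero-Shot-CoT.
                rationale = llm(f"Q: {question}\nA: Let's think step by step.")
                # Heuristic filter: prefer short questions with short reasoning
                # chains (word and sentence counts are rough proxies here).
                if len(question.split()) > 60 or rationale.count(".") > 5:
                    continue
                # Extract a final answer from the generated rationale.
                answer = llm(
                    f"Q: {question}\nA: Let's think step by step. {rationale}\n"
                    "Therefore, the answer is"
                )
                demos.append((question, rationale, answer))  # (q, chain, answer)
                break  # keep one demonstration per cluster
        return demos
    ```

    The k returned demonstrations can then be prepended to a new target question for in-context learning, in the same format as the few-shot prompt sketched earlier.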


    Points to Keep in Mind:

    1. With the rapid development of foundation models, CoT or its variants may not be necessary in the future.
    2. Auto-CoT adds maintenance overhead because of its many moving parts (encoder, clustering, chain generation).
    3. This clustering-based sampling method can be considered diversity-based, in sharp contrast to similarity-based retrieval. If we treat each demonstration as a kind of skill, diverse demonstrations seem to cover more of the alternative skills needed to solve target questions.
    4. Auto-CoT may not work in situations where the required logic is not sequential.
    5. This technique may be more helpful for smaller, less powerful, domain-specific models.