About
Blog
Pills
Software
Trainings
Seminar
⚲
Search results
Reference
Chain of Thought Imitation with Procedure Cloning
,
Mengjiao (Sherry) Yang, Dale Schuurmans, Pieter Abbeel, Ofir Nachum.
Advances in Neural Information Processing Systems
(2022)
Publication
Code
Content citing this item
Pill
Reasoning Traces as Learning Signal
An important feature of large language models is their ability to provide detailed responses that resemble “thinking step by …
Pill
Augmented Language Models: a survey
A survey of recent advances in augmenting (large) language models with new capabilities such as reasoning, tool use, and more. While the …
All works referenced in our site...