The Era of Experience Paper

rw-book-cover

Metadata

Highlights

  • Artificial intelligence (AI) has made remarkable strides over recent years by training on massive amounts of human-generated data and fine-tuning with expert human examples and preferences. (View Highlight)
  • In key domains such as mathematics, coding, and science, the knowledge extracted from human data is rapidly approaching a limit (View Highlight)
  • To progress significantly further, a new source of data is required. (View Highlight)
  • This can be achieved by allowing agents to learn continually from their own experience, i.e., data that is generated by the agent interacting with its environment (View Highlight)
  • AlphaProof [20] recently became the first program to achieve a medal in the International Mathematical Olympiad, eclipsing the performance of human-centric approaches [27, 19]. (View Highlight)