[TOC]

  1. Title: Evaluating Large Language Models Trained on Code
  2. Author: Mark Chen et. al. OPENAI
  3. Publish Year: 14 Jul 2021
  4. Review Date: Mon, Oct 16, 2023
  5. url: https://arxiv.org/pdf/2107.03374.pdf

Summary of paper

Motivation

Contribution

limitation

Some key terms

HumanEval

preliminary test

evaluation framework

functional correctness

Methods

observation

supervised fine-tuning

image-20231016114211216

image-20231016114728663

multiple samples generation and ranking

image-20231016111234421

**back-translation to pick the sample **

image-20231016120022927

image-20231016120037943