Extended Data Table 4 Summary of the best performing models on the MedQA (USMLE) dataset questions with 4 options

From: Large language models encode clinical knowledge

  1. Our results with Flan-PaLM exceed previous state-of-the-art by over 17%.