Extended Data Table 1 Summary of MultiMedQA describing the format, size, and domain of the datasets in the benchmark

From: Large language models encode clinical knowledge