Examples
Basic Examples (1)
Obtain the benchmark data:
Visualizations (1)
Display a bar chart with the top 10 models:
Analysis (4)
Get the top three models by code generation correctness:
Select all models from Meta:
Select the top model for each vendor:
Sort the vendors by their average model score on generating valid Wolfram Language syntax:
External Links
Bibliographic Citation
Wolfram Research,
"LLMBenchmarks Data"
from the Wolfram Data Repository
(2024)
Data Resource History
Publisher Information