
Pelicans on bicycles

I decided to launch my own LLM benchmark: how well can different models render an SVG of a pelican riding a bicycle?

I chose that because a) I like pelicans and b) I’m pretty sure there aren’t any pelican-on-a-bicycle SVG files floating around (yet) that might have been absorbed into the training data.

My prompt:

Generate an SVG of a pelican riding a bicycle

I’ve run it on 16 models so far – from OpenAI, Anthropic, Google Gemini and Meta (Llama running on Cerebras), all using my LLM CLI utility. Here is my bash script, written with Claude’s help: generate-svgs.sh
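
For readers who want to try something similar, here is a minimal Python sketch of the same loop. It is not the generate-svgs.sh script itself (which is bash and not reproduced here), just one way the idea could look. The helper names extract_svg, slugify and generate_svgs are made up for illustration, and run_with_llm assumes the LLM package is installed with API keys configured for whichever models you pass in.

```python
import re
from pathlib import Path
from typing import Callable, Dict, Iterable, Optional

PROMPT = "Generate an SVG of a pelican riding a bicycle"


def extract_svg(text: str) -> Optional[str]:
    """Return the first <svg>...</svg> element in a model response, if any."""
    match = re.search(r"<svg\b.*?</svg>", text, re.DOTALL | re.IGNORECASE)
    return match.group(0) if match else None


def slugify(model_id: str) -> str:
    """Turn a model ID into a safe file name stem (hypothetical naming scheme)."""
    return re.sub(r"[^A-Za-z0-9._-]+", "-", model_id).strip("-")


def generate_svgs(
    model_ids: Iterable[str],
    run_prompt: Callable[[str, str], str],
    out_dir: Path,
) -> Dict[str, Optional[Path]]:
    """Run PROMPT against each model and save any SVG found in the reply.

    run_prompt(model_id, prompt) must return the model's text response.
    Models whose reply contains no SVG map to None.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    results: Dict[str, Optional[Path]] = {}
    for model_id in model_ids:
        svg = extract_svg(run_prompt(model_id, PROMPT))
        if svg is None:
            results[model_id] = None
            continue
        path = out_dir / f"{slugify(model_id)}.svg"
        path.write_text(svg, encoding="utf-8")
        results[model_id] = path
    return results


def run_with_llm(model_id: str, prompt: str) -> str:
    """One possible run_prompt, using the LLM Python API (assumes `pip install llm`)."""
    import llm  # imported lazily so the rest of the sketch works without it

    return llm.get_model(model_id).prompt(prompt).text()
```

Calling generate_svgs(["gpt-4o-mini"], run_with_llm, Path("svgs")) would write svgs/gpt-4o-mini.svg if the model returns an SVG. The model ID is only an example; use whatever IDs your own LLM installation lists.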

Here are Claude 3.5 Sonnet (2024-06-20) and Claude 3.5 Sonnet (2024-10-22):

Gemini 1.5 Flash 001 and Gemini 1.5 Flash 002:

GPT-4o mini and GPT-4o:

o1-mini and o1-preview:

Cerebras Llama 3.1 70B and Llama 3.1 8B:

And a special mention for Gemini 1.5 Flash 8B:

Some of them are linked from the README.

2024-12-16 18:10:51
