Skip to main content

AI Leaderboard

Word of Lore leaderboard and ratings are curated based on AI Field Trials between AI technologies that solve real-world problems.

Global Standings

Rank Name Rating RD W L D Face-offs
1 Claude AI 2150 91 10 0 8 18
2 Claude 3.5 Sonnet 1961 111 6 0 6 12
3 ContentDetector.AI 1885 228 3 0 0 3
4 QuillBot AI Detector 1885 228 3 0 0 3
5 Mistral NeMo 1842 195 4 1 0 5
6 GPT-4o 1802 96 4 5 8 17
7 Cohere Chat 1760 133 4 1 5 10
8 Command R+ 1760 133 4 1 5 10
9 DuckDuckGo AI Chat 1710 89 8 8 4 20
10 You.com Chat 1651 173 2 1 2 5
11 You.com Smart 1651 173 2 1 2 5
12 Mistral Le Chat 1610 129 4 3 3 10
13 Claude 3.5 Haiku 1580 135 4 0 2 6
14 Perplexity.ai Chat 1553 121 4 3 3 10
15 Perplexity AI Companion 1552 150 3 1 1 5
16 Perplexity.ai Pro 1552 150 3 1 1 5
17 Perplexity.ai Quick Search 1549 121 4 3 3 10
18 Mixtral 8x7B Instruct 1548 135 4 4 3 11
19 OpenAI o1 mini 1526 137 2 0 4 6
20 Claude 3 Haiku 1514 150 2 2 1 5
21 Command R 1500 182 0 0 6 6
22 Llama 3 70B 1500 182 0 0 6 6
23 Mistral Large 2 1450 154 0 2 3 5
24 OpenAI o1 1448 112 0 5 7 12
25 Reader Ghostreader 1446 157 1 3 1 5
26 Gemini Chat 1359 173 1 2 2 5
27 ChatGPT 1317 71 6 13 22 41
28 Llama 3 8B 1249 182 1 3 2 6
29 Llama 3.1 70B 1176 121 3 6 2 11
30 Gemini Advanced 1156 173 1 4 0 5
31 Sapling AI Detector 1114 228 0 3 0 3
32 Writer AI Content Detector 1114 228 0 3 0 3
33 Meta AI 1075 162 0 5 1 6
34 GPT-4o mini 1059 99 3 8 5 16

Model Standings

Rank Name Rating RD W L D Face-offs
1 Claude 3.5 Sonnet 1961 111 6 0 6 12
2 Mistral NeMo 1842 195 4 1 0 5
3 GPT-4o 1802 96 4 5 8 17
4 Command R+ 1760 133 4 1 5 10
5 Claude 3.5 Haiku 1580 135 4 0 2 6
6 Mixtral 8x7B Instruct 1548 135 4 4 3 11
7 OpenAI o1 mini 1526 137 2 0 4 6
8 Claude 3 Haiku 1514 150 2 2 1 5
9 Command R 1500 182 0 0 6 6
10 Llama 3 70B 1500 182 0 0 6 6
11 Mistral Large 2 1450 154 0 2 3 5
12 OpenAI o1 1448 112 0 5 7 12
13 Llama 3 8B 1249 182 1 3 2 6
14 Llama 3.1 70B 1176 121 3 6 2 11
15 GPT-4o mini 1059 99 3 8 5 16

Chat Standings

Rank Name Rating RD W L D Face-offs
1 Claude AI 2150 91 10 0 8 18
2 Cohere Chat 1760 133 4 1 5 10
3 DuckDuckGo AI Chat 1710 89 8 8 4 20
4 You.com Chat 1651 173 2 1 2 5
5 Mistral Le Chat 1610 129 4 3 3 10
6 Perplexity.ai Chat 1553 121 4 3 3 10
7 Gemini Chat 1359 173 1 2 2 5
8 ChatGPT 1317 71 6 13 22 41
9 Gemini Advanced 1156 173 1 4 0 5
10 Meta AI 1075 162 0 5 1 6

AI Trial Face-Offs

Workflow Name Opponent Name Outcome Rating Period Trial Code Trial Number
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/RELEVANCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-1 20241130 SUM 7/CONVENIENCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 0-1 20241130 SUM 7/CONCISENESS
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/COHERENCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/ACCURACY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/RATIONALITY
Cohere Command R+ Mistral Large 2 1-0 20241130 JDG 5/MARGINALITY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/IMPARTIALITY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/ETHICS
Cohere Command R+ Mistral Large 2 1-0 20241130 JDG 5/CONSISTENCY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/UX
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/RELEVANCE
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-1 20241130 EGW 3/QUALITY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-1 20241130 EGW 3/CONSISTENCY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/AUTHENTICITY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/ACCURACY
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/ROBUSTNESS
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/EXPLAINABILITY
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/ACCURACY
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 0-1 20241031 SUM 6/RELEVANCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-0 20241031 SUM 6/CONVENIENCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-0 20241031 SUM 6/CONCISENESS
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-1 20241031 SUM 6/COHERENCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 0-1 20241031 SUM 6/ACCURACY
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/RELEVANCE
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/CONVENIENCE
Mistral Le Chat & NeMo Gemini Advanced 0-1 20240930 SUM 5/CONCISENESS
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/COHERENCE
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/ACCURACY
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/ROBUSTNESS
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/EXPLAINABILITY
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/ACCURACY
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/RELEVANCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-1 20240915 SUM 4/CONVENIENCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 0-1 20240915 SUM 4/CONCISENESS
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/COHERENCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/ACCURACY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/TRANSPARENCY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/RATIONALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-0 20240915 JDG 4/MARGINALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/IMPARTIALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/ETHICS
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/CONSISTENCY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/UX
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 0-1 20240915 EGW 2/RELEVANCE
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 0-1 20240915 EGW 2/QUALITY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/CONSISTENCY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/AUTHENTICITY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/ACCURACY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/UX
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/RELEVANCE
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/QUALITY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/QUALITY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/CONSISTENCY
Claude 3.5 Sonnet Llama 3.1 70B 1-1 20240915 EGW 1/AUTHENTICITY
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/RELEVANCE
ChatGPT GPT-4o Perplexity.ai Pro 1-1 20240830 SUM 3/CONVENIENCE
ChatGPT GPT-4o Perplexity.ai Pro 1-0 20240830 SUM 3/CONCISENESS
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/COHERENCE
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/ACCURACY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-1 20240830 JDG 3/TRANSPARENCY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/RATIONALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 0-0 20240830 JDG 3/MARGINALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/IMPARTIALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-1 20240830 JDG 3/ETHICS
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/CONSISTENCY
You.com Smart Perplexity.ai Quick Search 0-1 20240812 SUM 2/RELEVANCE
You.com Smart Perplexity.ai Quick Search 1-1 20240812 SUM 2/CONVENIENCE
You.com Smart Perplexity.ai Quick Search 1-0 20240812 SUM 2/CONCISENESS
You.com Smart Perplexity.ai Quick Search 1-0 20240812 SUM 2/COHERENCE
You.com Smart Perplexity.ai Quick Search 1-1 20240812 SUM 2/ACCURACY
Cohere Chat Command R+ Gemini Chat 1-0 20240812 SUM 1/RELEVANCE
Cohere Chat Command R+ Gemini Chat 1-1 20240812 SUM 1/CONVENIENCE
Cohere Chat Command R+ Gemini Chat 0-1 20240812 SUM 1/CONCISENESS
Cohere Chat Command R+ Gemini Chat 1-1 20240812 SUM 1/COHERENCE
Cohere Chat Command R+ Gemini Chat 1-0 20240812 SUM 1/ACCURACY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/TRANSPARENCY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/RATIONALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/MARGINALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/IMPARTIALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/ETHICS
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/CONSISTENCY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/TRANSPARENCY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/RATIONALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/MARGINALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/IMPARTIALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/ETHICS
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/CONSISTENCY

The ratings are as of November 30, 2024. The current rating period ends on December 31, 2024. For the latest trial results from the current rating period, see AI Field Trial Face-Offs.