AI Leaderboard

The Word of Lore leaderboard and ratings are based on AI Field Trials, head-to-head face-offs between AI technologies that solve real-world problems.

Global Standings

Rank Name Rating RD W L D Face-offs
1 Claude AI 2069 87 10 3 11 24
2 Claude 3.5 Sonnet 1961 111 6 0 6 12
3 ContentDetector.AI 1885 228 3 0 0 3
4 QuillBot AI Detector 1885 228 3 0 0 3
5 GPT-4o 1860 91 7 6 9 22
6 Mistral NeMo 1842 195 4 1 0 5
7 Mistral Le Chat 1795 113 8 3 5 16
8 Cohere Chat 1760 133 4 1 5 10
9 Command R+ 1760 133 4 1 5 10
10 Llama 3.3 70B 1756 137 3 0 3 6
11 DuckDuckGo AI Chat 1710 89 8 8 4 20
12 Gemini Chat 1690 117 9 4 4 17
13 You.com Chat 1651 173 2 1 2 5
14 You.com Smart 1651 173 2 1 2 5
15 Gemini 2.0 Flash Experimental 1608 182 5 0 1 6
16 Claude 3.5 Haiku 1580 135 4 0 2 6
17 Perplexity.ai Chat 1553 121 4 3 3 10
18 Perplexity AI Companion 1552 150 3 1 1 5
19 Perplexity.ai Pro 1552 150 3 1 1 5
20 Perplexity.ai Quick Search 1549 121 4 3 3 10
21 Mixtral 8x7B Instruct 1548 135 4 4 3 11
22 Gemini Advanced 2.0 Experimental 1545 146 3 0 3 6
23 OpenAI o1 mini 1526 137 2 0 4 6
24 xAI Grok 2 1524 146 2 2 1 5
25 Claude 3 Haiku 1514 150 2 2 1 5
26 Gemini 1.5 Flash 1510 182 3 2 1 6
27 Command R 1500 182 0 0 6 6
28 Llama 3 70B 1500 182 0 0 6 6
29 Mistral Large 2 1477 126 4 2 5 11
30 Reader Ghostreader 1461 132 4 4 2 10
31 Writer 1460 146 2 3 1 6
32 Gemini 1.5 Pro 1453 146 0 3 3 6
33 OpenAI o1 1448 103 2 7 8 17
34 Claude 3 Opus 1443 182 0 3 3 6
35 ChatGPT 1318 72 8 15 23 46
36 Microsoft Copilot 1298 147 1 3 1 5
37 Llama 3 8B 1249 182 1 3 2 6
38 Rytr 1218 141 0 4 2 6
39 Llama 3.1 70B 1176 121 3 6 2 11
40 Gemini Advanced 1170 98 4 7 6 17
41 Sapling AI Detector 1114 228 0 3 0 3
42 Writer AI Content Detector 1114 228 0 3 0 3
43 Meta AI 1075 162 0 5 1 6
44 GPT-4o mini 1059 99 3 8 5 16
45 Grammarly 976 146 0 5 1 6
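
The table does not define its columns, so the following is only a sketch of one way to read a row. It assumes that RD is a rating deviation in the style of Glicko-type rating systems (a standard-deviation-like uncertainty on the rating) and that W, L, and D should sum to the Face-offs count. The names `Standing`, `parse_standing`, and `rating_interval` are hypothetical helpers, not part of the leaderboard.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Standing:
    rank: int
    name: str
    rating: int
    rd: int
    wins: int
    losses: int
    draws: int
    face_offs: int


def parse_standing(line: str) -> Standing:
    """Parse one row: rank, a name that may contain spaces, then six integers."""
    parts = line.split()
    rank = int(parts[0])
    rating, rd, wins, losses, draws, face_offs = (int(p) for p in parts[-6:])
    name = " ".join(parts[1:-6])
    return Standing(rank, name, rating, rd, wins, losses, draws, face_offs)


def rating_interval(s: Standing, z: float = 1.96) -> Tuple[float, float]:
    """Rough interval rating +/- z * RD (assumption: RD behaves like a standard deviation)."""
    return (s.rating - z * s.rd, s.rating + z * s.rd)


row = parse_standing("1 Claude AI 2069 87 10 3 11 24")
assert row.wins + row.losses + row.draws == row.face_offs
low, high = rating_interval(row)
print(f"{row.name}: {low:.0f} to {high:.0f}")  # Claude AI: 1898 to 2240
```

Under that assumption, entries with a large RD and few face-offs (for example an RD of 228 after 3 face-offs) have wide intervals, so their rank is less certain than the rating alone suggests.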

Model Standings

Rank Name Rating RD W L D Face-offs
1 Claude 3.5 Sonnet 1961 111 6 0 6 12
2 GPT-4o 1860 91 7 6 9 22
3 Mistral NeMo 1842 195 4 1 0 5
4 Command R+ 1760 133 4 1 5 10
5 Llama 3.3 70B 1756 137 3 0 3 6
6 Gemini 2.0 Flash Experimental 1608 182 5 0 1 6
7 Claude 3.5 Haiku 1580 135 4 0 2 6
8 Mixtral 8x7B Instruct 1548 135 4 4 3 11
9 Gemini Advanced 2.0 Experimental 1545 146 3 0 3 6
10 OpenAI o1 mini 1526 137 2 0 4 6
11 xAI Grok 2 1524 146 2 2 1 5
12 Claude 3 Haiku 1514 150 2 2 1 5
13 Gemini 1.5 Flash 1510 182 3 2 1 6
14 Command R 1500 182 0 0 6 6
15 Llama 3 70B 1500 182 0 0 6 6
16 Mistral Large 2 1477 126 4 2 5 11
17 Gemini 1.5 Pro 1453 146 0 3 3 6
18 OpenAI o1 1448 103 2 7 8 17
19 Claude 3 Opus 1443 182 0 3 3 6
20 Llama 3 8B 1249 182 1 3 2 6
21 Llama 3.1 70B 1176 121 3 6 2 11
22 GPT-4o mini 1059 99 3 8 5 16

Chat Standings

Rank Name Rating RD W L D Face-offs
1 Claude AI 2069 87 10 3 11 24
2 Mistral Le Chat 1795 113 8 3 5 16
3 Cohere Chat 1760 133 4 1 5 10
4 DuckDuckGo AI Chat 1710 89 8 8 4 20
5 Gemini Chat 1690 117 9 4 4 17
6 You.com Chat 1651 173 2 1 2 5
7 Perplexity.ai Chat 1553 121 4 3 3 10
8 xAI Grok 2 1524 146 2 2 1 5
9 ChatGPT 1318 72 8 15 23 46
10 Microsoft Copilot 1298 147 1 3 1 5
11 Gemini Advanced 1170 98 4 7 6 17
12 Meta AI 1075 162 0 5 1 6

AI Trial Face-Offs

Workflow Name Opponent Name Outcome Rating Period Trial Code Trial Number
ChatGPT & OpenAI o1 xAI Grok 2 0-1 20241231 SUM 9/RELEVANCE
ChatGPT & OpenAI o1 xAI Grok 2 1-1 20241231 SUM 9/CONVENIENCE
ChatGPT & OpenAI o1 xAI Grok 2 1-0 20241231 SUM 9/CONCISENESS
ChatGPT & OpenAI o1 xAI Grok 2 1-0 20241231 SUM 9/COHERENCE
ChatGPT & OpenAI o1 xAI Grok 2 0-1 20241231 SUM 9/ACCURACY
Reader Ghostreader & GPT-4o Microsoft Copilot 1-0 20241231 SUM 8/RELEVANCE
Reader Ghostreader & GPT-4o Microsoft Copilot 1-1 20241231 SUM 8/CONVENIENCE
Reader Ghostreader & GPT-4o Microsoft Copilot 1-0 20241231 SUM 8/CONCISENESS
Reader Ghostreader & GPT-4o Microsoft Copilot 0-1 20241231 SUM 8/COHERENCE
Reader Ghostreader & GPT-4o Microsoft Copilot 1-0 20241231 SUM 8/ACCURACY
Llama 3.3 70B Claude 3 Opus 1-1 20241231 EGW 8/UX
Llama 3.3 70B Claude 3 Opus 1-0 20241231 EGW 8/RELEVANCE
Llama 3.3 70B Claude 3 Opus 1-0 20241231 EGW 8/QUALITY
Llama 3.3 70B Claude 3 Opus 1-1 20241231 EGW 8/CONSISTENCY
Llama 3.3 70B Claude 3 Opus 1-0 20241231 EGW 8/AUTHENTICITY
Llama 3.3 70B Claude 3 Opus 1-1 20241231 EGW 8/ACCURACY
Gemini 2.0 Flash Experimental Grammarly 1-1 20241231 EGW 7/UX
Gemini 2.0 Flash Experimental Grammarly 1-0 20241231 EGW 7/RELEVANCE
Gemini 2.0 Flash Experimental Grammarly 1-0 20241231 EGW 7/QUALITY
Gemini 2.0 Flash Experimental Grammarly 1-0 20241231 EGW 7/CONSISTENCY
Gemini 2.0 Flash Experimental Grammarly 1-0 20241231 EGW 7/AUTHENTICITY
Gemini 2.0 Flash Experimental Grammarly 1-0 20241231 EGW 7/ACCURACY
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-1 20241231 EGW 6/UX
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-0 20241231 EGW 6/RELEVANCE
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-0 20241231 EGW 6/QUALITY
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-0 20241231 EGW 6/CONSISTENCY
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-1 20241231 EGW 6/AUTHENTICITY
Gemini Advanced 2.0 Experimental Gemini Advanced 1.5 Pro 1-1 20241231 EGW 6/ACCURACY
Writer.com Gemini 1.5 Flash 0-1 20241231 EGW 5/UX
Writer.com Gemini 1.5 Flash 1-0 20241231 EGW 5/RELEVANCE
Writer.com Gemini 1.5 Flash 0-1 20241231 EGW 5/QUALITY
Writer.com Gemini 1.5 Flash 0-1 20241231 EGW 5/CONSISTENCY
Writer.com Gemini 1.5 Flash 1-0 20241231 EGW 5/AUTHENTICITY
Writer.com Gemini 1.5 Flash 1-1 20241231 EGW 5/ACCURACY
Mistral AI Le Chat & Mistral Large 2 Rytr 1-0 20241231 EGW 4/UX
Mistral AI Le Chat & Mistral Large 2 Rytr 1-0 20241231 EGW 4/RELEVANCE
Mistral AI Le Chat & Mistral Large 2 Rytr 1-0 20241231 EGW 4/QUALITY
Mistral AI Le Chat & Mistral Large 2 Rytr 1-0 20241231 EGW 4/CONSISTENCY
Mistral AI Le Chat & Mistral Large 2 Rytr 1-1 20241231 EGW 4/AUTHENTICITY
Mistral AI Le Chat & Mistral Large 2 Rytr 1-1 20241231 EGW 4/ACCURACY
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/RELEVANCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-1 20241130 SUM 7/CONVENIENCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 0-1 20241130 SUM 7/CONCISENESS
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/COHERENCE
Perplexity AI Companion Readwise Reader & GPT-4o mini 1-0 20241130 SUM 7/ACCURACY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/RATIONALITY
Cohere Command R+ Mistral Large 2 1-0 20241130 JDG 5/MARGINALITY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/IMPARTIALITY
Cohere Command R+ Mistral Large 2 1-1 20241130 JDG 5/ETHICS
Cohere Command R+ Mistral Large 2 1-0 20241130 JDG 5/CONSISTENCY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/UX
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/RELEVANCE
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-1 20241130 EGW 3/QUALITY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-1 20241130 EGW 3/CONSISTENCY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/AUTHENTICITY
Claude 3.5 Haiku ChatGPT OpenAI o1-preview 1-0 20241130 EGW 3/ACCURACY
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/ROBUSTNESS
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/EXPLAINABILITY
ContentDetector.AI Writer AI Content Detector 1-0 20241130 AGT 2/ACCURACY
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 0-1 20241031 SUM 6/RELEVANCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-0 20241031 SUM 6/CONVENIENCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-0 20241031 SUM 6/CONCISENESS
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 1-1 20241031 SUM 6/COHERENCE
DuckDuckGo AI Chat & GPT-4o mini DuckDuckGo AI Chat & Claude 3 Haiku 0-1 20241031 SUM 6/ACCURACY
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/RELEVANCE
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/CONVENIENCE
Mistral Le Chat & NeMo Gemini Advanced 0-1 20240930 SUM 5/CONCISENESS
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/COHERENCE
Mistral Le Chat & NeMo Gemini Advanced 1-0 20240930 SUM 5/ACCURACY
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/ROBUSTNESS
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/EXPLAINABILITY
QuillBot AI Detector Sapling AI Detector 1-0 20240930 AGT 1/ACCURACY
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/RELEVANCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-1 20240915 SUM 4/CONVENIENCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 0-1 20240915 SUM 4/CONCISENESS
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/COHERENCE
DuckDuckGo AI Chat & Llama 3.1 70B DuckDuckGo AI Chat & Mixtral 8x7B 1-0 20240915 SUM 4/ACCURACY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/TRANSPARENCY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/RATIONALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-0 20240915 JDG 4/MARGINALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/IMPARTIALITY
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/ETHICS
Claude 3.5 Sonnet ChatGPT OpenAI o1 1-1 20240915 JDG 4/CONSISTENCY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/UX
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 0-1 20240915 EGW 2/RELEVANCE
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 0-1 20240915 EGW 2/QUALITY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/CONSISTENCY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/AUTHENTICITY
ChatGPT GPT-4o ChatGPT OpenAI o1-mini 1-1 20240915 EGW 2/ACCURACY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/UX
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/RELEVANCE
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/QUALITY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/QUALITY
Claude 3.5 Sonnet Llama 3.1 70B 1-0 20240915 EGW 1/CONSISTENCY
Claude 3.5 Sonnet Llama 3.1 70B 1-1 20240915 EGW 1/AUTHENTICITY
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/RELEVANCE
ChatGPT GPT-4o Perplexity.ai Pro 1-1 20240830 SUM 3/CONVENIENCE
ChatGPT GPT-4o Perplexity.ai Pro 1-0 20240830 SUM 3/CONCISENESS
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/COHERENCE
ChatGPT GPT-4o Perplexity.ai Pro 0-1 20240830 SUM 3/ACCURACY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-1 20240830 JDG 3/TRANSPARENCY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/RATIONALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 0-1 20240830 JDG 3/MARGINALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/IMPARTIALITY
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-1 20240830 JDG 3/ETHICS
Mixtral 8x7B Instruct Llama 3 8B Instruct 1-0 20240830 JDG 3/CONSISTENCY
You.com Smart Perplexity.ai Quick Search 0-1 20240812 SUM 2/RELEVANCE
You.com Smart Perplexity.ai Quick Search 1-1 20240812 SUM 2/CONVENIENCE
You.com Smart Perplexity.ai Quick Search 1-0 20240812 SUM 2/CONCISENESS
You.com Smart Perplexity.ai Quick Search 1-0 20240812 SUM 2/COHERENCE
You.com Smart Perplexity.ai Quick Search 1-1 20240812 SUM 2/ACCURACY
Cohere Chat Command R+ Gemini Chat 1-0 20240812 SUM 1/RELEVANCE
Cohere Chat Command R+ Gemini Chat 1-1 20240812 SUM 1/CONVENIENCE
Cohere Chat Command R+ Gemini Chat 0-1 20240812 SUM 1/CONCISENESS
Cohere Chat Command R+ Gemini Chat 1-1 20240812 SUM 1/COHERENCE
Cohere Chat Command R+ Gemini Chat 1-0 20240812 SUM 1/ACCURACY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/TRANSPARENCY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/RATIONALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/MARGINALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/IMPARTIALITY
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/ETHICS
Cohere Command R Llama 3 70B 1-1 20240729 JDG 2/CONSISTENCY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/TRANSPARENCY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/RATIONALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/MARGINALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/IMPARTIALITY
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-1 20240729 JDG 1/ETHICS
ChatGPT GPT-4o ChatGPT GPT-4o mini 1-0 20240729 JDG 1/CONSISTENCY
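
The Outcome column is not explained on the page. A minimal sketch of how it appears to map onto the W, L, and D columns, assuming that `1-0` is a win for the workflow, `0-1` is a win for the opponent, and `1-1` is a draw. The function name `classify` is hypothetical; the sample outcomes are the six EGW 8 rows for Llama 3.3 70B.

```python
from collections import Counter


def classify(outcome: str) -> str:
    """Map an outcome string to the workflow's result (assumed encoding)."""
    mapping = {"1-0": "win", "0-1": "loss", "1-1": "draw"}
    # Anything outside the assumed encoding is flagged rather than guessed.
    return mapping.get(outcome, "unknown")


# Outcomes of trial EGW 8 (Llama 3.3 70B vs. Claude 3 Opus), in table order.
outcomes = ["1-1", "1-0", "1-0", "1-1", "1-0", "1-1"]
tally = Counter(classify(o) for o in outcomes)
print(tally["win"], tally["loss"], tally["draw"])  # 3 0 3
```

This tally matches the standings row for Llama 3.3 70B (W 3, L 0, D 3 over 6 face-offs), which supports the assumed encoding without confirming it.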

The ratings are as of December 31, 2024. The current rating period ends on January 31, 2025. For the latest trial results from the current rating period, see AI Field Trial Face-Offs.