| Workflow Name |
Opponent Name |
Outcome |
Rating Period |
Trial Code |
Trial Number |
| ChatGPT & OpenAI o1 |
xAI Grok 2 |
0-1 |
20241231 |
SUM |
9/RELEVANCE |
| ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-1 |
20241231 |
SUM |
9/CONVENIENCE |
| ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-0 |
20241231 |
SUM |
9/CONCISENESS |
| ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-0 |
20241231 |
SUM |
9/COHERENCE |
| ChatGPT & OpenAI o1 |
xAI Grok 2 |
0-1 |
20241231 |
SUM |
9/ACCURACY |
| Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/RELEVANCE |
| Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-1 |
20241231 |
SUM |
8/CONVENIENCE |
| Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/CONCISENESS |
| Reader Ghostreader & GPT-4o |
Microsoft Copilot |
0-1 |
20241231 |
SUM |
8/COHERENCE |
| Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/ACCURACY |
| Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/UX |
| Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/RELEVANCE |
| Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/QUALITY |
| Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/CONSISTENCY |
| Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/AUTHENTICITY |
| Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/ACCURACY |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-1 |
20241231 |
EGW |
7/UX |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/RELEVANCE |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/QUALITY |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/CONSISTENCY |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/AUTHENTICITY |
| Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/ACCURACY |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/UX |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/RELEVANCE |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/QUALITY |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/CONSISTENCY |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/AUTHENTICITY |
| Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/ACCURACY |
| Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/UX |
| Writer.com |
Gemini 1.5 Flash |
1-0 |
20241231 |
EGW |
5/RELEVANCE |
| Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/QUALITY |
| Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/CONSISTENCY |
| Writer.com |
Gemini 1.5 Flash |
1-0 |
20241231 |
EGW |
5/AUTHENTICITY |
| Writer.com |
Gemini 1.5 Flash |
1-1 |
20241231 |
EGW |
5/ACCURACY |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/UX |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/RELEVANCE |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/QUALITY |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/CONSISTENCY |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-1 |
20241231 |
EGW |
4/AUTHENTICITY |
| Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-1 |
20241231 |
EGW |
4/ACCURACY |
| Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/RELEVANCE |
| Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-1 |
20241130 |
SUM |
7/CONVENIENCE |
| Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
0-1 |
20241130 |
SUM |
7/CONCISENESS |
| Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/COHERENCE |
| Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/ACCURACY |
| Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/RATIONALITY |
| Cohere Command R+ |
Mistral Large 2 |
1-0 |
20241130 |
JDG |
5/MARGINALITY |
| Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/IMPARTIALITY |
| Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/ETHICS |
| Cohere Command R+ |
Mistral Large 2 |
1-0 |
20241130 |
JDG |
5/CONSISTENCY |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/UX |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/RELEVANCE |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-1 |
20241130 |
EGW |
3/QUALITY |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-1 |
20241130 |
EGW |
3/CONSISTENCY |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/AUTHENTICITY |
| Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/ACCURACY |
| ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/ROBUSTNESS |
| ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/EXPLAINABILITY |
| ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/ACCURACY |
| DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
0-1 |
20241031 |
SUM |
6/RELEVANCE |
| DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-0 |
20241031 |
SUM |
6/CONVENIENCE |
| DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-0 |
20241031 |
SUM |
6/CONCISENESS |
| DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-1 |
20241031 |
SUM |
6/COHERENCE |
| DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
0-1 |
20241031 |
SUM |
6/ACCURACY |
| Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/RELEVANCE |
| Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/CONVENIENCE |
| Mistral Le Chat & NeMo |
Gemini Advanced |
0-1 |
20240930 |
SUM |
5/CONCISENESS |
| Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/COHERENCE |
| Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/ACCURACY |
| QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/ROBUSTNESS |
| QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/EXPLAINABILITY |
| QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/ACCURACY |
| DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/RELEVANCE |
| DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-1 |
20240915 |
SUM |
4/CONVENIENCE |
| DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
0-1 |
20240915 |
SUM |
4/CONCISENESS |
| DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/COHERENCE |
| DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/ACCURACY |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/TRANSPARENCY |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/RATIONALITY |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-0 |
20240915 |
JDG |
4/MARGINALITY |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/IMPARTIALITY |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/ETHICS |
| Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/CONSISTENCY |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/UX |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
0-1 |
20240915 |
EGW |
2/RELEVANCE |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
0-1 |
20240915 |
EGW |
2/QUALITY |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/CONSISTENCY |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/AUTHENTICITY |
| ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/ACCURACY |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/UX |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/RELEVANCE |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/QUALITY |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/QUALITY |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/CONSISTENCY |
| Claude 3.5 Sonnet |
Llama 3.1 70B |
1-1 |
20240915 |
EGW |
1/AUTHENTICITY |
| ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/RELEVANCE |
| ChatGPT GPT-4o |
Perplexity.ai Pro |
1-1 |
20240830 |
SUM |
3/CONVENIENCE |
| ChatGPT GPT-4o |
Perplexity.ai Pro |
1-0 |
20240830 |
SUM |
3/CONCISENESS |
| ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/COHERENCE |
| ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/ACCURACY |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-1 |
20240830 |
JDG |
3/TRANSPARENCY |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/RATIONALITY |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
0-0 |
20240830 |
JDG |
3/MARGINALITY |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/IMPARTIALITY |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-1 |
20240830 |
JDG |
3/ETHICS |
| Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/CONSISTENCY |
| You.com Smart |
Perplexity.ai Quick Search |
0-1 |
20240812 |
SUM |
2/RELEVANCE |
| You.com Smart |
Perplexity.ai Quick Search |
1-1 |
20240812 |
SUM |
2/CONVENIENCE |
| You.com Smart |
Perplexity.ai Quick Search |
1-0 |
20240812 |
SUM |
2/CONCISENESS |
| You.com Smart |
Perplexity.ai Quick Search |
1-0 |
20240812 |
SUM |
2/COHERENCE |
| You.com Smart |
Perplexity.ai Quick Search |
1-1 |
20240812 |
SUM |
2/ACCURACY |
| Cohere Chat Command R+ |
Gemini Chat |
1-0 |
20240812 |
SUM |
1/RELEVANCE |
| Cohere Chat Command R+ |
Gemini Chat |
1-1 |
20240812 |
SUM |
1/CONVENIENCE |
| Cohere Chat Command R+ |
Gemini Chat |
0-1 |
20240812 |
SUM |
1/CONCISENESS |
| Cohere Chat Command R+ |
Gemini Chat |
1-1 |
20240812 |
SUM |
1/COHERENCE |
| Cohere Chat Command R+ |
Gemini Chat |
1-0 |
20240812 |
SUM |
1/ACCURACY |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/TRANSPARENCY |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/RATIONALITY |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/MARGINALITY |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/IMPARTIALITY |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/ETHICS |
| Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/CONSISTENCY |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/TRANSPARENCY |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/RATIONALITY |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/MARGINALITY |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/IMPARTIALITY |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/ETHICS |
| ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/CONSISTENCY |