Workflow Name |
Opponent Name |
Outcome |
Rating Period |
Trial Code |
Trial Number |
ChatGPT & OpenAI o1 |
xAI Grok 2 |
0-1 |
20241231 |
SUM |
9/RELEVANCE |
ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-1 |
20241231 |
SUM |
9/CONVENIENCE |
ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-0 |
20241231 |
SUM |
9/CONCISENESS |
ChatGPT & OpenAI o1 |
xAI Grok 2 |
1-0 |
20241231 |
SUM |
9/COHERENCE |
ChatGPT & OpenAI o1 |
xAI Grok 2 |
0-1 |
20241231 |
SUM |
9/ACCURACY |
Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/RELEVANCE |
Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-1 |
20241231 |
SUM |
8/CONVENIENCE |
Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/CONCISENESS |
Reader Ghostreader & GPT-4o |
Microsoft Copilot |
0-1 |
20241231 |
SUM |
8/COHERENCE |
Reader Ghostreader & GPT-4o |
Microsoft Copilot |
1-0 |
20241231 |
SUM |
8/ACCURACY |
Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/UX |
Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/RELEVANCE |
Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/QUALITY |
Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/CONSISTENCY |
Llama 3.3 70B |
Claude 3 Opus |
1-0 |
20241231 |
EGW |
8/AUTHENTICITY |
Llama 3.3 70B |
Claude 3 Opus |
1-1 |
20241231 |
EGW |
8/ACCURACY |
Gemini 2.0 Flash Experimental |
Grammarly |
1-1 |
20241231 |
EGW |
7/UX |
Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/RELEVANCE |
Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/QUALITY |
Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/CONSISTENCY |
Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/AUTHENTICITY |
Gemini 2.0 Flash Experimental |
Grammarly |
1-0 |
20241231 |
EGW |
7/ACCURACY |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/UX |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/RELEVANCE |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/QUALITY |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-0 |
20241231 |
EGW |
6/CONSISTENCY |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/AUTHENTICITY |
Gemini Advanced 2.0 Experimental |
Gemini Advanced 1.5 Pro |
1-1 |
20241231 |
EGW |
6/ACCURACY |
Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/UX |
Writer.com |
Gemini 1.5 Flash |
1-0 |
20241231 |
EGW |
5/RELEVANCE |
Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/QUALITY |
Writer.com |
Gemini 1.5 Flash |
0-1 |
20241231 |
EGW |
5/CONSISTENCY |
Writer.com |
Gemini 1.5 Flash |
1-0 |
20241231 |
EGW |
5/AUTHENTICITY |
Writer.com |
Gemini 1.5 Flash |
1-1 |
20241231 |
EGW |
5/ACCURACY |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/UX |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/RELEVANCE |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/QUALITY |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-0 |
20241231 |
EGW |
4/CONSISTENCY |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-1 |
20241231 |
EGW |
4/AUTHENTICITY |
Mistral AI Le Chat & Mistral Large 2 |
Rytr |
1-1 |
20241231 |
EGW |
4/ACCURACY |
Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/RELEVANCE |
Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-1 |
20241130 |
SUM |
7/CONVENIENCE |
Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
0-1 |
20241130 |
SUM |
7/CONCISENESS |
Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/COHERENCE |
Perplexity AI Companion |
Readwise Reader & GPT-4o mini |
1-0 |
20241130 |
SUM |
7/ACCURACY |
Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/RATIONALITY |
Cohere Command R+ |
Mistral Large 2 |
1-0 |
20241130 |
JDG |
5/MARGINALITY |
Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/IMPARTIALITY |
Cohere Command R+ |
Mistral Large 2 |
1-1 |
20241130 |
JDG |
5/ETHICS |
Cohere Command R+ |
Mistral Large 2 |
1-0 |
20241130 |
JDG |
5/CONSISTENCY |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/UX |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/RELEVANCE |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-1 |
20241130 |
EGW |
3/QUALITY |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-1 |
20241130 |
EGW |
3/CONSISTENCY |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/AUTHENTICITY |
Claude 3.5 Haiku |
ChatGPT OpenAI o1-preview |
1-0 |
20241130 |
EGW |
3/ACCURACY |
ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/ROBUSTNESS |
ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/EXPLAINABILITY |
ContentDetector.AI |
Writer AI Content Detector |
1-0 |
20241130 |
AGT |
2/ACCURACY |
DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
0-1 |
20241031 |
SUM |
6/RELEVANCE |
DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-0 |
20241031 |
SUM |
6/CONVENIENCE |
DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-0 |
20241031 |
SUM |
6/CONCISENESS |
DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
1-1 |
20241031 |
SUM |
6/COHERENCE |
DuckDuckGo AI Chat & GPT-4o mini |
DuckDuckGo AI Chat & Claude 3 Haiku |
0-1 |
20241031 |
SUM |
6/ACCURACY |
Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/RELEVANCE |
Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/CONVENIENCE |
Mistral Le Chat & NeMo |
Gemini Advanced |
0-1 |
20240930 |
SUM |
5/CONCISENESS |
Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/COHERENCE |
Mistral Le Chat & NeMo |
Gemini Advanced |
1-0 |
20240930 |
SUM |
5/ACCURACY |
QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/ROBUSTNESS |
QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/EXPLAINABILITY |
QuillBot AI Detector |
Sapling AI Detector |
1-0 |
20240930 |
AGT |
1/ACCURACY |
DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/RELEVANCE |
DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-1 |
20240915 |
SUM |
4/CONVENIENCE |
DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
0-1 |
20240915 |
SUM |
4/CONCISENESS |
DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/COHERENCE |
DuckDuckGo AI Chat & Llama 3.1 70B |
DuckDuckGo AI Chat & Mixtral 8x7B |
1-0 |
20240915 |
SUM |
4/ACCURACY |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/TRANSPARENCY |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/RATIONALITY |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-0 |
20240915 |
JDG |
4/MARGINALITY |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/IMPARTIALITY |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/ETHICS |
Claude 3.5 Sonnet |
ChatGPT OpenAI o1 |
1-1 |
20240915 |
JDG |
4/CONSISTENCY |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/UX |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
0-1 |
20240915 |
EGW |
2/RELEVANCE |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
0-1 |
20240915 |
EGW |
2/QUALITY |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/CONSISTENCY |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/AUTHENTICITY |
ChatGPT GPT-4o |
ChatGPT OpenAI o1-mini |
1-1 |
20240915 |
EGW |
2/ACCURACY |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/UX |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/RELEVANCE |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/QUALITY |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/QUALITY |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-0 |
20240915 |
EGW |
1/CONSISTENCY |
Claude 3.5 Sonnet |
Llama 3.1 70B |
1-1 |
20240915 |
EGW |
1/AUTHENTICITY |
ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/RELEVANCE |
ChatGPT GPT-4o |
Perplexity.ai Pro |
1-1 |
20240830 |
SUM |
3/CONVENIENCE |
ChatGPT GPT-4o |
Perplexity.ai Pro |
1-0 |
20240830 |
SUM |
3/CONCISENESS |
ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/COHERENCE |
ChatGPT GPT-4o |
Perplexity.ai Pro |
0-1 |
20240830 |
SUM |
3/ACCURACY |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-1 |
20240830 |
JDG |
3/TRANSPARENCY |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/RATIONALITY |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
0-0 |
20240830 |
JDG |
3/MARGINALITY |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/IMPARTIALITY |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-1 |
20240830 |
JDG |
3/ETHICS |
Mixtral 8x7B Instruct |
Llama 3 8B Instruct |
1-0 |
20240830 |
JDG |
3/CONSISTENCY |
You.com Smart |
Perplexity.ai Quick Search |
0-1 |
20240812 |
SUM |
2/RELEVANCE |
You.com Smart |
Perplexity.ai Quick Search |
1-1 |
20240812 |
SUM |
2/CONVENIENCE |
You.com Smart |
Perplexity.ai Quick Search |
1-0 |
20240812 |
SUM |
2/CONCISENESS |
You.com Smart |
Perplexity.ai Quick Search |
1-0 |
20240812 |
SUM |
2/COHERENCE |
You.com Smart |
Perplexity.ai Quick Search |
1-1 |
20240812 |
SUM |
2/ACCURACY |
Cohere Chat Command R+ |
Gemini Chat |
1-0 |
20240812 |
SUM |
1/RELEVANCE |
Cohere Chat Command R+ |
Gemini Chat |
1-1 |
20240812 |
SUM |
1/CONVENIENCE |
Cohere Chat Command R+ |
Gemini Chat |
0-1 |
20240812 |
SUM |
1/CONCISENESS |
Cohere Chat Command R+ |
Gemini Chat |
1-1 |
20240812 |
SUM |
1/COHERENCE |
Cohere Chat Command R+ |
Gemini Chat |
1-0 |
20240812 |
SUM |
1/ACCURACY |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/TRANSPARENCY |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/RATIONALITY |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/MARGINALITY |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/IMPARTIALITY |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/ETHICS |
Cohere Command R |
Llama 3 70B |
1-1 |
20240729 |
JDG |
2/CONSISTENCY |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/TRANSPARENCY |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/RATIONALITY |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/MARGINALITY |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/IMPARTIALITY |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-1 |
20240729 |
JDG |
1/ETHICS |
ChatGPT GPT-4o |
ChatGPT GPT-4o mini |
1-0 |
20240729 |
JDG |
1/CONSISTENCY |