Phi-3 Medium vs GPT-4o

Side-by-side comparison of Phi-3 Medium (Microsoft) and GPT-4o (OpenAI) — benchmarks, pricing, and capabilities.

Share:

	Phi-3 Medium Microsoft	GPT-4o OpenAI
Category	LLMs	LLMs
Specifications
Context Window	128K	128K
Pricing (per 1M tokens)
Input Cost	Free	$2.50
Output Cost	Free	$10.00
Performance
Overall Score	78.2	92.5
ARC-Challenge	—	96.3
BigBench Hard	68.0	87.2
Chatbot Arena ELO	—	1150.0
DROP	70.0	88.1
GSM8K	78.0	73.1
HumanEval	70.0	92.0
MATH	48.0	76.6
MMLU	78.0	88.7
TruthfulQA	52.0	72.2
WinoGrande	—	89.9
Community
User Rating	★ 4.2	★ 4.7
Reviews	312	1240

Open in Interactive Comparison Tool View Phi-3 Medium View GPT-4o

People Also Compare

Phi-3 MediumvsGPT-o1 GPT-4ovsGPT-o1

Phi-3 MediumvsDeepSeek R1 GPT-4ovsDeepSeek R1

Phi-3 MediumvsClaude 3.5 Sonnet GPT-4ovsClaude 3.5 Sonnet

Phi-3 MediumvsGemini 1.5 Pro GPT-4ovsGemini 1.5 Pro

Phi-3 MediumvsClaude 3 Opus GPT-4ovsClaude 3 Opus

Phi-3 MediumvsDeepSeek V3 GPT-4ovsDeepSeek V3