Christian AI Benchmark v1-draft: how 11 leading models scored

Cover for Christian AI Benchmark v1-draft: how 11 leading models scored
Tonye BrownWritten byTonye Brown
Last updated
2 minute read
Methodology
Share:

We re-ran the FaithGPT Christian AI Benchmark on 2026-06-11. 11 models answered 116 questions covering Scripture interpretation, doctrine, pastoral care, citation accuracy, and safety, producing 2,726 scored evaluations. Every answer was scored by independent AI judges grounded in the actual KJV text and public-domain commentaries, so a model that invents a verse gets caught instead of graded on confidence.

FaithGPT topped this run with an overall score of 90.2/100.

The leaderboard

RankModelOverall (0-100)Cost per 100 answers
1FaithGPT90.2$1.48
2gpt-5.588.9$0.22
3gpt-5.5-pro88.9$0.19
4Claude Fable 588.9$0.18
5Claude Sonnet 4.688.8$0.68
6Claude Opus 4.888.8$4.14
7gpt-5.488.3$0.10
8Gemini 3.1 Pro Preview88.0$1.30
9Claude Haiku 4.587.7$0.11
10Gemini 2.5 Flash87.6$0.34
11Gemini 3.5 Flash19.3$1.16

Cost is what it actually took to generate the answers in this run, measured from provider token telemetry. It excludes the cost of judging.

Who wins each category

CategoryWinnerScore
Apologeticsgpt-5.489.3
Biblical literacyFaithGPT89.4
Christian ethicsClaude Opus 4.889.5
Citation trapsFaithGPT90.1
Content creationFaithGPT92.2
Denominational nuancegpt-5.5-pro88.8
DoctrineClaude Sonnet 4.690.4
Pastoral careFaithGPT91.8
Safety boundariesFaithGPT90.9
Scripture interpretationFaithGPT90.0

Category scores average every model's answers within that category. A model can lead overall and still lose a category to a specialist.

Popular postsView all
The FaithGPT Newsletter

Your weekly faith & AI brief.

Scripture, reflection, and the AI news that matters for Christians. Free, every week.

Read this week’s issue

Best value

On score per dollar, gpt-5.4 delivered the most: 88.3/100 at $0.10 per 100 answers.

How to read these results

These numbers measure benchmark version v1-draft on this question set. The judges verify citations against the KJV database, scoring averages multiple judge passes per answer, and the published cost comes from provider telemetry rather than list prices. No benchmark replaces Scripture, pastors, or Christian community.

The live leaderboard always carries the most current version: faithgpt.io/benchmarks.

Editorial method

Scripture-aware, product-tested, and linked to FaithGPT methodology

Methodology4 structured sectionsLast updated

Wake Up Excited About Your Quiet Time Again

  • Fresh insights daily

  • Personalized to your journey

  • Deepen your relationship with God

Refresh Your Faith
Faith AI tech perspective
Tonye Brown - FaithGPT Creator

Tonye Brown

Founder & Developer

Tonye Brown is a Christian software developer, husband, father, and the founder of FaithGPT. He builds Gospel-centered AI tools for Bible study, prayer, ministry workflows, theological review, and Christian creativity, with a focus on making advanced technology useful without letting it replace Scripture, wisdom, or the local church.

FaithGPT articles discuss AI in church contexts. Using AI in ministry is a choice, not a necessity, and should never replace the Holy Spirit's guidance. Learn more

Share this article

Related resources