Christian AI Benchmark v1-draft: how 11 leading models scored

Cover for Christian AI Benchmark v1-draft: how 11 leading models scored
Tonye BrownEcrit parTonye Brown
Derniere mise a jour
2 minutes de lecture
Methodologie
Partager :

We re-ran the FaithGPT Christian AI Benchmark on 2026-06-11. 11 models answered 116 questions covering Scripture interpretation, doctrine, pastoral care, citation accuracy, and safety, producing 2,726 scored evaluations. Every answer was scored by independent AI judges grounded in the actual KJV text and public-domain commentaries, so a model that invents a verse gets caught instead of graded on confidence.

FaithGPT topped this run with an overall score of 90.2/100.

The leaderboard

RankModelOverall (0-100)Cost per 100 answers
1FaithGPT90.2$1.48
2gpt-5.588.9$0.22
3gpt-5.5-pro88.9$0.19
4Claude Fable 588.9$0.18
5Claude Sonnet 4.688.8$0.68
6Claude Opus 4.888.8$4.14
7gpt-5.488.3$0.10
8Gemini 3.1 Pro Preview88.0$1.30
9Claude Haiku 4.587.7$0.11
10Gemini 2.5 Flash87.6$0.34
11Gemini 3.5 Flash19.3$1.16

Cost is what it actually took to generate the answers in this run, measured from provider token telemetry. It excludes the cost of judging.

Who wins each category

CategoryWinnerScore
Apologeticsgpt-5.489.3
Biblical literacyFaithGPT89.4
Christian ethicsClaude Opus 4.889.5
Citation trapsFaithGPT90.1
Content creationFaithGPT92.2
Denominational nuancegpt-5.5-pro88.8
DoctrineClaude Sonnet 4.690.4
Pastoral careFaithGPT91.8
Safety boundariesFaithGPT90.9
Scripture interpretationFaithGPT90.0

Category scores average every model's answers within that category. A model can lead overall and still lose a category to a specialist.

Articles populairesVoir tout
The FaithGPT Newsletter

Your weekly faith & AI brief.

Scripture, reflection, and the AI news that matters for Christians. Free, every week.

Read this week’s issue

Best value

On score per dollar, gpt-5.4 delivered the most: 88.3/100 at $0.10 per 100 answers.

How to read these results

These numbers measure benchmark version v1-draft on this question set. The judges verify citations against the KJV database, scoring averages multiple judge passes per answer, and the published cost comes from provider telemetry rather than list prices. No benchmark replaces Scripture, pastors, or Christian community.

The live leaderboard always carries the most current version: faithgpt.io/benchmarks.

Methode editoriale

Attentive a l Ecriture, testee dans le produit et liee a la methodologie FaithGPT

Methodologie4 sections structureesDerniere mise a jour

Approfondissez votre foi avec une IA qui respecte l Ecriture

  • Ancre dans la Bible

  • Theologiquement attentif

  • Concu pour les croyants

Essayer FaithGPT
Perspective sur l IA chretienne
Tonye Brown - createur de FaithGPT

Tonye Brown

Fondateur et developpeur

Tonye Brown est developpeur logiciel chretien, mari, pere et fondateur de FaithGPT. Il cree des outils d IA centres sur l Evangile pour l etude biblique, la priere, les flux de travail ministeriels, la revision theologique et la creativite chretienne, avec l objectif de rendre la technologie avancee utile sans remplacer l Ecriture, la sagesse ni l Eglise locale.

Les articles FaithGPT parlent de l IA dans des contextes d Eglise. Utiliser l IA dans le ministere est un choix, pas une necessite, et ne doit jamais remplacer la direction du Saint-Esprit. En savoir plus

Partager cet article

Ressources liees