
Slop-Score: Writing Metrics Analyzer

An in-browser text analysis tool that detects AI-generated "slop" patterns, repetitive language, and "Not X, but Y" constructions in writing.

Try it at: https://eqbench.com/slop-score.html

Implementation Details

Slop List Matches

First, the evaluated model generates 150 creative-writing outputs and 150 essay outputs. Its writing is then matched against a precomputed list of words and trigrams that are overused in AI text relative to human baselines.
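A minimal sketch of how such list matching could work. The tiny word and trigram lists and the tokenizer below are illustrative placeholders; the project's actual lists live in data/ and are far larger:

```javascript
// Hypothetical mini slop lists; the real ones are precomputed from corpora.
const SLOP_WORDS = new Set(["tapestry", "delve", "testament"]);
const SLOP_TRIGRAMS = new Set(["a testament to", "sent shivers down"]);

function countSlopMatches(text) {
  // Naive tokenizer for illustration only: lowercase word runs.
  const tokens = text.toLowerCase().match(/[a-z']+/g) ?? [];
  let words = 0;
  let trigrams = 0;
  for (let i = 0; i < tokens.length; i++) {
    if (SLOP_WORDS.has(tokens[i])) words++;
    if (i + 2 < tokens.length &&
        SLOP_TRIGRAMS.has(tokens.slice(i, i + 3).join(" "))) trigrams++;
  }
  return { words, trigrams, total: tokens.length };
}
```

Match counts would then be normalized against total token count before scoring.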

Not-X-But-Y Pattern Detection

The not-x-but-y detector implements a two-stage regex pipeline:

Stage 1: Surface Patterns

  • Runs 10 regex patterns on normalized text
  • Patterns include: "not X, but Y", dash forms, cross-sentence contrasts
  • Maps matches to sentence spans
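A single surface pattern of this family can be illustrated as follows. This is an assumed, simplified expression for demonstration, not one of the project's ten actual patterns:

```javascript
// Illustrative "not X, but Y" surface pattern. The real pipeline runs ten
// such patterns over normalized text and maps each hit to a sentence span.
const NOT_X_BUT_Y = /\bnot (?:just |only |merely )?([^,;]{1,60}),? but (?:also |rather )?/gi;

function findContrasts(text) {
  return [...text.matchAll(NOT_X_BUT_Y)].map(m => ({
    start: m.index,              // character offset of the match
    end: m.index + m[0].length,
    x: m[1].trim(),              // the "X" side of the contrast
  }));
}
```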

Stage 2: POS-Tagged Patterns

  • Tags text with wink-pos-tagger (maps Penn Treebank tags to simplified VERB/NOUN/ADJ/ADV)
  • Runs 35 regex patterns on tagged stream
  • Maintains character offset mapping between tagged stream and raw text
  • Uses binary search for efficient offset lookups
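The offset lookup can be sketched as a binary search over a sorted array of anchor pairs mapping tagged-stream offsets back to raw-text offsets. The anchor data layout here is an assumption for illustration, not the project's actual structure:

```javascript
// Assumed anchor format: sorted ascending by taggedOffset, each anchor mapping
// the start of a token in the tagged stream to its start in the raw text.
function rawOffsetFor(anchors, taggedOffset) {
  let lo = 0, hi = anchors.length - 1, best = 0;
  while (lo <= hi) {
    const mid = (lo + hi) >> 1;
    if (anchors[mid].taggedOffset <= taggedOffset) {
      best = mid;        // last anchor at or before the query offset
      lo = mid + 1;
    } else {
      hi = mid - 1;
    }
  }
  const a = anchors[best];
  // The offset within a token carries over unchanged between the two streams.
  return a.rawOffset + (taggedOffset - a.taggedOffset);
}
```

This gives O(log n) lookups per match instead of a linear scan over all anchors.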

Slop Score Calculation

The Slop Score is a weighted composite metric designed to detect AI-generated text patterns:

  • 60% - Slop Words: Frequency of individual words that appear unnaturally often in LLM outputs
  • 25% - Not-x-but-y Patterns: Frequency of contrast patterns like "not just X, but Y" which are overused by AI
  • 15% - Slop Trigrams: Frequency of 3-word phrases that appear unnaturally often in LLM outputs
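The weighting above amounts to a simple weighted sum. In this sketch the component scores are assumed to already be normalized to a comparable scale; the actual normalization lives in the project code:

```javascript
// Weights from the breakdown above; they sum to 1.
const WEIGHTS = { slopWords: 0.60, notXButY: 0.25, slopTrigrams: 0.15 };

// components: { slopWords, notXButY, slopTrigrams }, each pre-normalized.
function slopScore(components) {
  return Object.entries(WEIGHTS).reduce(
    (sum, [key, weight]) => sum + weight * components[key], 0);
}
```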

Usage

Analysing Your Text

To compute a slop score on your own text, visit https://eqbench.com/slop-score.html and click "Analyse text". The slop score and related metrics are computed locally, in your browser.

Alternatively, serve this directory locally with python -m http.server and browse to http://localhost:8000.

Benchmarking Models

A benchmarking script is also included. It generates 300 outputs from the same set of standardised prompts (creative writing and essays) used for the leaderboard.

Use bench-model.mjs to generate text completions from any OpenAI-compatible API:

# Setup: Create .env file with your API credentials
cp .env.example .env
# Edit .env and set BASE_URL and API_KEY

# Basic usage (uses .env)
./bench-model.mjs --model "openai/gpt-4o"

# Or override with command-line args
./bench-model.mjs \
  --model "openai/gpt-4o" \
  --base-url "https://openrouter.ai/api/v1/chat/completions" \
  --api-key "your-key"

# Custom settings
./bench-model.mjs \
  --model "meta-llama/llama-3.1-70b-instruct" \
  --workers 16 \
  --max-tokens 4096 \
  --n-prompts 300

Features:

  • Parallel generation with configurable worker count
  • Resume support (skips already-generated prompts)
  • Atomic file writes for safe concurrent operations
  • Progress tracking and error reporting
  • Saves to results/[model-id].json by default

Parameters:

  • --model (required): Model identifier
  • --base-url: API endpoint (default: from .env BASE_URL or OPENAI_BASE_URL)
  • --api-key: API key (default: from .env API_KEY or OPENAI_API_KEY)
  • --workers: Number of parallel requests (default: 8)
  • --max-tokens: Max tokens per generation (default: 8096)
  • --n-prompts: Number of prompts to use (default: 300)
  • --max-retries: Max retries per request (default: 3)
  • --prompts: Path to prompts file (default: data/prompts.json)
  • --results: Custom output path

Environment variables (.env file):

  • BASE_URL: API endpoint (e.g., https://openrouter.ai/api/v1/chat/completions)
  • API_KEY: Your API key

Generating the Leaderboard

Use generate-leaderboard.mjs to analyze all results and create the leaderboard:

# Generate leaderboard from all results/*.json files, skipping ones already computed
./generate-leaderboard.mjs

# Force recalculation of all models
./generate-leaderboard.mjs --force

# Force recalculation of human baseline (BYO human writing samples)
./generate-leaderboard.mjs --force-recalc-human

Metrics calculated:

  • Slop list matches (word and trigram)
  • N-gram repetition score
  • Not-x-but-y contrast patterns
  • Lexical diversity (MATTR-500)
  • Flesch-Kincaid grade level
  • Average sentence/paragraph length
  • Dialogue frequency
  • Top over-represented words, bigrams, and trigrams
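Of these, MATTR-500 (moving-average type-token ratio over a 500-token window) is the least self-explanatory. A sketch, with a simplified tokenization assumption and a whole-text fallback for short inputs:

```javascript
// MATTR: average the type-token ratio (unique tokens / window size) over every
// sliding window of `windowSize` tokens. MATTR-500 uses a 500-token window.
function mattr(tokens, windowSize = 500) {
  if (tokens.length <= windowSize) {
    // Too short for a full window: fall back to plain whole-text TTR.
    return new Set(tokens).size / tokens.length;
  }
  const counts = new Map();
  let types = 0, sum = 0, windows = 0;
  for (let i = 0; i < tokens.length; i++) {
    const t = tokens[i];
    counts.set(t, (counts.get(t) ?? 0) + 1);
    if (counts.get(t) === 1) types++;
    if (i >= windowSize) {
      // Slide the window: drop the token that fell out of it.
      const old = tokens[i - windowSize];
      counts.set(old, counts.get(old) - 1);
      if (counts.get(old) === 0) types--;
    }
    if (i >= windowSize - 1) {
      sum += types / windowSize;
      windows++;
    }
  }
  return sum / windows;
}
```

Unlike plain TTR, the moving average keeps the metric comparable across texts of different lengths.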

License

This project uses a dual-license structure:

MIT License (Primary Components)

The following components are licensed under the MIT License:

  • Slop Score computation and dependencies (human writing profile, slop lists in data/)
  • Results datasets in results/
  • JavaScript code in js/ (excluding wordfreq.js)
  • All code in project root (./) including leaderboard generation and website
  • slop-score.html and related HTML files

See LICENSE for full MIT license text.

Apache 2.0 & CC-BY-SA 4.0 (wordfreq Components)

The following components retain their original wordfreq licensing:

  • js/wordfreq.js - Apache License 2.0
  • data/large_en.msgpack.gz - Creative Commons Attribution-ShareAlike 4.0

This project includes a JavaScript port of wordfreq, used for baseline frequency analysis.

Citation:

Robyn Speer. (2022). rspeer/wordfreq: v3.0 (v3.0.2). Zenodo. https://doi.org/10.5281/zenodo.7199437

@software{robyn_speer_2022_7199437,
  author       = {Robyn Speer},
  title        = {rspeer/wordfreq: v3.0},
  month        = sep,
  year         = 2022,
  publisher    = {Zenodo},
  version      = {v3.0.2},
  doi          = {10.5281/zenodo.7199437},
  url          = {https://doi.org/10.5281/zenodo.7199437}
}

How to Cite

If you use slop-score in your research, please cite it as:

@misc{paech2025slopScore,
      title={slop-score},
      author={Samuel J. Paech},
      year={2025},
      howpublished={\url{https://github.com/sam-paech/slop-score}},
      note={GitHub repository}
}
