Production-grade static analyzer for detecting AI-generated code quality issues with evidence-based validation.
Detects six critical categories of AI-generated code problems with actionable, context-aware questions.
Quick Navigation: Quick Start • What's New • Architecture • Core Features • Configuration • CLI Usage • CI/CD Integration • Development
```bash
# Install from PyPI
pip install ai-slop-detector

# Analyze a single file
slop-detector mycode.py

# Scan entire project
slop-detector --project ./src

# CI/CD Integration (Soft mode - PR comments only)
slop-detector --project ./src --ci-mode soft --ci-report

# CI/CD Integration (Hard mode - fail build on issues)
slop-detector --project ./src --ci-mode hard --ci-report

# Generate JSON report
slop-detector mycode.py --json --output report.json
```

- `run_scan.py` now applies configured ignore patterns before analysis.
- Corpus and manual test fixtures are no longer treated as production targets when ignored by config.
- Project scan output is now consistent with `Config.ignore` and `SlopDetector` project filtering behavior.
"Trust, but Verify" - Now Enforced:
- ✅ Integration Test Requirement: Claims like `production-ready`, `scalable`, or `enterprise-grade` now FAIL if no integration tests are detected.
- ✅ 4-Layer Evidence Detection: Scans paths (`tests/integration`), filenames (`*_integration_test.py`), markers (`@pytest.mark.e2e`), and runtime usage (`TestClient`).
- ✅ Enhanced Reporting: Reports now explicitly break down `tests_unit` vs `tests_integration`.
- Tests: 170 comprehensive tests (up from 165)
- Coverage: 95% test coverage on the Context-Jargon module
- Zero-False-Positive Tuning: Helper files excluded from test counts
- Context-Based Jargon Detection - Cross-validates quality claims with actual evidence
- Docstring Inflation Analysis - Detects documentation-heavy, implementation-light code
- Placeholder Pattern Catalog - 14 patterns detecting unfinished/scaffolded code
- Hallucination Dependencies - Identifies purpose-specific imports that are never used
- Question Generation UX - Converts findings into actionable review questions
- CI Gate 3-Tier System - Soft/Hard/Quarantine enforcement modes (fully tested)
AI Slop refers to code patterns commonly produced by AI code generators that lack substance:
```python
def quantum_encode(self, data):
    """Apply quantum encoding with advanced algorithms."""
    pass  # [CRITICAL] Empty implementation

def process_data(self):
    """Process data comprehensively."""
    raise NotImplementedError  # [HIGH] Unimplemented
```

Detection: 14 placeholder patterns (empty `except`, `NotImplementedError`, `pass`, ellipsis, `return None`, etc.)
```python
class EnterpriseProcessor:
    """
    Production-ready, enterprise-grade, highly scalable processor
    with fault-tolerant architecture and comprehensive error handling.
    """
    def process(self, data):
        return data + 1  # [CRITICAL] Claims without evidence
```

Detection: Cross-validates claims like "production-ready" against actual evidence (error handling, logging, tests, etc.)
```python
def add(a, b):
    """
    Sophisticated addition algorithm with advanced optimization.

    This function implements a state-of-the-art arithmetic operation
    using enterprise-grade validation and comprehensive error handling
    with production-ready reliability guarantees.

    Args:
        a: First operand with advanced type validation
        b: Second operand with enterprise-grade checking

    Returns:
        Optimized sum with comprehensive quality assurance
    """
    return a + b  # [WARNING] 12 lines of docs, 1 line of code
```

Detection: Ratio analysis (docstring lines / implementation lines)
```python
# [CRITICAL] 10 unused purpose-specific imports detected
import torch             # ML: never used
import tensorflow as tf  # ML: never used
import requests          # HTTP: never used
import sqlalchemy        # Database: never used

def process():
    return "hello"  # None of the imports are actually used
```

Detection: Categorizes imports by purpose (ML, HTTP, database) and validates usage
AI-SLOP Detector v2.6.2 uses a multi-dimensional analysis engine:
```mermaid
graph TD
    A[Python Code] --> B[Core Metrics v2.0]
    B --> C[Pattern Detection v2.1]
    C --> D[Evidence Validation v2.2]
    D --> E[Question Generation v2.2]
    E --> F[Deficit Score + Report]

    B1[LDR Logic Density Ratio<br/>Inflation Jargon Detection<br/>DDC Dependency Check]
    C1[14 Placeholder Patterns<br/>4 Structural Anti-patterns<br/>6 Cross-language Patterns]
    D1[Context-Based Jargon<br/>Docstring Inflation<br/>Hallucination Dependencies]
    E1[Critical/Warning/Info Questions<br/>Actionable Review Guidance]

    B -.-> B1
    C -.-> C1
    D -.-> D1
    E -.-> E1

    style A fill:#e1f5ff
    style F fill:#ffe1e1
    style B fill:#f0f0f0
    style C fill:#f0f0f0
    style D fill:#f0f0f0
    style E fill:#f0f0f0
```
Validates quality claims against actual codebase evidence:
```python
# Claims "production-ready" but missing:
#   - error_handling
#   - logging
#   - tests
#   - input_validation
#   - config_management
# [CRITICAL] "production-ready" claim lacks 5/5 required evidence
```

Evidence tracked (15 types):
| Category | Evidence Types | Detection Signals |
|---|---|---|
| Testing | Unit tests | test functions, test files, test directories |
| | Integration tests | `tests/integration` path, pytest markers, `TestClient`/testcontainers |
| Quality Assurance | Error handling | `try`/`except` with non-empty handlers |
| | Logging | actual logger usage, not just imports |
| | Input validation | `isinstance`, type checks, assertions |
| | Documentation | meaningful docstrings |
| Configuration | Config management | settings, `.env`, yaml references |
| | Monitoring | prometheus, statsd, sentry |
| Security | Security measures | auth, encryption, sanitization |
| Performance | Caching | `@cache`, redis, memcache |
| | Async support | `async`/`await` usage |
| | Optimization | vectorization, memoization |
| Reliability | Retry logic | `@retry`, backoff, circuit breaker |
| Architecture | Design patterns | Factory, Singleton, Observer |
| | Advanced algorithms | complexity >= 10 |
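To make the cross-validation concrete, here is a minimal sketch of how a claim-to-evidence check can work. The `CLAIM_EVIDENCE` mapping and function name are illustrative assumptions, not the detector's actual internals:

```python
import re

# Hypothetical claim -> required-evidence mapping (evidence names from the table above).
CLAIM_EVIDENCE = {
    "production-ready": {"error_handling", "logging", "tests",
                         "input_validation", "config_management"},
    "scalable": {"async_support", "caching", "tests"},
}

def validate_claims(docstring, evidence_found):
    """Return (claim, missing_evidence) pairs for claims lacking support."""
    findings = []
    for claim, required in CLAIM_EVIDENCE.items():
        if re.search(re.escape(claim), docstring, re.IGNORECASE):
            missing = required - evidence_found
            if missing:
                findings.append((claim, sorted(missing)))
    return findings

# Example: "production-ready" claim backed only by logging evidence.
print(validate_claims(
    "Production-ready, scalable processor.",
    evidence_found={"logging"},
))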
Detects documentation-heavy, implementation-light functions:
```
Ratio = docstring_lines / implementation_lines

CRITICAL: ratio >= 2.0  (2x more docs than code)
WARNING:  ratio >= 1.0  (more docs than code)
INFO:     ratio >= 0.5  (substantial docs)
PASS:     ratio <  0.5  (balanced or code-heavy)
```
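A minimal sketch of this ratio check built on the standard `ast` module, using the thresholds above. The function name and line-counting details are illustrative assumptions, not the tool's exact implementation:

```python
import ast

def docstring_inflation(func_src):
    """Classify a function by its docstring-to-implementation line ratio."""
    fn = ast.parse(func_src).body[0]
    doc = ast.get_docstring(fn) or ""
    doc_lines = len(doc.splitlines())
    body = fn.body[1:] if doc else fn.body  # skip the docstring statement
    impl_lines = sum(s.end_lineno - s.lineno + 1 for s in body) or 1
    ratio = doc_lines / impl_lines
    for threshold, severity in ((2.0, "CRITICAL"), (1.0, "WARNING"), (0.5, "INFO")):
        if ratio >= threshold:
            return ratio, severity
    return ratio, "PASS"

src = 'def add(a, b):\n    """Very\n    long\n    docs."""\n    return a + b'
print(docstring_inflation(src))  # (3.0, 'CRITICAL')
```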
14 patterns detecting unfinished/scaffolded code:

Critical Severity:
- Empty exception handlers (`except: pass`)
- Bare except blocks

High Severity:
- `raise NotImplementedError`
- Ellipsis placeholders (`...`)
- HACK comments

Medium Severity:
- `return None` placeholders
- Interface-only classes (75%+ placeholder methods)

Low Severity:
- `pass` statements
- TODO/FIXME comments
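For illustration, a minimal `ast`-based sketch covering three of the fourteen patterns (empty except handlers, `raise NotImplementedError`, and pass-only functions); the real catalog is broader:

```python
import ast

def find_placeholders(source):
    """Flag a few of the catalogued placeholder patterns."""
    findings = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.ExceptHandler) and all(
                isinstance(s, ast.Pass) for s in node.body):
            findings.append(("CRITICAL", node.lineno, "empty except handler"))
        elif (isinstance(node, ast.Raise) and node.exc is not None
                and "NotImplementedError" in ast.dump(node.exc)):
            findings.append(("HIGH", node.lineno, "raise NotImplementedError"))
        elif isinstance(node, ast.FunctionDef) and all(
                isinstance(s, ast.Pass) for s in node.body):
            findings.append(("LOW", node.lineno, "pass-only function"))
    return findings

code = (
    "def f():\n"
    "    pass\n"
    "try:\n"
    "    f()\n"
    "except Exception:\n"
    "    pass\n"
)
print(find_placeholders(code))
```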
Categorizes imports by purpose and validates usage:
12 Categories tracked:
- ML: torch, tensorflow, keras, transformers
- Vision: cv2, PIL, imageio
- HTTP: requests, httpx, aiohttp, flask
- Database: sqlalchemy, pymongo, redis
- Async: asyncio, trio, anyio
- Data: pandas, polars, dask
- Serialization: json, yaml, toml
- Testing: pytest, unittest, mock
- Logging: logging, loguru, structlog
- CLI: argparse, click, typer, rich
- Cloud: boto3, google-cloud, azure
- Security: cryptography, jwt, passlib
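A minimal sketch of the usage check, with the category table abbreviated to five entries. The `PURPOSE` mapping and function name are illustrative assumptions:

```python
import ast

# Abbreviated version of the category table above.
PURPOSE = {"torch": "ML", "tensorflow": "ML", "requests": "HTTP",
           "sqlalchemy": "Database", "pandas": "Data"}

def unused_purpose_imports(source):
    """Report purpose-specific imports whose names never appear in the code body."""
    tree = ast.parse(source)
    imported = {}  # local name -> module name
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                imported[alias.asname or alias.name.split(".")[0]] = alias.name
    used = {n.id for n in ast.walk(tree) if isinstance(n, ast.Name)}
    return [(name, PURPOSE[mod]) for name, mod in imported.items()
            if mod in PURPOSE and name not in used]

code = "import torch\nimport requests\n\ndef process():\n    return 'hello'\n"
print(unused_purpose_imports(code))  # [('torch', 'ML'), ('requests', 'HTTP')]
```

Attribute access such as `torch.tensor(...)` still produces a `Name` node for `torch`, so genuinely used imports are not flagged.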
Converts findings into actionable review questions:
```
CRITICAL QUESTIONS:
1. Only 14% of quality claims are backed by evidence.
   Are these marketing buzzwords without substance?

2. Claims like "fault-tolerant", "scalable" have ZERO supporting evidence.
   Where are the tests, error handling, and other indicators?

WARNING QUESTIONS:
3. (Line 4) "production-ready" claim lacks: error_handling, logging, tests.
   Only 20% of required evidence present.

4. Function "process" has 15 lines of docstring but only 2 lines of implementation.
   Is this AI-generated documentation without substance?

5. Why import "torch" for machine learning but never use it?
   Was this AI-generated boilerplate?
```
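A minimal sketch of how structured findings can be rendered into questions like the ones above; the template keys and finding shape are illustrative assumptions:

```python
# Illustrative: map structured findings to reviewer-facing questions.
TEMPLATES = {
    "unsupported_claim": ('(Line {line}) "{claim}" claim lacks: {missing}. '
                          "Only {pct}% of required evidence present."),
    "unused_import": ('Why import "{module}" for {purpose} but never use it? '
                      "Was this AI-generated boilerplate?"),
}

def to_question(finding):
    return TEMPLATES[finding["kind"]].format(**finding)

print(to_question({
    "kind": "unsupported_claim", "line": 4, "claim": "production-ready",
    "missing": "error_handling, logging, tests", "pct": 20,
}))
print(to_question({"kind": "unused_import",
                   "module": "torch", "purpose": "machine learning"}))
```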
Progressive enforcement for CI/CD pipelines:
Soft Mode (Informational):

```bash
slop-detector --project . --ci-mode soft --ci-report
# Posts PR comment, never fails build
# Use for: visibility, onboarding
```

Hard Mode (Strict):

```bash
slop-detector --project . --ci-mode hard --ci-report
# Fails build if deficit_score >= 70 or critical_patterns >= 3
# Exit code 1 on failure
# Use for: production branches
```

Quarantine Mode (Gradual):

```bash
slop-detector --project . --ci-mode quarantine --ci-report
# Tracks repeat offenders in .slop_quarantine.json
# Escalates to FAIL after 3 violations
# Use for: gradual rollout
```
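A minimal sketch of the quarantine bookkeeping described above. The file name comes from the docs; the exact escalation logic and JSON layout are assumptions:

```python
import json
from pathlib import Path

QUARANTINE = Path(".slop_quarantine.json")
MAX_VIOLATIONS = 3  # escalates to FAIL after 3 violations (per the docs)

def record_violation(file_path):
    """Record a violation; return True when the build should fail."""
    data = json.loads(QUARANTINE.read_text()) if QUARANTINE.exists() else {}
    data[file_path] = data.get(file_path, 0) + 1
    QUARANTINE.write_text(json.dumps(data, indent=2))
    return data[file_path] >= MAX_VIOLATIONS

# Example: the third strike on the same file escalates to a hard failure.
for _ in range(3):
    should_fail = record_violation("src/example.py")
print(should_fail)  # True
```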
GitHub Action Example:

```yaml
- name: Quality Gate
  run: |
    pip install ai-slop-detector
    slop-detector --project . --ci-mode quarantine --ci-report
```

```bash
# Single file
slop-detector mycode.py

# Project scan
slop-detector --project ./src

# CI/CD Integration
slop-detector --project . --ci-mode hard --ci-report

# With custom config
slop-detector --project ./src --config .slopconfig.yaml
```

Complete CLI Reference →
Create `.slopconfig.yaml` for custom thresholds:

```yaml
weights:
  ldr: 0.40        # Logic Density Ratio
  inflation: 0.35  # Jargon Detection
  ddc: 0.25        # Dependency Check

thresholds:
  ldr:
    critical: 0.30
    warning: 0.60
```

Full Configuration Guide →
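For intuition, a minimal sketch of how these weights could combine per-dimension scores into the overall deficit score used by the CI gate. The 0-100 component scale is an assumption; the tool's exact scoring internals aren't documented here:

```python
# Assumed: each component yields a 0-100 deficit score; weights from the config above.
WEIGHTS = {"ldr": 0.40, "inflation": 0.35, "ddc": 0.25}

def deficit_score(components):
    return sum(WEIGHTS[k] * components[k] for k in WEIGHTS)

# Example: weak logic density, heavy jargon, clean dependencies.
print(deficit_score({"ldr": 80, "inflation": 90, "ddc": 10}))  # 66.0
```

With hard mode's default gate (fail at deficit_score >= 70), this example would still pass.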
```bash
# Soft mode - informational only
slop-detector --project . --ci-mode soft --ci-report

# Hard mode - fail build on issues
slop-detector --project . --ci-mode hard --ci-report

# Claim-based enforcement (v2.6.2)
slop-detector --project . --ci-mode hard --ci-claims-strict
```

CI/CD Integration Guide →
Coming Soon: Real-time analysis in VS Code with inline diagnostics.
Current status: Local testing complete, marketplace publishing pending.
Contributions welcome! Quick setup:
```bash
git clone https://github.com/flamehaven01/AI-SLOP-Detector.git
cd AI-SLOP-Detector
pip install -e ".[dev]"
pytest tests/ -v --cov
```

Guidelines: 80%+ coverage • Tests required • Follow code style

Development Guide →
MIT License - see LICENSE file for details.
If you use AI-SLOP Detector in research, please cite:
```bibtex
@software{ai_slop_detector,
  title   = {AI-SLOP Detector: Evidence-Based Static Analysis for AI-Generated Code},
  author  = {Flamehaven},
  year    = {2024},
  version = {2.6.2},
  url     = {https://github.com/flamehaven01/AI-SLOP-Detector}
}
```

- Built with Python 3.8+
- AST analysis powered by Python's `ast` module
- Pattern detection inspired by traditional linters
- Evidence validation methodology developed in-house
- Thanks to the open-source community
v2.7 (In Progress):
- VS Code Extension marketplace release (dev complete, pending publish)
- Enhanced evidence types (15+ types) - currently 14 types ✅
- Custom pattern DSL for user-defined rules
- Performance optimizations for large codebases
v2.8 (Planned Q1 2025):
- Multi-language support (JavaScript, TypeScript)
- Enhanced CI/CD integrations (GitLab CI, CircleCI)
- Real-time analysis daemon mode
- Team analytics dashboard (beta)
v3.0 (Planned Q2 2025):
- ML-based pattern recognition
- Auto-fix suggestions with confidence scores
- IDE plugins (PyCharm, IntelliJ, JetBrains)
- Enterprise features (SSO, RBAC already implemented)
- Documentation: docs/
- Issues: GitHub Issues
- Discussions: GitHub Discussions
Made with ❤️ by Flamehaven | Detecting AI slop since 2024



