Quell reads your docstrings, Pydantic models, and type annotations, extracts every testable requirement, finds which ones have no test, generates pytest tests via a rule engine, verifies each test through a 5-gate pipeline, and writes only proven tests to disk.

Does Quell require an LLM API key?

The rule engine runs entirely in-process — no source code is ever transmitted. ~75% of edge cases are handled with no network call and no API key. LLM fallback is opt-in and only sends the function signature, never the full body.

What is the 5-gate pipeline?

Every generated test must pass: Gate 1 (AST valid Python), Gate 2 (not already in a test file), Gate 3 (no shell calls or file writes), Gate 4 (passes against original code), Gate 5 (fails when the requirement is violated). Only gate-5-verified tests are written to disk.

What is the Production Readiness Score (PRS)?

PRS = (WRITTEN × 1.0 + SCAFFOLDED × 0.5) / total_requirements × 100. Tiers: 80-100 Production Ready, 60-79 Review Needed, 0-59 Needs Work.

How is Quell different from GitHub Copilot or Qodo for test generation?

Quell reads specifications that already exist in your code — it does not generate tests from scratch. It finds requirements already documented in your docstrings, Pydantic models, and type annotations that have no test. The 5-gate pipeline, especially Gate 5 (violation injection), verifies each test actually catches the bug it claims to catch. This verification step is not present in Copilot, Qodo, or Hypothesis.

Can Quell be used in CI pipelines?

Yes. Run quell ci src/ --threshold 80 to fail CI if PRS falls below 80. Set prs_threshold in pyproject.toml under [tool.quell]. Works with GitHub Actions, GitLab CI, and any system that checks exit codes.

About

Quell

Name: Quell
Author: Shashank Bindal

Quell finds the edge cases your docstrings describe but your tests never prove. Built by Shashank Bindal after shipping one too many bugs that were documented but untested.

Mission

Code coverage measures which lines ran. It says nothing about whether the code is correct. Quell's goal is simple: for every testable requirement that exists in your codebase — in a docstring, a Pydantic model, a type annotation — there should be a test that proves the requirement holds. Not a test that exercises the line. A test that fails if the requirement is broken.

That's what the 5-gate pipeline does. Gate 5 mutates the source to violate the requirement and runs the test against the mutated code. If the test doesn't fail, the test doesn't ship. Only mathematically proven tests reach your repository.

Principles

Proof, not coverage

A test that only achieves a line hit is worse than no test — it gives false confidence. Every test Quell writes must prove it catches a violation of the requirement it targets.

No silent failures

Every requirement gets a bucket: WRITTEN (proven), SCAFFOLDED (stub), or FLAGGED (reason given). Nothing is quietly skipped. You always know exactly what's covered and why.

Your code stays on your machine

The rule engine handles ~75% of cases locally with no network call. LLM fallback is opt-in and only activated for the remaining complex cases. No source code leaves your machine by default.

Determinism first

The rule engine is the primary path — fast, deterministic, reproducible. LLM is a fallback for the cases rules can't handle, not the default approach.

History

Jan 2024

The idea

After shipping a bug that was documented in a docstring — 'must not accept zero amount' — but had no test, it became clear that coverage metrics lie. A line can be executed without ever proving the contract holds.

Mid 2024

First prototype

A weekend script that parsed Python docstrings and wrote naive pytest stubs. It caught three real bugs in the first project it ran on. The stubs were ugly, but the idea was validated.

Dec 2024

Public beta

v0.4.0 shipped to PyPI. Rule engine, Groq LLM fallback, and a basic 4-gate pipeline. Early adopters gave feedback that shaped the WRITTEN / SCAFFOLDED / FLAGGED split.

Jan 2025

v1.0.0 stable

The full 5-gate pipeline shipped. Gate 5 — proving a test fails when the requirement is violated — became the defining feature. It's the moat between coverage and proof.

May 2025

v2.0.0

Unified `quell find`, Production Readiness Score, libcst AST-safe injection, and the GitHub Action. Quell became a CI-first tool, not just a developer utility.

Built by

Shashank Bindal

Software engineer passionate about developer tooling and automated testing.

Website GitHub Email

Get started →View on GitHub