Model Family · Capabilities

Every task. Which model handles it.

The comprehensive task-to-model mapping for the lineup. Every capability the family ships, mapped to the variant that actually performs it, with a per-cell verdict — full support, partial support, not supported, or dispatched to a sibling via the auto-router.

Lineup recap

Three families, one corpus.

Lino

Inline · sub-100 ms · on-device · the commit-time gut check.

Read the deep dive

Eagle

Wide-angle triage · ranks and clusters candidate paths across the repo.

Read the deep dive

Griffin

Deep reasoning · the hypothesis engine, five variants from 8B to 671B-MoE.

Read the deep dive

Master capability matrix

Capability by variant.

Forty-two tasks across seven bands. Marks reflect actual capability rather than badge-collection upgrades.

Full support

Partial

Not supported

Via routing

Capability	Lino 1B	Griffin S 14B	Griffin Zero 671B-MoE
Detection & triage
Inline sink detection (deserialisation, SSRF-able URL builders, unsafe SQL)
Secret pattern detection (Gitleaks-class)
Sanitiser-quality scoring
Cross-scanner finding dedup
Taint path enumeration (single-package)
Taint path enumeration (cross-package)
Path ranking + clustering
Confidence scoring per candidate path
Reasoning & hypothesis
Exploit-class hypothesis (CWE category mapping)
Exploit-trigger input synthesis
Cross-package taint chain reasoning (≤ 4 hops)
Cross-package taint chain reasoning (≤ 12 hops)
Cross-package taint chain reasoning (> 12 hops)		via routing
Multi-finding correlation in a single reasoning pass
Adversarial disproof pass (refute own hypothesis)
Structured reasoning trace output
Remediation
Single-finding fix suggestion
Auto-fix PR with diff
Auto-fix PR with cited reasoning trace
Sanitiser-aware patch synthesis
Multi-service auto-fix campaign
Upstream coordinated-disclosure patch + draft
Eval & gates
PR-time gate decision			via routing
Pre-merge policy evaluation			via routing
SARIF / CycloneDX / SPDX emit
Eval-harness scoring of candidate patches
Context & scale
Context window 32k
Context window 64k
Context window 128k
Context window 256k (retrieval-gated)
Repo-wide reasoning (1k–5k packages)
Portfolio-wide reasoning (multi-repo)
Deployment shape
On-device inference (no network egress)
Shared cloud
Dedicated cluster
VPC-isolated
Air-gapped / sovereign
AI & MCP governance
Sensitive-data egress scanning
Prompt audit-log signing
MCP tool-call inspection	via routing

"Via routing" means the variant does not perform the capability itself; the auto-router dispatches to a sibling when that capability is requested.

Per-model quick reference

One card per variant.

Lino 1B

Inline gut check at the keystroke.

Best at

Sub-100 ms inline sink + sanitiser detection
Secret patterns and obvious unsafe primitives
On-device, fully offline operation

In the pipeline: Sits in the IDE, CLI, and pre-commit hook.

Deep dive

Eagle 13B

Wide-angle triage across the repo.

Best at

Ranking and clustering candidate taint paths
Cross-scanner deduplication and confidence scoring
Batched full-repo sweeps after CI

In the pipeline: Runs after CI; feeds the auto-router queue.

Deep dive

Griffin Lite 8B

Cheapest deep reasoning tier.

Best at

Single-finding reasoning at low latency
Sanitiser-aware fix synthesis
Short-hop taint chain proofs

In the pipeline: Low-complexity candidates from Eagle's queue.

Deep dive

Griffin S 14B

PR-level reasoning, 64k window.

Best at

Mid-depth call-graph reasoning
Adversarial disproof on PR diffs
Structured reasoning trace output

In the pipeline: PR reviewer tier; default for mid-confidence findings.

Deep dive

Griffin M 32B

Repo-wide reasoning, 128k window.

Best at

Cross-package taint chains up to twelve hops
Multi-finding correlation in one pass
Repo-scale (1k–5k package) reasoning

In the pipeline: Medium-confidence findings, transitive chains.

Deep dive

Griffin L 70B

Default production tier.

Best at

Multi-hop cross-package exploit hypothesis
Cited auto-fix PRs with full trace
Portfolio-wide reasoning across repos

In the pipeline: High-confidence findings; the survivors-of-disproof tier.

Deep dive

Griffin Zero 671B-MoE

Deepest reasoning, sovereign tier.

Best at

256k retrieval-gated context
Coordinated-disclosure draft synthesis
Supply-chain-scale audits

In the pipeline: Sovereign and air-gapped deployments only.

Deep dive

How the auto-router decides

Triage score picks the variant.

The router is not a vibe. A single scalar collapses confidence, complexity, hop depth, and tenant tier into a bucket that maps to one variant.

Lino runs at the commit

Inline on the developer machine, sub-100 ms, no network egress. Catches the obvious unsafe primitives before they ever reach a build server.

Eagle sweeps the repo

Post-CI batched scan across every package. Ranks and clusters candidate paths and assigns each one a confidence score on the dataflow head.

Auto-router scores triage

The ranked queue is scored on a 0.0–1.0 scale using confidence, complexity, hop depth, and the deployment tier the tenant is on.

Triage score picks the variant

0.0–0.4 → Lite · 0.4–0.6 → S · 0.6–0.75 → M · 0.75–0.9 → L · 0.9–1.0 → Zero (sovereign tier only). The chosen variant emits the reasoning trace.

Triage score → Griffin variant

0.00–0.40

Griffin Lite

0.40–0.60

Griffin S

0.60–0.75

Griffin M

0.75–0.90

Griffin L

0.90–1.00

Griffin Zero

Zero is only reachable for tenants on a sovereign or air-gapped deployment tier. Below 0.4, Eagle's verdict ships as-is and no Griffin pass is requested.

Honest about scope

What's NOT on this matrix.

The lineup is weighted for cybersecurity. It deliberately doesn't try to do the things below — a different model class would be the right tool.

Out of scope by design

General-purpose code generation outside a security context.
Image, audio, or video generation of any kind.
Customer-support or open-domain chat.
Translation, summarisation of marketing copy, or content rewriting.
Autocomplete for unrelated business logic.
Code review for style, lint, or developer-experience concerns.
Synthesising training data for unrelated downstream models.
Hardware fault diagnosis or non-software risk analysis.

The corpus is curated to defenders, taint graphs, CVE bodies, and patch diffs. We'd rather be excellent at a small set of security tasks than mediocre at a large set of unrelated ones.

Pick the right variant for your workload.

The matrix is the spec sheet. Book a call and we'll map your workflow onto the variants — including which findings the router should escalate, and which it should let Eagle settle.

Lineup overview Research notes

Every task. Which model handles it.

Three families, one corpus.

Lino

Eagle

Griffin

Capability by variant.

One card per variant.

Lino 1B

Eagle 13B

Griffin Lite 8B

Griffin S 14B

Griffin M 32B

Griffin L 70B

Griffin Zero 671B-MoE

Triage score picks the variant.

What's NOT on this matrix.

Out of scope by design

Pick the right variant for your workload.

Product

Solutions

Compare

Resources

Company

Legal

Developers