Methodology & Standards · Version 2.5

Monitoring activeMethodology v2.5Last audited May 2026Standards Board sign-off

How SafeGradeAI evaluates AI tools

Every certification is the product of a proprietary, evidence-driven review framework built specifically for schools, districts, and families. We evaluate what matters for children — privacy, moderation, age appropriateness, governance, and educational fit — using a structured internal methodology, and re-verify our findings on an ongoing basis. Certain internal review criteria remain confidential to preserve scoring integrity.

COPPA aligned
FERPA aligned
GDPR-K aligned
UK AADC informed
Independent review
Documentation verified
Human evaluated

See the five dimensions Badge standards

The Framework

Five evaluation dimensions, weighted for educational use

SafeGradeAI uses a proprietary multi-factor evaluation framework. Each certified product is reviewed against five public dimensions. Relative weighting reflects what most affects safety, suitability, and institutional risk when AI is used by minors — exact weightings and internal escalation thresholds are intentionally not disclosed to preserve framework integrity.

Radial diagram of the five SafeGradeAI evaluation dimensions: Privacy & Data Protection 25%, Content Safety & Moderation 25%, Age Appropriateness & Student Impact 20%, Educational Value & Alignment 15%, Transparency & Governance 15%.

Privacy & Data Handling

Weighting · Primary

How a tool collects, retains, shares, and uses data generated by minors and educational accounts.

Relative weightingPrimary

What we review

Data retention windows and deletion controls
Third-party data sharing and sub-processor disclosure
Student data usage and model-training opt-outs
Account creation requirements (email, age gate, parental consent)
Telemetry, analytics, and advertising signals collected
Regional handling for COPPA, FERPA, GDPR-K, and UK AADC

Moderation & Safety Controls

Weighting · Primary

How effectively the system prevents harmful outputs and resists adversarial attempts to bypass safeguards.

Relative weightingPrimary

What we review

Harmful content prevention across CSAM, self-harm, and violence categories
Jailbreak and prompt-injection resistance under repeated probes
Behavior on unsafe, leading, or manipulative prompts
Layered filtering: model-level, system-level, and post-response review
User reporting, escalation, and human-in-the-loop pathways
Recovery behavior after a filter breach or unsafe output

Age Appropriateness

Weighting · Substantial

Whether the experience is developmentally suitable for the audience the vendor claims to serve.

Relative weightingSubstantial

What we review

Reading level and language complexity versus stated audience
Emotional safety: tone, persona behavior, and parasocial risk
Developmental suitability of topics, imagery, and interaction patterns
Required supervision level and parental visibility
Consistency of age gating across web, mobile, and embedded surfaces
Behavior when a user self-reports as under the stated minimum age

Transparency & Governance

Weighting · Material

How clearly the vendor publishes, maintains, and stands behind its safety and data commitments.

Relative weightingMaterial

What we review

Published privacy policy, DPA, and child-safety statement
Vendor responsiveness to safety inquiries within 10 business days
Auditability of decisions: logs, model versioning, and change history
Explainability of refusals and content decisions to end users
Disclosure clarity around AI generation, sources, and limitations
Public incident history and remediation track record

Educational Alignment

Weighting · Material

How well the tool fits classroom workflows, district procurement requirements, and policy frameworks.

Relative weightingMaterial

What we review

Classroom usability for teachers and students under realistic load
Compatibility with common AUPs and district AI policies
District readiness: SSO, rostering, admin console, audit logs
Teacher controls over student-facing features and prompts
Implementation safeguards: pilot guidance, training, change management
Alignment with state and national digital learning standards

Scoring System

Four Certification Badges

Overall scores are calculated from the five weighted dimensions and mapped to one of four bands. Bands determine which audiences a tool is appropriate for and what level of oversight we recommend.

KidSafe Approved

85–100

Strong privacy protections and robust safety safeguards.

Suitable for unsupervised use within the stated age range. Vendor demonstrates mature privacy controls, layered moderation, transparent governance, and education-ready deployment options.

Examples of what lowers a score in this band

Ambiguous opt-out flows for model training
Partial admin console coverage
Inconsistent disclosure between marketing and policy pages

Tween Approved

75–84

Suitable with parental or institutional supervision.

Acceptable when paired with classroom supervision, a managed account, or active parental oversight. Some controls are present but not uniformly enforced across surfaces.

Examples of what lowers a score in this band

Long-context safety drift on extended sessions
Limited teacher controls over student-facing features
Sub-processor list missing or incomplete

Teen Approved

65–74

May require additional oversight or classroom controls.

Use only inside a managed environment with explicit institutional controls. Significant gaps in moderation, transparency, or age handling reduce suitability for unsupervised minors.

Examples of what lowers a score in this band

Weak age gating across mobile and embedded surfaces
No published incident history
Inconsistent refusal behavior under repeated probes

Not Recommended

Below 65

Significant moderation, privacy, or governance concerns.

Not appropriate for use by minors in our review. Concerns may include reported safety incidents, unbounded persona behavior, opaque data practices, or absent governance commitments.

Examples of what lowers a score in this band

Conversations stored and used for training without clear opt-out
Documented safety incidents involving minors
No reliable separation between adult and minor user experiences

Overall scores are generated through a structured internal review methodology that combines the five public dimensions with proprietary risk, escalation, and reassessment criteria. Exact dimension weights, escalation thresholds, and red-flag triggers are intentionally not disclosed to preserve framework integrity and reduce manipulation risk.

Tier assignments refresh whenever monitoring evidence, vendor policy changes, or reassessment review warrant a change.

Review Process

Seven stages, from intake to ongoing monitoring

Every certification follows the same procedural framework. Reviewers, timelines, and evidence requirements are defined in advance so the outcome is reproducible from the case file.

The vendor submits a structured intake covering product surface area, intended audience, data flows, and existing safety documentation. Intake is scoped before any review hours are spent.

Reviewer actions

Reviewer assigns a case ID and conflict-of-interest screen
Vendor receives an intake checklist within one business day
Scope agreement signed (surfaces, regions, languages)

Evidence & criteria

Product surface map (web, mobile, embedded, API)
Stated audience and minimum age
Regions where the product is offered

Ongoing Monitoring

A certification is a commitment, not a snapshot

AI products change quickly. SafeGradeAI treats every certification as a living record with continuous monitoring, scheduled re-evaluation, and the authority to act between cycles.

Monitoring Active

Live

Certified products are tracked continuously. Privacy policies, DPAs, and child-safety statements are diffed automatically.

Annual Re-evaluation

12-month cycle

Every certified product is fully re-reviewed against the current methodology version on a 12-month cycle.

Policy Updated

10-day SLA

Material policy changes are surfaced on the verification page within 10 business days, with reviewer notes attached.

Status Can Change

Always revocable

Certifications can be paused, downgraded, or revoked between cycles when incidents, policy changes, or test results warrant it.

Badge Standards

Display standards for vendors and publishers

Each SafeGradeAI badge represents a specific, verifiable claim. These standards protect the meaning of the certification for the schools, districts, and families that rely on it.

Allowed

Display the badge on a product, pricing, or trust page
Include the badge in procurement responses and security reviews
Link the badge to the public SafeGradeAI verification page for the product
Use the badge alongside the certification date and tier label
Reduce or enlarge the badge proportionally, preserving aspect ratio

Prohibited

Modify the badge artwork, color, gradient, or typography
Display the badge for a product that has not been certified
Display a tier higher than the one issued
Continue to display the badge after revocation or expiry
Imply endorsement of features outside the evaluated surface area

Display specifications

Minimum height: 32px digital · 12mm print
Clear space: ≥ 25% of badge height on all sides
Background contrast: ≥ 3:1 against badge edge
Accessible alt text: “SafeGradeAI <tier> certified”
File format: SVG preferred · PNG @2x acceptable
Revalidation cadence: Every 12 months

Verification

Every badge resolves to a public record

Every certified product is displayed on a permanent public verification page that shows the current tier, the last review date, and the methodology version it was scored against. Anyone can confirm a badge in two clicks.

Tier and overall score, with the methodology version
Reviewer date and next scheduled re-evaluation
Certification status and validity period

Browse certified tools Apply for certification

Revocation

Certifications can be paused, downgraded, or revoked

A SafeGradeAI badge is a commitment maintained between reviews. The Standards Committee can take action at any time, with the rationale published on the verification page.

Triggers for re-review or revocation

Material change to privacy policy, DPA, or sub-processor list
Documented safety incident affecting minors on the product
Failure to provide reviewer access for scheduled re-evaluation
Misuse of the badge — including unapproved tier or expired display

When a certification is paused or revoked, the vendor is notified the same business day and the public verification page is updated within 24 hours. Display of the badge by the vendor must end immediately and remain off until reinstatement.

Use the framework

Trusted by schools, families, and procurement teams

Browse certified tools, learn what each badge means, or submit a product for independent evaluation under the current methodology.

Explore certified tools Badge standards Submit a tool

Methodology metadata

Current version: 2.5 · published Mar 2026
Next scheduled revision: Sep 2026 (semi-annual)
Public change log: Maintained per certification
Standards committee: Independent review panel

How SafeGradeAI evaluates AI tools

Five evaluation dimensions, weighted for educational use

Privacy & Data Handling

Moderation & Safety Controls

Age Appropriateness

Transparency & Governance

Educational Alignment

Four Certification Badges

KidSafe Approved

Tween Approved

Teen Approved

Not Recommended

Seven stages, from intake to ongoing monitoring

Submission & Intake

Documentation Review

Safety Testing

Human Evaluation

Verification

Certification Decision

Ongoing Monitoring

A certification is a commitment, not a snapshot

Monitoring Active

Annual Re-evaluation

Policy Updated

Status Can Change

Display standards for vendors and publishers

Allowed

Prohibited

Display specifications

Every badge resolves to a public record

Certifications can be paused, downgraded, or revoked

Triggers for re-review or revocation

Trusted by schools, families, and procurement teams

How SafeGradeAI evaluates AI tools

Privacy & Data Handling

Moderation & Safety Controls

Age Appropriateness

Transparency & Governance

Educational Alignment

KidSafe Approved

Tween Approved

Teen Approved

Not Recommended

Stage 1Day 0 — same business daySubmission & Intake

Submission & Intake

Stage 2Days 1–4Documentation Review

Documentation Review

Stage 3Days 4–8Safety Testing

Safety Testing

Stage 4Days 6–10Human Evaluation

Human Evaluation

Stage 5Days 10–12Verification

Verification

Stage 6Days 12–14Certification Decision

Certification Decision

Stage 7Continuous · 12-month cycleOngoing Monitoring

Ongoing Monitoring

Monitoring Active

Annual Re-evaluation

Policy Updated

Status Can Change

Allowed

Prohibited

Display specifications

Every badge resolves to a public record

Triggers for re-review or revocation

Trusted by schools, families, and procurement teams