Skip to content

ata-core/ata-validator

Repository files navigation

ata-validator

Ultra-fast JSON Schema validator powered by simdjson. Multi-core parallel validation, RE2 regex, codegen bytecode engine. Standard Schema V1 compatible.

ata-validator.com

Performance

Single-Document Validation

Scenario ata ajv
validate(obj) valid 76M ops/sec 8M ops/sec ata 9.5x faster
validate(obj) invalid 34M ops/sec 8M ops/sec ata 4.3x faster
isValidObject(obj) 15.4M ops/sec 9.2M ops/sec ata 1.7x faster
validateJSON(str) valid 2.15M ops/sec 1.88M ops/sec ata 1.1x faster
validateJSON(str) invalid 2.17M ops/sec 2.29M ops/sec ata 1.1x faster
Schema compilation 113K ops/sec 818 ops/sec ata 138x faster

validate(obj) numbers are isolated single-schema benchmarks. Multi-schema benchmark overhead reduces throughput; real-world numbers depend on workload.

Large Data — JS Object Validation

Size ata ajv
10 users (2KB) 6.2M ops/sec 2.5M ops/sec ata 2.5x faster
100 users (20KB) 658K ops/sec 243K ops/sec ata 2.7x faster
1,000 users (205KB) 64K ops/sec 23.5K ops/sec ata 2.7x faster

Real-World Scenarios

Scenario ata ajv
Serverless cold start (50 schemas) 7.7ms 96ms ata 12.5x faster
ReDoS protection (^(a+)+$) 0.3ms 765ms ata immune (RE2)
Batch NDJSON (10K items, multi-core) 13.4M/sec 5.1M/sec ata 2.6x faster
Fastify HTTP (100 users POST) 24.6K req/sec 22.6K req/sec ata 9% faster

ata is faster than ajv on every benchmark — valid and invalid data, objects and JSON strings, single documents and parallel batches.

How it works

Hybrid validator: ata compiles schemas into monolithic JS functions identical to the boolean fast path, but returning VALID_RESULT on success and calling the error collector on failure. V8 TurboFan optimizes it identically to a pure boolean function — error code is dead code on the valid path. No try/catch (3.3x V8 deopt), no lazy arrays, no double-pass.

JS codegen: Schemas are compiled to monolithic JS functions (like ajv). Supported keywords: type, required, properties, items, enum, const, allOf, anyOf, oneOf, not, if/then/else, uniqueItems, contains, prefixItems, additionalProperties, dependentRequired, $ref (local), minimum/maximum, minLength/maxLength, pattern, format.

V8 TurboFan optimizations: Destructuring batch reads, undefined checks instead of in operator, context-aware type guard elimination, property hoisting to local variables, tiered uniqueItems (nested loop for small arrays).

Adaptive simdjson: For large documents (>8KB) with selective schemas, simdjson On Demand seeks only the needed fields — skipping irrelevant data at GB/s speeds.

JSON Schema Test Suite

98.4% pass rate (937/952) on official JSON Schema Test Suite (Draft 2020-12).

When to use ata

  • Any validate(obj) workload — 4.3x–9.5x faster than ajv
  • Serverless / cold starts — 12.5x faster schema compilation
  • Security-sensitive apps — RE2 regex, immune to ReDoS attacks
  • Batch/streaming validation — NDJSON log processing, data pipelines (2.6x faster)
  • Standard Schema V1 — native support for Fastify v5, tRPC, TanStack
  • C/C++ embedding — native library, no JS runtime needed

When to use ajv

  • Schemas with patternProperties, dependentSchemas — these bypass JS codegen and hit the slower NAPI path
  • 100% spec compliance needed — ajv covers more edge cases (ata: 98.4%)

Features

  • Hybrid validator: 76M ops/sec — same function body as boolean check, returns result or calls error collector. No try/catch, no double pass
  • Multi-core: Parallel validation across all CPU cores — 13.4M validations/sec
  • simdjson: SIMD-accelerated JSON parsing at GB/s speeds, adaptive On Demand for large docs
  • RE2 regex: Linear-time guarantees, immune to ReDoS attacks (2391x faster on pathological input)
  • V8-optimized codegen: Destructuring batch reads, type guard elimination, property hoisting
  • Standard Schema V1: Compatible with Fastify, tRPC, TanStack, Drizzle
  • Zero-copy paths: Buffer and pre-padded input support — no unnecessary copies
  • Defaults + coercion: default values, coerceTypes, removeAdditional support
  • C/C++ library: Native API for non-Node.js environments
  • 98.4% spec compliant: Draft 2020-12

Installation

npm install ata-validator

Usage

Node.js

const { Validator } = require('ata-validator');

const v = new Validator({
  type: 'object',
  properties: {
    name: { type: 'string', minLength: 1 },
    email: { type: 'string', format: 'email' },
    age: { type: 'integer', minimum: 0 },
    role: { type: 'string', default: 'user' }
  },
  required: ['name', 'email']
});

// Fast boolean check — JS codegen (9.5x faster than ajv)
v.isValidObject({ name: 'Mert', email: 'mert@example.com', age: 26 }); // true

// Full validation with error details + defaults applied
const result = v.validate({ name: 'Mert', email: 'mert@example.com' });
// result.valid === true, data.role === 'user' (default applied)

// JSON string validation (simdjson fast path)
v.validateJSON('{"name": "Mert", "email": "mert@example.com"}');
v.isValidJSON('{"name": "Mert", "email": "mert@example.com"}'); // true

// Buffer input (zero-copy, raw NAPI)
v.isValid(Buffer.from('{"name": "Mert", "email": "mert@example.com"}'));

// Parallel batch — multi-core, NDJSON (2.6x faster than ajv)
const ndjson = Buffer.from(lines.join('\n'));
v.isValidParallel(ndjson);  // bool[]
v.countValid(ndjson);        // number

Options

const v = new Validator(schema, {
  coerceTypes: true,       // "42" → 42 for integer fields
  removeAdditional: true,  // strip properties not in schema
});

Standard Schema V1

const v = new Validator(schema);

// Works with Fastify, tRPC, TanStack, etc.
const result = v['~standard'].validate(data);
// { value: data } on success
// { issues: [{ message, path }] } on failure

Fastify Plugin

npm install fastify-ata
const fastify = require('fastify')();
fastify.register(require('fastify-ata'), {
  coerceTypes: true,
  removeAdditional: true,
});

// All existing JSON Schema route definitions work as-is

C++

#include "ata.h"

auto schema = ata::compile(R"({
  "type": "object",
  "properties": { "name": {"type": "string"} },
  "required": ["name"]
})");

auto result = ata::validate(schema, R"({"name": "Mert"})");
// result.valid == true

Supported Keywords

Category Keywords
Type type
Numeric minimum, maximum, exclusiveMinimum, exclusiveMaximum, multipleOf
String minLength, maxLength, pattern, format
Array items, prefixItems, minItems, maxItems, uniqueItems, contains, minContains, maxContains
Object properties, required, additionalProperties, patternProperties, minProperties, maxProperties, propertyNames, dependentRequired, dependentSchemas
Enum/Const enum, const
Composition allOf, anyOf, oneOf, not
Conditional if, then, else
References $ref, $defs, definitions, $id
Boolean true, false

Format Validators (hand-written, no regex)

email, date, date-time, time, uri, uri-reference, ipv4, ipv6, uuid, hostname

Building from Source

# C++ library + tests
cmake -B build
cmake --build build
./build/ata_tests

# Node.js addon
npm install
npm run build
npm test

# JSON Schema Test Suite
npm run test:suite

License

MIT

Author

Mert Can Altin

About

Ultra-fast JSON Schema validator. 9.5x faster than ajv (76M ops/sec), 138x faster compilation, ReDoS-immune (RE2). Hybrid JS codegen + simdjson + multi-core. Standard Schema V1.

Topics

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages