Verified Performance Evidence
Raw benchmark output across execution environments. Single-run baseline, concurrent saturation tests, and live cloud deployments. All results are end-to-end: source file through destination write, with full row-count and checksum validation.
2,516,818 Rows/sec
One worker. One file. One machine. The cleanest measurement of the engine itself, uncomplicated by concurrency dynamics.
DataForge Bench v0.4.1 Config: bench.yaml · Workers: 1 · Source: courtlistener_opinions_2024.csv Stage 1 Source validation PASS 75,814,101 rows detected Stage 2 Schema inference PASS 42 columns mapped Stage 3 Ingestion PASS elapsed 30.123s Rows inserted 75,814,101 Rows/sec 2,516,818 Throughput 86.0 MB/sec Malformed 0 Dropped 0 Delta 0 Stage 4 Validation PASS row count, schema, checksum Run complete.
5,426,774 Aggregate Rows/sec
10 parallel workers. 758M total rows. Disk was not yet the bottleneck.
Storage Saturation — Lossless
20 parallel workers pushed the test device to its physical write ceiling. The engine held. No data loss. No job failures.
At 20 workers, DataForge reached the physical write ceiling of the test storage device — sustaining peak writes of 1,954 MB/s. Throughput held at 5.2M aggregate rows/sec. No data loss. No job failures. The architecture degraded gracefully against hardware limits rather than introducing errors or partial writes. This is the key integrity result: storage saturation surfaced as reduced throughput, not as data corruption.
Cross-Cloud Deployment
Cloud numbers carry honest overhead — cold start, network transport, database writes over the network. These are real production numbers, not compute-only figures.
Enterprise Plus · 8 vCPU / 64 GB
5-region cluster · end-to-end validated
Azure Database for PostgreSQL
Run It on Your Infrastructure
The benchmark harness is available for download. Register to receive the binary, configuration, and runbook. No sales call required. Results unlock immediately after email verification.
talk throughput?
Pilot discussions, investor conversations, enterprise architecture review, or technical deep-dives.