DataRadar

About DataRadar

DataRadar is a privacy-first data quality profiling tool designed for locked-down environments where installing software isn't possible. All processing happens client-side in your browser using WebAssembly - your data never leaves your laptop.

The Problem

Council workers, NHS staff, university researchers, and SMB analysts often face a frustrating challenge: they receive unknown datasets but can't install the tools needed to validate them. Their "terrible Windows laptop" has no Python, no R, no SQL tools - just a web browser.

Meanwhile, data quality issues like encoding problems, nulls in unexpected places, and format inconsistencies go unnoticed until they cause failures downstream.

The Solution

DataRadar brings enterprise-grade data profiling to your browser. Upload a CSV, Excel file, or JSON dataset and instantly see:

Product Ladder

DataRadar is the entry point to a complete data quality ecosystem. As your needs grow, our tools scale with you.

1. DataRadar (Browser Tool)

Free • Perfect for small datasets (up to ~50K rows)
  • Zero installation - works in any modern browser
  • 100% client-side processing - complete privacy
  • CSV, Excel, JSON, NDJSON support
  • Data quality and character profiling reports
  • Perfect for quick sanity checks and exploratory analysis
Try DataRadar Now

2. bytefreq (Command-Line Tool)

Free & Open Source • Millions of rows • Multi-threaded
  • Blazing-fast Rust implementation
  • Multi-threaded processing with Rayon
  • Handles files too large for browser tools
  • Pipe-based workflows (e.g., cat data.csv | bytefreq)
  • Enhanced JSON output with data quality assertions
  • Ideal for DIY users, data engineers, and CI/CD pipelines
Download CLI Tool

3. DQPRE (Enterprise Spark Engine)

Enterprise • Billions/Trillions of rows • Public & Private Sector
  • Scala/Spark implementation for massive datasets
  • Configurable assertion rules engine
  • Custom data quality validations
  • Proven at scale in UK Central Government
  • CI/CD integration for automated quality gates
  • Consulting and support packages available
Contact for Pricing

Why This Approach?

Most data quality tools follow one of two extremes: either they're massive enterprise platforms requiring complex setup, or they're simple scripts that don't scale. DataRadar fills the gap with a freemium funnel that meets you where you are:

Who We Are

DataRadar is developed by Andrew Morgan, author of Mastering Spark for Data Science, and is based on bytefreq, a 20 year old data profiling toolkit originally coded in awk. More recently Andrew led the development of DQPRE, a bespoke data quality tool that scales to needs of the UK Central Government, a modern scala tool now opensourced. Andrew's latest data quality project is dataradar, a wasm based browser tool powered by bytefreq, an open-source Rust library implementing the DQOR (Data Quality On Read) pattern.

Get Started

Try DataRadar Now Download CLI Tool