================================================================================
  The upscaler.video Codec Support Dataset
  Version: 2026-01-14
  License: CC-BY 4.0
================================================================================

OVERVIEW
--------
This dataset contains 45,519,786 individual WebCodecs API compatibility tests
collected from 143,181 real user sessions at free.upscaler.video.

Each row represents one codec string tested on one user's device, including
the full browser user agent string for detailed analysis.

DATASET FILE
------------
upscaler-video-codec-dataset-raw.csv (2-3 GB uncompressed)

CSV SCHEMA
----------
timestamp       - ISO 8601 timestamp (UTC) of when the test was performed
user_agent      - Full browser user agent string
browser         - Browser family (Chrome, Safari, Edge, Firefox, Unknown)
platform_raw    - Raw platform identifier from navigator.platform
platform        - Normalized platform (Windows, macOS, iOS, Android, Linux)
codec           - WebCodecs codec string tested (e.g., "av01.0.01M.08")
supported       - Whether codec was supported (true/false)

EXAMPLE ROW
-----------
2026-01-05T00:54:11.570Z,"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36...",Edge,Win32,Windows,av01.0.01M.08,true
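
As a rough illustration, the example row can be parsed with Python's standard
csv module. The user_agent field is quoted in the example row, so a plain
split on "," is best avoided (real user agent strings often contain commas).
The parse_row helper and the COLUMNS list are hypothetical names used only
for this sketch; the column order is taken from the schema above.

```python
import csv
import io
from datetime import datetime

# Column order as listed under CSV SCHEMA.
COLUMNS = ["timestamp", "user_agent", "browser", "platform_raw",
           "platform", "codec", "supported"]


def parse_row(line):
    """Parse one CSV data line into a dict with typed timestamp/supported."""
    values = next(csv.reader(io.StringIO(line)))
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
    row = dict(zip(COLUMNS, values))
    # Replace the trailing "Z" so fromisoformat also works on older Pythons.
    row["timestamp"] = datetime.fromisoformat(
        row["timestamp"].replace("Z", "+00:00"))
    # The file stores the flag as the text "true" or "false".
    row["supported"] = row["supported"].strip().lower() == "true"
    return row


example = ('2026-01-05T00:54:11.570Z,'
           '"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36...",'
           'Edge,Win32,Windows,av01.0.01M.08,true')
row = parse_row(example)
print(row["browser"], row["platform"], row["codec"], row["supported"])
```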

QUICK STATS
-----------
Total Tests:       45,519,786
Sessions:          143,181
Unique Codecs:     1,087
Collection Period: January 2026
Data Source:       free.upscaler.video

PLATFORM NORMALIZATION
----------------------
The 'platform' column applies the following normalization:
  Win32, Win64 → Windows
  MacIntel → macOS
  iPhone, iPad → iOS
  Linux armv8*, Android* → Android
  Linux x86_64, Linux aarch64 → Linux

The 'platform_raw' column contains the original navigator.platform value.
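
The mapping above can be sketched in Python as follows, for example to
re-derive 'platform' from 'platform_raw'. This is an approximation, not the
code used to build the dataset: normalize_platform is a hypothetical helper,
the "*" wildcards are assumed to mean prefix matches, and returning None for
unlisted values is an assumption.

```python
def normalize_platform(platform_raw):
    """Map a navigator.platform value to the normalized platform name."""
    value = platform_raw.strip()
    if value in ("Win32", "Win64"):
        return "Windows"
    if value == "MacIntel":
        return "macOS"
    if value in ("iPhone", "iPad"):
        return "iOS"
    # Assumption: "Linux armv8*" and "Android*" are prefix matches.
    if value.startswith("Linux armv8") or value.startswith("Android"):
        return "Android"
    if value in ("Linux x86_64", "Linux aarch64"):
        return "Linux"
    # Assumption: values not listed in the mapping have no normalized form.
    return None


for raw in ("Win32", "MacIntel", "iPad", "Linux armv8l", "Linux x86_64"):
    print(raw, "->", normalize_platform(raw))
```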

USAGE EXAMPLES
--------------
Python (pandas):
  import pandas as pd
  df = pd.read_csv('upscaler-video-codec-dataset-raw.csv')

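  # Calculate global support (%) per codec.
  # read_csv may parse true/false as booleans, so normalize via strings first.
  is_supported = df['supported'].astype(str).str.lower() == 'true'
  support = is_supported.groupby(df['codec']).mean() * 100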
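
Because the file is 2-3 GB uncompressed, loading it in one call may not fit
in memory on smaller machines. A minimal sketch of the same per-codec
calculation done in chunks is shown here. It assumes the CSV has a header row
with the column names from the schema; support_by_codec and the chunk size
are illustrative choices, not part of the dataset tooling.

```python
import pandas as pd


def support_by_codec(csv_source, chunksize=1_000_000):
    """Return percent support per codec, reading the CSV in chunks."""
    totals = None
    reader = pd.read_csv(
        csv_source,
        usecols=["codec", "supported"],          # skip the large user_agent column
        dtype={"codec": str, "supported": str},  # keep "true"/"false" as text
        chunksize=chunksize,
    )
    for chunk in reader:
        ok = (chunk["supported"].str.lower() == "true").astype("int64")
        # Per-chunk supported count and test count for each codec.
        part = ok.groupby(chunk["codec"]).agg(["sum", "count"])
        totals = part if totals is None else totals.add(part, fill_value=0)
    if totals is None:
        return pd.Series(dtype=float)
    return (totals["sum"] / totals["count"] * 100).sort_values(ascending=False)


if __name__ == "__main__":
    import os
    path = "upscaler-video-codec-dataset-raw.csv"
    if os.path.exists(path):
        print(support_by_codec(path).head(20))
```

Summing counts per chunk and dividing at the end gives the same result as a
single groupby over the full file, since each percentage is just supported
tests divided by total tests for that codec.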

R:
  library(readr)
  df <- read_csv('upscaler-video-codec-dataset-raw.csv')

  # Filter for specific browser
  chrome_only <- df[df$browser == 'Chrome',]

SQL (import to database):
  CREATE TABLE codec_tests (
    timestamp TIMESTAMP,
    user_agent TEXT,
    browser VARCHAR(50),
    platform_raw VARCHAR(50),
    platform VARCHAR(50),
    codec VARCHAR(100),
    supported BOOLEAN
  );

CITATION
--------
If you use this dataset in research or publications, please cite:

  Bhattacharyya, Samrat. (2026). The upscaler.video Codec Support Dataset.
  Version 2026-01-14. https://free.upscaler.video/research/webcodecs-dataset

BibTeX:
  @dataset{upscaler_codec_dataset_2026,
    title   = {The upscaler.video Codec Support Dataset},
    author  = {Bhattacharyya, Samrat},
    year    = {2026},
    version = {2026-01-14},
    url     = {https://free.upscaler.video/research/webcodecs-dataset},
    note    = {45.5M codec tests from 143k sessions}
  }

LICENSE
-------
Creative Commons Attribution 4.0 International (CC-BY 4.0)

You are free to:
- Share and redistribute in any format
- Adapt, remix, and build upon the data
- Use commercially

Under the following terms:
- Attribution: Credit "The upscaler.video Codec Support Dataset" with a link

Full license: https://creativecommons.org/licenses/by/4.0/

MORE INFORMATION
----------------
Dataset Documentation:
  https://webcodecsfundamentals.org/datasets/codec-support/

Methodology Details:
  https://free.upscaler.video/research/webcodecs-dataset

Codec Registry (interactive table):
  https://webcodecsfundamentals.org/datasets/codec-registry/

Questions or Issues:
  https://github.com/sb2702/webcodecs-fundamentals/issues

================================================================================
Dataset collected using free.upscaler.video
An open-source reference implementation of production WebCodecs workflows
================================================================================
