Censys Universal Internet Dataset
The Censys Universal Internet Dataset is the most comprehensive Internet-wide scan data in the industry. This dataset supersedes the legacy IPv4 and IPv4 Banners datasets.
Censys global scanning engine continuously walks the entire IPv4 space, detecting 107 protocols on over 3500 ports to produce visibility in a high-resolution map of the public Internet. This provides a significant tool for threat hunters, attack surface managers, and other security professionals.
Every day, a 'snapshot' of every service on the Internet is taken and added as a partition to a dataset available to Censys Enterprise customers, who can query this data using Google BigQuery.
Scan data in any snapshot is never more than 10 days old, and the data for a service is either structured protocol-specific key-value pairs or, if the service is using an unknown protocol, banner data and/or TLS information.
Usage
BigQuery Dataset Names
- The entire history of daily Internet snapshots:
censys-io.universal_internet_dataset_v2
- Today’s snapshot of the live Internet:
censys-io.universal_internet_dataset_v2.current
- A snapshot from a specific day:
censys-io.universal_internet_dataset_v2.{yyyyMMdd}
Data model
Read this introduction to the host data model to learn how Censys represents Internet hosts.
Schema
Explore the Censys Universal Internet Dataset schema for fields you can query.
Query Syntax
Use Standard SQL to query the Censys datasets, which is now the default for Google BigQuery.
Examples
Want to know the port distribution of SMTP services using the compromised TLSv1_0 encryption method on Jun. 9, 2021? Try this query:
SELECT
tls.version_selected,
service_name,
port,
COUNT(*) AS count
FROM
`censys-io.universal_internet_dataset.20210609`,
UNNEST(services)
WHERE
tls.version_selected = 'TLSv1_0'
AND service_name = 'SMTP'
GROUP BY
1,
2,
3
ORDER BY
4 DESC
Want to see the software (in CPE format) reported by Telnet services across all ports today? Try running this query:
SELECT
software.vendor,
software.product,
software.version,
port,
COUNT(*) AS count
FROM
`censys-io.universal_internet_dataset.current`,
UNNEST(services),
UNNEST(software) AS software
WHERE
service_name = 'TELNET'
GROUP BY
1,2,3,4
ORDER BY
5 DESC
Or see a count of all the Kubernetes services across the Internet on May 25, 2021, grouped by the port they’re running on:
SELECT
port,
COUNT(*) AS count
FROM
`censys-io.universal_internet_dataset.20210525`,
UNNEST(services)
WHERE
service_name = 'KUBERNETES'
GROUP BY
1
ORDER BY
2 DESC
Updated 18 days ago