New Profile
Introduction
The telemetry_new_profile_parquet
table is the most direct representation of a new-profile ping.
Contents
The table contains one row for each ping. Each column represents one field from the new-profile ping payload, though only a subset of all fields are included.
Accessing the Data
The data is stored as a parquet table in S3 at the following address. See this cookbook to get started working with the data in Spark.
s3://net-mozaws-prod-us-west-2-pipeline-data/telemetry-new-profile-parquet/v2/
The telemetry_new_profile_parquet
is accessible through re:dash.
Here's an example query.
Further Reading
This dataset is generated automatically using direct to parquet. The configuration responsible for generating this dataset was introduced in bug 1360256.
Data Reference
Schema
As of 2018-06-26, the current version of the telemetry_new_profile_parquet
dataset is v2
, and has a schema as follows:
root
|-- id: string (nullable = true)
|-- client_id: string (nullable = true)
|-- metadata: struct (nullable = true)
| |-- timestamp: long (nullable = true)
| |-- date: string (nullable = true)
| |-- normalized_channel: string (nullable = true)
| |-- geo_country: string (nullable = true)
| |-- geo_city: string (nullable = true)
| |-- geo_subdivision1: string (nullable = true)
| |-- geo_subdivision2: string (nullable = true)
| |-- creation_timestamp: long (nullable = true)
| |-- x_ping_sender_version: string (nullable = true)
|-- environment: struct (nullable = true)
| |-- build: struct (nullable = true)
| | |-- application_name: string (nullable = true)
| | |-- architecture: string (nullable = true)
| | |-- version: string (nullable = true)
| | |-- build_id: string (nullable = true)
| | |-- vendor: string (nullable = true)
| | |-- hotfix_version: string (nullable = true)
| |-- partner: struct (nullable = true)
| | |-- distribution_id: string (nullable = true)
| | |-- distribution_version: string (nullable = true)
| | |-- partner_id: string (nullable = true)
| | |-- distributor: string (nullable = true)
| | |-- distributor_channel: string (nullable = true)
| | |-- partner_names: array (nullable = true)
| | | |-- element: string (containsNull = true)
| |-- settings: struct (nullable = true)
| | |-- is_default_browser: boolean (nullable = true)
| | |-- default_search_engine: string (nullable = true)
| | |-- default_search_engine_data: struct (nullable = true)
| | | |-- name: string (nullable = true)
| | | |-- load_path: string (nullable = true)
| | | |-- origin: string (nullable = true)
| | | |-- submission_url: string (nullable = true)
| | |-- telemetry_enabled: boolean (nullable = true)
| | |-- locale: string (nullable = true)
| | |-- attribution: struct (nullable = true)
| | | |-- source: string (nullable = true)
| | | |-- medium: string (nullable = true)
| | | |-- campaign: string (nullable = true)
| | | |-- content: string (nullable = true)
| | |-- update: struct (nullable = true)
| | | |-- channel: string (nullable = true)
| | | |-- enabled: boolean (nullable = true)
| | | |-- auto_download: boolean (nullable = true)
| |-- system: struct (nullable = true)
| | |-- os: struct (nullable = true)
| | | |-- name: string (nullable = true)
| | | |-- version: string (nullable = true)
| | | |-- locale: string (nullable = true)
| |-- profile: struct (nullable = true)
| | |-- creation_date: long (nullable = true)
|-- payload: struct (nullable = true)
| |-- reason: string (nullable = true)
|-- submission: string (nullable = true)
For more detail on the raw ping these fields come from, see the raw data.