What is Bridge Downstream?

Bridge Downstream is a data pipeline that takes data uploaded to Bridge by a digital health app and turns it into parquet datasets. These parquet datasets are written to an S3 bucket and can be accessed through Synapse.

Why bother doing all that?

A digital health app typically sends data to Bridge as a .zip archive of JSON files. This is not an easy format for analysts to work with – we want data frames! A parquet dataset is a normalized, relational version of the data in these JSON files and can be loaded directly as a data frame.
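
To make "normalized" concrete, here is a minimal sketch of how one nested JSON record could be split into flat, table-like rows. It is only an illustration, not the pipeline's actual logic or schema: the `normalize` helper, the `record_id` and `index` columns, and the sample field names (`assessment`, `steps`, `taps`) are all hypothetical.

```python
import json


def normalize(record, record_id):
    """Split one nested JSON record into a parent row and child rows.

    Hypothetical helper: scalar fields stay on the parent row, and each
    list field becomes its own child table linked back by record_id.
    """
    parent = {"record_id": record_id}
    children = {}
    for key, value in record.items():
        if isinstance(value, list):
            rows = []
            for i, item in enumerate(value):
                # Dict items contribute their own columns; bare values
                # are stored under a generic "value" column.
                columns = item if isinstance(item, dict) else {"value": item}
                rows.append({"record_id": record_id, "index": i, **columns})
            children[key] = rows
        else:
            parent[key] = value
    return parent, children


# Made-up example record, standing in for one JSON file from an archive.
raw = (
    '{"assessment": "tapping", '
    '"steps": [{"name": "left", "taps": 12}, {"name": "right", "taps": 15}]}'
)
parent, children = normalize(json.loads(raw), record_id="rec-001")
```

Here `parent` holds the scalar fields of the record and `children["steps"]` holds one row per step, each carrying the same `record_id`. Flat rows like these are what load naturally into a data frame and can be joined on the shared key.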

Why is the parquet folder in my Synapse project empty?

Parquet data is written to an S3 bucket that acts as the external storage location of the parquet folder. For instructions on how to access the Parquet datasets in this external storage location, see Getting Started.

How do I interpret the names of the Parquet datasets?

The name of each Parquet dataset indicates the type of assessment data it contains. For details, see Understanding Parquet Datasets.
