Skip to content

Commit ae5c443

Browse files
Updated explanatory text for query.
1 parent 4c535ef commit ae5c443

1 file changed

Lines changed: 15 additions & 4 deletions

File tree

notebooks/pathomics/lung_cancer_cptac_DataExploration.ipynb

Lines changed: 15 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,15 @@
11
{
22
"cells": [
3+
{
4+
"cell_type": "markdown",
5+
"metadata": {
6+
"id": "view-in-github",
7+
"colab_type": "text"
8+
},
9+
"source": [
10+
"<a href=\"https://colab.research.google.com/github/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/lung_cancer_cptac_DataExploration.ipynb\" target=\"_parent\"><img src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"Open In Colab\"/></a>"
11+
]
12+
},
313
{
414
"cell_type": "markdown",
515
"metadata": {
@@ -146,11 +156,11 @@
146156
"id": "TM1g60Hx_Iij"
147157
},
148158
"source": [
149-
"IDC relies on the Google Cloud Platform (GCP) for storage and management of DICOM data. The data are contained in so-called [storage buckets](https://cloud.google.com/storage/docs/key-terms#buckets), from which they can be retrieved on a requester pays basis. Currently, all pathology whole-slide images (WSI) are located in the *idc-open* bucket.\n",
159+
"IDC relies on the Google Cloud Platform (GCP) for storage and management of DICOM data. The data are contained in so-called [storage buckets](https://cloud.google.com/storage/docs/key-terms#buckets).\n",
150160
"\n",
151-
"Metadata for the DICOM files — including standard DICOM tags, but also non-DICOM metadata — are stored in the BigQuery table *canceridc-data.idc_current.dicom_all*. The IDC Documentation gives further information on [data organization](https://learn.canceridc.dev/data/organization-of-data) and [code examples](https://learn.canceridc.dev/cookbook/bigquery) on how to query the table. The easiest way to access BigQuery tables from a Jupyter notebook is to use [BigQuery cell magic](https://cloud.google.com/bigquery/docs/visualize-jupyter#querying-and-visualizing-bigquery-data) with the `%%bigquery` command.\n",
161+
"Metadata for the DICOM files — including standard DICOM tags, but also non-DICOM metadata — are stored in the BigQuery table *bigquery-public-data.idc_v11.dicom_all*. The IDC Documentation gives further information on [data organization](https://learn.canceridc.dev/data/organization-of-data) and [code examples](https://learn.canceridc.dev/cookbook/bigquery) on how to query the table. The easiest way to access BigQuery tables from a Jupyter notebook is to use [BigQuery cell magic](https://cloud.google.com/bigquery/docs/visualize-jupyter#querying-and-visualizing-bigquery-data) with the `%%bigquery` command.\n",
152162
"\n",
153-
"The following statement loads relevant metadata of all slide images from the CPTAC-LUAD and CPTAC-LSCC datasets into a pandas data frame called `slides_df`."
163+
"The following statement loads relevant metadata of all slide images from the CPTAC-LUAD and CPTAC-LSCC datasets into a pandas data frame called `slides_df`. The query might look intimidating at first, however accompanyinig comments try to explain every step in the query."
154164
]
155165
},
156166
{
@@ -378,7 +388,8 @@
378388
"name": "cptac_use_case.ipynb",
379389
"private_outputs": true,
380390
"provenance": [],
381-
"toc_visible": true
391+
"toc_visible": true,
392+
"include_colab_link": true
382393
},
383394
"environment": {
384395
"name": "tf2-gpu.2-6.m79",

0 commit comments

Comments
 (0)