Skip to content

Commit 466f7de

Browse files
committed
Updating DSC180 pages
1 parent 0b6bb72 commit 466f7de

7 files changed

Lines changed: 57 additions & 45 deletions

File tree

_data/navigation.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,3 +9,5 @@ main:
99
url: /philosophy/
1010
- title: "Media"
1111
url: /media/
12+
- title: "Teaching"
13+
url: /teaching/

_pages/dsc_180.md

Lines changed: 15 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -1,47 +1,46 @@
11
---
22
permalink: /dsc_180/
33
title: "Deep Learning for Climate Model Emulation"
4-
layout: single
4+
layout: projects
55
toc: true
66
toc_sticky: true
7+
featured_figure:
8+
image: /assets/images/climate_change.jpeg
79
---
8-
### Data Science Capstone Domain - DSC 180AB
9-
### Section B03 (TA: Yanyi)
10-
10+
# Deep Learning for Climate Model Emulation
11+
**Data Science Capstone - DSC 180A/B Section B03 (TA: Yanyi)**
1112

1213
## Introduction to Topic
1314

14-
The choices humanity makes in the next few decades will determine how much warmer the Earth will be by the end of the century, with implications for billions of lives and trillions of dollars in GDP. Many different emission pathways exist that are compatible with the Paris climate agreement, and many more are possible that miss that target. While some of the most complex climate models have simulated a small selection of these, it is impractical to use these computationally expensive models to fully explore the space of possibilities or assess all the associated risks. Our lab has recently developed state-of-the-art climate model emulators to enable fast, accurate and reliable predictions for any given scenario (https://github.com/duncanwp/ClimateBench).
15-
15+
The choices humanity makes in the next few decades will determine how much warmer the Earth will be by the end of the century, with implications for billions of lives and trillions of dollars in GDP. Many different emission pathways exist that are compatible with the Paris climate agreement, and many more are possible that miss that target. While some of the most complex climate models have simulated a small selection of these, it is impractical to use these computationally expensive models to fully explore the space of possibilities or assess all the associated risks. Our lab has recently developed a state-of-the-art climate model emulation benchmark to enable fast, accurate and reliable predictions for any given scenario: [ClimateBench](<https://github.com/duncanwp/ClimateBench>).
1616

1717
## Phase I - Replication
1818

1919
The aim of reproducing a paper's results is to affirm the original authors' findings and methodologies. This process is vital in science to ensure results are robust and reliable, not merely due to chance or error. Reproduction would reinforce the evidence that the constructed emulators are faithfully reproducing the underlying climate model and can be trusted for such tasks. It also provides a deeper understanding of the applied methods like long short-term memory networks. Ultimately, this endeavor seeks to enable fast and efficient sampling of different climate scenarios to improve decision making.
2020

2121
The paper, linked here, we will be working with is:
22-
> **Watson-Parris, D.**, Rao, Y., Olivié, D., Seland, Ø., ... "ClimateBench v1.0: A benchmark for data-driven climate projections". *Journal of Advances in Modeling Earth Systems 14, e2021MS002954*: <https://doi.org/10.1029/2021MS002954>
22+
> Watson-Parris, D., Rao, Y., Olivié, D., Seland, Ø., ... "ClimateBench v1.0: A benchmark for data-driven climate projections". *Journal of Advances in Modeling Earth Systems 14, e2021MS002954*: <https://doi.org/10.1029/2021MS002954>
2323
2424

2525
### Accessing the ClimateBench Dataset
2626

2727
While the processed dataset is publically available, it will be instructive for you to generate it yourselves, and an important part of the replication process. You will be provided with access to [Casper](https://arc.ucar.edu/knowledge_base/70549550) data analysis cluster at the National Center for Atmospheric Research (NCAR) with sufficient resources to perform the analyses throughout the project. Please note this is a national facility with shared resources so be mindful of your requests and be sure to abide by their rules.
2828

29-
The data we will use is available from the sixth Coupled Model Intercomparison Project (CMIP6) which represents the combined efforts of dozens of international research laboratories running hundreds of thousands of simulation years of experiments. The data (all 30 petabytes!) is publically archived and available e.g. here: https://esgf-index1.ceda.ac.uk/projects/esgf-ceda/, and also recently mirrored to the cloud here: https://registry.opendata.aws/cmip6/. Fortunately, all the data you will need is already available on Casper so you shouldn't need to download any large datasets, which can be quite cumbersome.
29+
The data we will use is available from the sixth Coupled Model Intercomparison Project (CMIP6) which represents the combined efforts of dozens of international research laboratories running hundreds of thousands of simulation years of experiments. The data (all 30 petabytes!) is publically archived and available e.g. [here](https://esgf-index1.ceda.ac.uk/projects/esgf-ceda/), and also recently mirrored to the cloud [here](https://registry.opendata.aws/cmip6/). Fortunately, all the data you will need is already available on Casper so you shouldn't need to download any large datasets, which can be quite cumbersome.
3030

3131
### Schedule
3232

3333
Click the "topic" links below for details regarding the readings, questions, and tasks for that week.
3434

3535
| Week | Topic |
3636
| --- | --- |
37-
| Summer | [Summer preperation](dsc_180_summer) |
38-
| 1 | [Introduction to topic, domain, and paper](dsc_180_intro) |
39-
| 2-3 | [Dive into the ClimateBench dataset](dsc_180_data) |
40-
| 4-5 | [Begin data preprocessing and learn about xarray](dsc_180_xarray) |
41-
| 6-7 | [Start implementing regression models](dsc_180_implement) |
42-
| 8-9 | [Perform validation and testing of baselines](dsc_180_validate) |
43-
| 10 | [Project wrap up and debrief](dsc_180_debrief) |
44-
37+
| Summer | [Summer preperation](/dsc_180_summer) |
38+
| 1 | [Introduction to topic, domain, and paper](/dsc_180_intro) |
39+
| 2-3 | [Dive into the ClimateBench dataset](/dsc_180_data) |
40+
| 4-5 | [Begin data preprocessing and learn about xarray](/dsc_180_xarray) |
41+
| 6-7 | [Start implementing regression models](/dsc_180_implement) |
42+
| 8-9 | [Perform validation and testing of baselines](/dsc_180_validate) |
43+
| 10 | [Project wrap up and debrief](/dsc_180_debrief) |
4544

4645

4746
## Phase II

_pages/dsc_180_intro.md

Lines changed: 15 additions & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -7,38 +7,37 @@ toc_sticky: true
77
---
88

99

10-
Welcome, Welcome, Welcome!
10+
Welcome!
1111

12-
Week 1 is upon us and we are going to spend it getting everyone situated, goiong over the primary paper, and beginning to read up on the domain of climate model simulation and emulation
12+
Week 1 is upon us and we are going to spend it getting everyone situated, goiong over the primary paper, and beginning to read up on the domain of climate model simulation and emulation.
1313

1414
### Topics
1515

1616
You will be touching on these topics over the first week
1717

18-
- What is sepsis, how it can be diagnosed/treated and why it is very deadly from an epidemiological point of view
19-
- Severity of illness scores
20-
- EHR data
18+
- What is climate change, and how do we model it?
19+
- What is CMIP and how does it relate to the IPCC?
20+
- What is the ClimateBench dataset?
2121
- Reproducibility/replicability in data science
2222

2323
### Tasks
2424

25-
- Gain access to the MIMIC-III and MIMIC-IV dataset, see instructions on home pagecritical
26-
- Complete all assigned readings
27-
- Begin to familairize yourself with the domain by thumbing through the Domain Expertise page.
28-
- Submit question answers to the online form (linked below)
29-
- Join the class Discord Channel: TBD
25+
- [ ] Gain access to Casper, see instructions on home page
26+
- [ ] Complete all assigned readings
27+
- [ ] Submit question answers to the online form (linked below)
28+
- [ ] Join the class Slack Channel
3029

3130
### Readings
3231

33-
- Read the capstone primary paper by Zador et al. linked on the home pagecritical
34-
- MIMIC-III paper by Johnson et al. link - if you already did not finish it
35-
- MIMIC-III web description link - if you already did not finish it
32+
- Skim the latest UN Intergovernmental Panel on Climate Change [Synthesis Report](https://www.ipcc.ch/report/ar6/syr/downloads/report/IPCC_AR6_SYR_SPM.pdf) to get a summary of the latest climate change science, especially the figures.
33+
- Fully read the ClimateBench primary [paper](https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2021MS002954) by Watson-Parris et al.
34+
- Pick two citations within the introduction to the ClimateBench paper and skim-read them (just read the abstract, conclusion and look at the figures. Maybe look at the methods if relevent.).
3635

3736
### Questions
3837

39-
Answer the following questions using this google form link
38+
Answer the following questions using this [form](https://forms.office.com/r/xR1LFNZ6Tg).
4039

4140
1. What are the primary goals of the research paper we aim to replicate? Additionally, what do you anticipate to be the major hurdles in this process? Lastly, define what outcomes would you consider as a successful completion of Phase I in this capstone project.
42-
2. Provide a detailed explanation of what an Electronic Health Record (EHR) is. How does EHR relate to and integrate with the MIMIC datasets?
43-
3. Explain the concept and utility of Illness Scores in healthcare. Why do multiple illness scores exist? Select one illness score from the following - OASIS, SAPS II, or SOFA. Write a brief overview, including its applications and key attributes. Note: The aim is for each team member to gain expertise in at least one illness score, so ensure that all three scores are explored by different members.
41+
2. Provide a brief explanation of each of the ClimateBench inputs, what they represent, and the relative magnitude of their climate impact.
42+
3. Write a brief summary of each citation you read and how it relates to the ClimateBench paper. Note: You will be doing this for each paper we read in the course, so it is good to get some practice in now. The aim is for each team to gain a broad understanding of the field, so ensure that each team member chooses different citations.
4443

_pages/dsc_180_summer.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,6 @@ layout: single
66

77
I don't expect much over the summer but the following activities would be useful and allow you to hit the ground running in Fall:
88

9-
- Skim the latest UN Intergovernmental Panel on Climate Change Synthesis Report to get a summary of the latest climate change science, especially the figures: https://www.ipcc.ch/report/ar6/syr/downloads/report/IPCC_AR6_SYR_SPM.pdf
10-
- Read the ClimateBench paper: https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2021MS002954
11-
- Try out the xarray python library for working with climate data: https://docs.xarray.dev/en/stable/
9+
- Skim the latest UN Intergovernmental Panel on Climate Change Synthesis Report to get a summary of the latest climate change science, especially the figures: <https://www.ipcc.ch/report/ar6/syr/downloads/report/IPCC_AR6_SYR_SPM.pdf>
10+
- Read the ClimateBench paper: <https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2021MS002954>
11+
- Try out the xarray python library for working with climate data: <https://docs.xarray.dev/en/stable/>

_pages/projects.md

Lines changed: 1 addition & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -4,20 +4,11 @@ layout: posts
44
permalink: /projects/
55
entries_layout: grid
66
classes: wide
7-
8-
header:
9-
overlay_color: "#5e616c"
10-
overlay_image: /assets/images/podcast-header.png
11-
caption: "Photo credit: [**Third Pod from the Sun**](https://thirdpodfromthesun.com/)"
12-
13-
14-
# https://mmistakes.github.io/minimal-mistakes/portfolio/
157
---
168

179
# Projects
1810

19-
Samples of published and ongoing projects by CCOG. Sort them by computational or Earth Science tags.\
20-
Exciting new CCOG focus areas include sea level and large scale climate dynamics.
11+
Samples of published and ongoing projects by the Climate Analytics Lab. Sort them by computational or Earth Science tags.
2112

2213
{% case site.tag_archive.type %}
2314
{% when "liquid" %}

_pages/teaching.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
---
2+
permalink: /teaching/
3+
title: "Teaching"
4+
layout: archive
5+
author_profile: true
6+
---
7+
8+
## HDSI Courses
9+
10+
### DSC 200
11+
12+
Computing structures and programming concepts such as object orientation, data structures such as queues, heaps, lists, search trees and hash tables. Laboratory skills include data analysis with pandas and xarray in Jupyter notebooks.
13+
14+
- Please see course canvas page for details: <https://canvas.ucsd.edu/courses/49102>
15+
- Github repo: <https://github.com/climate-analytics-lab/dsc200-fa23-public/>
16+
17+
## DSC 180A
18+
19+
Data science capstone course. Students work in teams to complete a climate related data science project. Project management, communication, and teamwork skills are emphasized.
20+
21+
- Please see course page [here](/dsc_180) for details.

assets/images/climate_change.jpeg

440 KB
Loading

0 commit comments

Comments
 (0)