|
5105 | 5105 | "text": [ |
5106 | 5106 | "Help on function ecco_podaac_s3_get in module ecco_s3_retrieve:\n", |
5107 | 5107 | "\n", |
5108 | | - "ecco_podaac_s3_get(ShortName, StartDate, EndDate, download_root_dir=None, n_workers=6, force_redownload=False, return_downloaded_files=False)\n", |
| 5108 | + "ecco_podaac_s3_get(ShortName, StartDate, EndDate, download_root_dir=None, n_workers=6,\n" |
| 5109 | + " force_redownload=False, return_downloaded_files=False)\n", |
5109 | 5110 | " This routine downloads ECCO datasets from PO.DAAC, to be stored locally on a AWS EC2 instance running in region us-west-2. \n", |
5110 | 5111 | " It is adapted from the ecco_podaac_download function in the ecco_download.py module, and is the AWS Cloud equivalent of \n", |
5111 | 5112 | " ecco_podaac_download.\n", |
|
5316 | 5317 | "text": [ |
5317 | 5318 | "Help on function ecco_podaac_s3_get_diskaware in module ecco_s3_retrieve:\n", |
5318 | 5319 | "\n", |
5319 | | - "ecco_podaac_s3_get_diskaware(ShortNames, StartDate, EndDate, max_avail_frac=0.5, snapshot_interval=None, download_root_dir=None, n_workers=6, force_redownload=False)\n", |
5320 | | - " This function estimates the storage footprint of ECCO datasets, given ShortName(s), a date range, and which files (if any) are already present.\n", |
5321 | | - " If the current instance's available storage is at least twice the footprint of the new files, they are downloaded and stored locally on the instance \n", |
5322 | | - " using ecco_podaac_s3_get (hosting files locally typically speeds up loading and computation).\n", |
5323 | | - " Otherwise, the files are \"opened\" using ecco_podaac_s3_open so that they can be accessed directly on S3 without occupying local storage.\n", |
| 5320 | + "ecco_podaac_s3_get_diskaware(ShortNames, StartDate, EndDate, max_avail_frac=0.5, \n", |
| 5321 | + " snapshot_interval=None, download_root_dir=None, n_workers=6, force_redownload=False)\n", |
| 5322 | + " This function estimates the storage footprint of ECCO datasets, given ShortName(s), a date range, and which \n", |
| 5323 | + " files (if any) are already present.\n", |
| 5324 | + " If the footprint of the files to be downloaded (not including files already on the instance or re-downloads) \n", |
| 5325 | + " is <= the max_avail_frac specified of the instance's available storage, they are downloaded and stored locally \n", |
| 5326 | + " on the instance (hosting files locally typically speeds up loading and computation).\n", |
| 5327 | + " Otherwise, the files are "opened" using ecco_podaac_s3_open so that they can be accessed directly \n", |
| 5328 | + " on S3 without occupying local storage.\n", |
5324 | 5329 | " \n", |
5325 | 5330 | " Parameters\n", |
5326 | 5331 | " ----------\n", |
|
5336 | 5341 | " \n", |
5337 | 5342 | " max_avail_frac: float, maximum fraction of remaining available disk space to use in storing current ECCO datasets.\n", |
5338 | 5343 | " This determines whether the dataset files are stored on the current instance, or opened on S3.\n", |
5339 | | - " Valid range is [0,0.9]. If number provided is outside this range, it is replaced by the closer endpoint of the range.\n", |
| 5344 | + " Valid range is [0,0.9]. If number provided is outside this range, it is replaced by the closer \n", |
| 5345 | + " endpoint of the range.\n", |
5340 | 5346 | " \n", |
5341 | | - " snapshot_interval: ('monthly', 'daily', or None), if snapshot datasets are included in ShortNames, this determines whether\n", |
5342 | | - " snapshots are included for only the beginning/end of each month ('monthly'), or for every day ('daily').\n", |
5343 | | - " If None or not specified, defaults to 'daily' if any daily mean ShortNames are included and 'monthly' otherwise.\n", |
| 5347 | + " snapshot_interval: ('monthly', 'daily', or None), if snapshot datasets are included in ShortNames, \n", |
| 5348 | + " this determines whether snapshots are included for only the beginning/end of each month \n", |
| 5349 | + " ('monthly'), or for every day ('daily').\n", |
| 5350 | + " If None or not specified, defaults to 'daily' if any daily mean ShortNames are included \n", |
| 5351 | + " and 'monthly' otherwise.\n", |
5344 | 5352 | " \n", |
5345 | 5353 | " download_root_dir: str, defines parent directory to download files to.\n", |
5346 | 5354 | " Files will be downloaded to directory download_root_dir/ShortName/.\n", |
|
5373 | 5381 | "id": "887f8436-98d3-4b09-a7d1-936810717592", |
5374 | 5382 | "metadata": {}, |
5375 | 5383 | "source": [ |
5376 | | - "The syntax of this function is similar to `ecco_podaac_s3_get`, but there are two arguments specific to this function: **max_avail_frac** and **snapshot_interval**. **max_avail_frac** sets the storage threshold for whether the specified dataset(s) will be downloaded to the user's instance vs. opened from S3. For example, the default max_avail_frac = 0.5 will download the datasets if they will occupy less than 50% of the instance's remaining available memory. **snapshot_interval** applies only if there are snapshot datasets included in ShortNames, e.g., it could be useful to specify snapshot_interval = 'monthly' if you want to limit the size of the potential download.\n", |
| 5384 | + "The syntax of this function is similar to `ecco_podaac_s3_get`, but there are two arguments specific to this function: **max_avail_frac** and **snapshot_interval**. **max_avail_frac** sets the storage threshold for whether the specified dataset(s) will be downloaded to the user's instance vs. opened from S3. For example, the default max_avail_frac = 0.5 will download the datasets if they will occupy <= 50% of the instance's remaining available storage. **snapshot_interval** applies only if there are snapshot datasets included in ShortNames, e.g., it could be useful to specify snapshot_interval = 'monthly' if you want to limit the size of the potential download.\n", |
5377 | 5385 | "\n", |
5378 | 5386 | "Now let's repeat the calculation that was done above by invoking this function, first removing the files if they are already on disk to replicate the previous example in Method 2 as closely as possible." |
5379 | 5387 | ] |
|
0 commit comments