@@ -229,13 +229,13 @@ In total, it should contain 1280 files (via `find -type f | wc -l`) for a total
 
 Register on <https://image-net.org/> and follow directions to obtain the
 URLS for the ILSVRC2012 train and validation images.
-The script will additionally automatically download the `matched-frequency` version of [ImageNet v2](https://www.tensorflow.org/datasets/catalog/imagenet_v2#imagenet_v2matched-frequency_default_config), which is used as the test set of the ImageNet workloads.
+The script will additionally automatically download the `matched-frequency` version of [ImageNet v2](https://www.tensorflow.org/dataset/catalog/imagenet_v2#imagenet_v2matched-frequency_default_config), which is used as the test set of the ImageNet workloads.
 
 The ImageNet data pipeline differs between the PyTorch and JAX workloads.
 Therefore, you will have to specify the framework (either `pytorch` or `jax`) through the framework flag.
 
 ```bash
-python3 datasets/dataset_setup.py \
+python3 dataset/dataset_setup.py \
 --data_dir $DATA_DIR \
 --imagenet \
 --temp_dir $DATA_DIR/tmp \
@@ -349,7 +349,7 @@ In total, it should contain 20 files (via `find -type f | wc -l`) for a total of
 ### Criteo1TB
 
 ```bash
-python3 datasets/dataset_setup.py \
+python3 dataset/dataset_setup.py \
 --data_dir $DATA_DIR \
 --temp_dir $DATA_DIR/tmp \
 --criteo1tb
@@ -378,7 +378,7 @@ In total, it should contain 885 files (via `find -type f | wc -l`) for a total o
 To download, train a tokenizer and preprocess the librispeech dataset: