Commit 81d6d4e
Mark Saroufim
Shuffle iteration data and fix recheck bug
Additional hardening on top of the object-identity caching fix:
- Shuffle data order each timing iteration to prevent call-count
caching (a submission could track invocation count and predict
which data item appears at each position)
- Move clone before torch.cuda.synchronize() so clone GPU copies
can overlap with previous iteration's tail work
- Fix pre-existing recheck bug where only the last item's
correctness was checked (if not good was outside the for loop)
- Use shuffle_order indices to correctly pair shuffled outputs
with their reference data during recheck1 parent 583bf08 commit 81d6d4e
1 file changed
Lines changed: 12 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | 3 | | |
| 4 | + | |
4 | 5 | | |
5 | 6 | | |
6 | 7 | | |
| |||
274 | 275 | | |
275 | 276 | | |
276 | 277 | | |
277 | | - | |
278 | | - | |
279 | | - | |
| 278 | + | |
| 279 | + | |
280 | 280 | | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
281 | 286 | | |
282 | 287 | | |
283 | 288 | | |
| |||
294 | 299 | | |
295 | 300 | | |
296 | 301 | | |
297 | | - | |
298 | | - | |
299 | | - | |
300 | | - | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
301 | 306 | | |
302 | 307 | | |
303 | 308 | | |
| |||
0 commit comments