Commit df582dd
Mark Saroufim
Fix moe-mxfp4 check_implementation: avoid cloned weight comparison
aiter's fused_moe produces different results when weight tensors are
cloned (same values, different memory). The eval harness clones data
before passing to the submission, so comparing cloned-weight output
against original-weight output always fails. Since fused_moe doesn't
mutate inputs, we use a custom check_implementation that compares
the submission output against a fresh ref_kernel run on the original
(un-cloned) data.
1 parent 55af3b9 commit df582dd
1 file changed
Lines changed: 13 additions & 2 deletions
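The approach described in the commit message can be sketched roughly as follows. This is a minimal, dependency-free illustration, not the code from this commit: `ref_kernel` and `submission` here are hypothetical stand-ins operating on plain Python lists (not aiter's `fused_moe`), and the `(passed, message)` return shape and the tolerance values are assumptions.

```python
import copy
import math


def ref_kernel(data):
    """Hypothetical stand-in for the reference kernel (not aiter's fused_moe)."""
    x, w = data
    return [xi * wi for xi, wi in zip(x, w)]


def submission(data):
    """Hypothetical stand-in for a user submission."""
    x, w = data
    return [xi * wi for xi, wi in zip(x, w)]


def check_implementation(data, output, rtol=1e-2, atol=1e-2):
    """Compare the submission output with a fresh reference run.

    `data` must be the original (un-cloned) inputs. Re-running the reference
    on them is only safe because the reference does not mutate its inputs.
    Returns (passed, message); this return shape is an assumption.
    """
    expected = ref_kernel(data)
    if len(expected) != len(output):
        return False, f"size mismatch: expected {len(expected)}, got {len(output)}"
    for i, (e, o) in enumerate(zip(expected, output)):
        if not math.isclose(e, o, rel_tol=rtol, abs_tol=atol):
            return False, f"mismatch at index {i}: expected {e}, got {o}"
    return True, ""


# The harness hands a clone to the submission but keeps the original
# around, so the reference can be run on the un-cloned data.
original = ([1.0, 2.0, 3.0], [0.5, 0.25, 2.0])
cloned = copy.deepcopy(original)
output = submission(cloned)
passed, message = check_implementation(original, output)
```

The key design point is that the reference is never run on cloned weights: only the submission sees the clone, and the expected values are always recomputed from the original inputs.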