Commit 5465b80
authored
Keep last 3 benchmark shapes for fp8_quant and causal_conv1d (#129)
* Keep last 3 benchmark shapes for fp8_quant_py instead of first 3
The larger shapes (higher num_tokens and hidden_dim) are more
representative of production workloads for FP8 quantization.
* Keep last 3 benchmark shapes for causal_conv1d_py instead of first 3
The larger shapes (higher D and S) are more representative of
production workloads for causal conv1d.1 parent d773d99 commit 5465b80
2 files changed
Lines changed: 4 additions & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
42 | | - | |
43 | | - | |
44 | 42 | | |
| 43 | + | |
| 44 | + | |
45 | 45 | | |
46 | 46 | | |
47 | 47 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
49 | | - | |
50 | | - | |
51 | 49 | | |
| 50 | + | |
| 51 | + | |
52 | 52 | | |
53 | 53 | | |
54 | 54 | | |
| |||
0 commit comments