Skip to content

Commit fe00a84

Browse files
authored
tests: enable kv_unified to prevent cuda oom error on rtx 2060 (ggml-org#20645)
Signed-off-by: Aaron Teo <aaron.teo1@ibm.com>
1 parent 7ab321d commit fe00a84

1 file changed

Lines changed: 1 addition & 0 deletions

File tree

tests/test-backend-sampler.cpp

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -89,6 +89,7 @@ struct test_context {
8989
cparams.n_batch = 512;
9090
cparams.samplers = configs.data();
9191
cparams.n_samplers = configs.size();
92+
cparams.kv_unified = true;
9293

9394
// If n_seq_max is not specified, calculate it from configs
9495
if (n_seq_max < 0) {

0 commit comments

Comments
 (0)