Update conversion for Luciole model (1B and 23B) by Jeronymous · Pull Request #1 · OpenLLM-France/llama.cpp

Jeronymous · 2026-03-18T16:22:12Z

No description provided.

… <tool_call>. And fix conversion with tied weight embedding in some cases

LLAMA_TOKENIZE_PARSE_SPECIAL=1 make common_tokenize parse chat-template control tokens (<|im_start|> etc.) atomically. Default unchanged. LLAMA_DUMP_TENSORS_FILE=PATH dump full tensor data (filtered by optional LLAMA_DUMP_TENSORS_REGEX=PAT LLAMA_DUMP_TENSORS_REGEX) to a binary file from the eval callback, for offline layer-by- layer comparison against reference backends.

5-step pipeline: render+tokenize (transformers ref) → token-count probes vs Ollama → behavioural tool-call test → next-token logit comparison → unified report. Layer-diff diagnostic for localizing per-op divergence when the logit step flags a regression.

Jeronymous added 9 commits March 18, 2026 17:16

fix and update test

d89fc07

Update conversion script for Luciole

ff6f474

Fix tied word embeddings

41c639c

Special path for Luciole-8B (NemotronHForCausalLM hybrid archi)

5c02601

more robust distinction between Luciole and others

c14603b

Correctly handle added tokens that are not marked as special, such as…

0284fd3

… <tool_call>. And fix conversion with tied weight embedding in some cases

convert: cast LayerNorm1p γ to fp32 before adding 1 (nemotron)

755aa05

Jeronymous force-pushed the luciole branch from d2a09ee to ffd08ef Compare May 29, 2026 12:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update conversion for Luciole model (1B and 23B)#1

Update conversion for Luciole model (1B and 23B)#1
Jeronymous wants to merge 9 commits into
masterfrom
luciole

Jeronymous commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Jeronymous commented Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant