Skip to content

Update conversion for Luciole model (1B and 23B)#1

Open
Jeronymous wants to merge 9 commits into
masterfrom
luciole
Open

Update conversion for Luciole model (1B and 23B)#1
Jeronymous wants to merge 9 commits into
masterfrom
luciole

Conversation

@Jeronymous
Copy link
Copy Markdown
Member

No description provided.

… <tool_call>. And fix conversion with tied weight embedding in some cases
LLAMA_TOKENIZE_PARSE_SPECIAL=1   make common_tokenize parse chat-template
                                 control tokens (<|im_start|> etc.) atomically.
                                 Default unchanged.

LLAMA_DUMP_TENSORS_FILE=PATH     dump full tensor data (filtered by optional
LLAMA_DUMP_TENSORS_REGEX=PAT     LLAMA_DUMP_TENSORS_REGEX) to a binary file
                                 from the eval callback, for offline layer-by-
                                 layer comparison against reference backends.
5-step pipeline: render+tokenize (transformers ref) → token-count
probes vs Ollama → behavioural tool-call test → next-token logit
comparison → unified report.

Layer-diff diagnostic for localizing per-op divergence when the
logit step flags a regression.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant