[pull] master from scrapinghub:master #81

Merged: pull[bot] merged 1 commit into zanachka:master from scrapinghub:master on Apr 28, 2026

Conversation

pull[bot] commented Apr 28, 2026

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.4)

Can you help keep this open source service alive? 💖 Please sponsor : )

…#1324)

* fix: preserve whitespace when removing translation tokens (fix #1302)

- Fixed whitespace collapsing when skip tokens (like 'klo' in Finnish) are removed
- When a skip token is removed between spaces, preserve the maximum of surrounding spaces
- Added comprehensive tests in test_whitespace_preservation.py
- Updated existing affected test expectations to match corrected behavior
- Fixes issue where double spaces were created when skip tokens were removed
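
The "maximum of surrounding spaces" rule described above can be illustrated with a minimal sketch. This is not the code from the commit: `remove_skip_token` is a hypothetical helper written only to show the rule, and it assumes the skip token is a plain word delimited by word boundaries (so it would not cover non-word tokens such as `|`).

```python
import re


def remove_skip_token(text, token):
    """Hypothetical helper (not the library's implementation).

    Removes a word-like skip token and replaces the token plus its
    surrounding spaces with the longer of the two space runs, so that
    "a klo b" becomes "a b" rather than "a  b".
    """
    # Capture the run of spaces on each side of the token.
    pattern = re.compile(r"( *)\b" + re.escape(token) + r"\b( *)")

    def _keep_longer_run(match):
        left, right = match.group(1), match.group(2)
        # Preserve the maximum of the surrounding whitespace.
        return left if len(left) >= len(right) else right

    return pattern.sub(_keep_longer_run, text)


print(repr(remove_skip_token("12 klo 14", "klo")))   # '12 14'
print(repr(remove_skip_token("12  klo 14", "klo")))  # '12  14'
```

Keeping the longer run means a single space stays single, while spacing that was already wider than one character in the input is not narrowed.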

* fix: update remaining test expectations for whitespace preservation

- Fixed 6 edge case tests related to _clear_future_words method
- Most cases now correctly expect single spaces (he, nl, pl, vi-lúc, da)
- Vietnamese pipe case ('|') correctly expects double space preservation
- All 1,342 language and whitespace tests now passing

* Fix the issue
pull[bot] locked and limited conversation to collaborators Apr 28, 2026
pull[bot] added the ⤵️ pull label Apr 28, 2026
pull[bot] merged commit 373ede9 into zanachka:master Apr 28, 2026
11 of 12 checks passed