dungquixote42 869b83bc49
Add Unicode allowlist (#1597)
* initial commit

* cleanup

* fix whitelist arg parsing and simplify keyword search state

* rename white* to allow*

* add vocab_pieces init function, rename update functions, delete accidentally added file

* delete temporary bias code

* auto-generate fill function with script data inside

* deduplicate allowlist unicode rule parsing

* minor cleanup

* delete unnecessary header

* refactor allowlist to support sequential rule sets via keywords

* add early exit for zero-rules case

* delete accidentally added file
2026-04-10 18:22:57 +02:00
..
2024-07-27 07:55:01 +02:00
2026-03-26 17:24:11 +01:00
2026-04-10 18:22:57 +02:00
2026-04-10 18:22:57 +02:00
2025-12-15 08:27:20 +01:00
2026-04-10 18:22:57 +02:00
2026-04-10 18:22:57 +02:00
2026-04-09 15:33:56 +02:00
2023-11-13 14:16:23 +02:00