[Users] How to filter utf8 messages

Slavko linux at slavino.sk
Thu Jul 27 11:37:18 UTC 2023


Ahoj,

Dňa Thu, 27 Jul 2023 08:38:18 +0000 Colin Leroy-Mira via Users
<users at lists.claws-mail.org> napísal:

> July 27, 2023 at 9:50 AM, "Slavko" <linux at slavino.sk> wrote:
> 
> 
> > And another question, i use this regex to score Chinesse & etc chars
> > (scripts) in subject in rspamd (perhaps can be useful for OP):
> >  [\p{Han}\p{Hiragana}\p{Katakana}\p{Hangul}\p{Arabic}]+
> > 
> > Will that work in CM filter regexes?  
> 
> I'm unsure, you can test regexps in the QuickSearch in extended mode:
> subject regexpcase "..."
> 

does not work :-(

Or, more precise, it seems to work in opposite direction, it founds
everything except Chinese (or so) subjects. I tried some escaping, but
that doesn't help at all.

I used dialog to create rule, result:

    subject regexp "[\\p{Han}\\p{Hiragana}\\p{Katakana}\\p{Hangul}]"

Don't worry, i am not interested in that, i was just curious...


-- 
Slavko
https://www.slavino.sk
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 488 bytes
Desc: Digit��lny podpis OpenPGP
URL: <http://lists.claws-mail.org/pipermail/users/attachments/20230727/4f2fd8fb/attachment.sig>


More information about the Users mailing list