test/compatibility/windows-1252.tst
changeset 37 037942e1bc4f
     1.1 --- /dev/null	Thu Jan 01 00:00:00 1970 +0000
     1.2 +++ b/test/compatibility/windows-1252.tst	Mon Feb 20 10:10:48 2012 +0000
     1.3 @@ -0,0 +1,29 @@
     1.4 +**************** ENCODING ****************
     1.5 +WINDOWS-1252
     1.6 +**************** INPUT ****************
     1.7 +gutcheck has only a very limited support for windows-1252, but it does
     1.8 +recognise some characters as letters.
     1.9 +
    1.10 +Žal at the start of a paragraph would throw a warning if its first letter
    1.11 +wasn't recognised since the paragraph would then appear to start with
    1.12 +something other than a capital letter. Æsop likewise proves that ash is
    1.13 +seen as a letter (otherwise a warning would be given for a period not
    1.14 +followed by a capital letter). Œcolampadius does the same for œthel.
    1.15 +
    1.16 +Ÿ-decay is something I don't even pretend to understand, but I'm quite
    1.17 +happy to abuse it to test that strange letter.
    1.18 +
    1.19 +Contrawise, we can prove that some characters are _not_ seen as letters
    1.20 +since neither 2×2=4 nor 4÷2=2 produce a warning (if they had been seen
    1.21 +as letters, we would expect ‘Query digit’ warnings).
    1.22 +
    1.23 +The trademark symbol ™ and œthel might,for whatever reason, confuse the
    1.24 +column numbers in warnings.
    1.25 +
    1.26 +**************** EXPECTED ****************
    1.27 +
    1.28 +gutcheck has only a very limited support for windows-1252, but it does
    1.29 +    Line 1 column 1 - Paragraph starts with lower-case
    1.30 +
    1.31 +The trademark symbol ™ and œthel might,for whatever reason, confuse the
    1.32 +    Line 17 column 39 - Missing space?