Testcases for individual character checks
author ali <ali@juiblex.co.uk>
Tue Feb 14 19:31:55 2012 +0000 (2012-02-14)
changeset 22 52e61d5db08c
parent 21 e58bd1b0b60f
child 23 38ea5c71f04f
test/compatibility/Makefile.am
test/compatibility/control-characters.tst
test/compatibility/unusual-characters.tst
test/compatibility/windows-1252.tst
--- a/test/compatibility/Makefile.am	Tue Feb 14 09:50:08 2012 +0000
+++ b/test/compatibility/Makefile.am	Tue Feb 14 19:31:55 2012 +0000
@@ -4,6 +4,7 @@
 	extra-period.tst ellipsis.tst short-line.tst abbreviation.tst \
 	example.tst non-ascii.tst embedded-lf.tst markup.tst \
 	user-defined-typo.tst brackets.tst single-quotes.tst grave-quotes.tst \
-	dashes.tst
+	dashes.tst control-characters.tst unusual-characters.tst \
+	windows-1252.tst
 
 dist_pkgdata_DATA=$(TESTS)
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/test/compatibility/control-characters.tst	Tue Feb 14 19:31:55 2012 +0000
@@ -0,0 +1,12 @@
+**************** ENCODING ****************
+WINDOWS-1252
+**************** INPUT ****************
+“I will—if I can,” murmured Dudd Flockley.
+And then, as some other students approached, the
+talk had to come to an end.
+
+CHAPTER V
+**************** EXPECTED ****************
+
+CHAPTER V
+    Line 5 column 1 - Control character 12
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/test/compatibility/unusual-characters.tst	Tue Feb 14 19:31:55 2012 +0000
@@ -0,0 +1,18 @@
+**************** INPUT ****************
+All wrapt in flames, behold our craft/
+	She'll plough the main no more!
+Her galla~t crew may well shed tears----*
+
+**************** EXPECTED ****************
+
+All wrapt in flames, behold our craft/
+    Line 1 column 38 - Forward slash?
+
+	She'll plough the main no more!
+    Line 2 column 1 - Tab character?
+
+Her galla~t crew may well shed tears----*
+    Line 3 column 10 - Tilde character?
+
+Her galla~t crew may well shed tears----*
+    Line 3 column 41 - Asterisk?
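
The EXPECTED section of unusual-characters.tst can be reproduced with a very small per-character scan. The following is only a sketch under stated assumptions, not gutcheck's actual implementation: the `UNUSUAL` table and the `check_unusual` function are hypothetical names, the warning strings are copied from the test file, and the real checker may apply extra conditions before querying a character. Lines and columns are counted from 1, as in the expected output.

```python
# Hypothetical table mapping each queried character to its warning text.
# The messages come from the EXPECTED section of unusual-characters.tst;
# the table itself is an assumption made for this sketch.
UNUSUAL = {
    "/": "Forward slash?",
    "\t": "Tab character?",
    "~": "Tilde character?",
    "*": "Asterisk?",
}


def check_unusual(text):
    """Return the report as a list of lines.

    Each finding is a blank line, the offending source line, and an
    indented warning giving the 1-based line and column.
    """
    report = []
    # split("\n") rather than splitlines(): splitlines() would also break
    # lines at a form feed, which matters for control-character tests.
    for line_no, line in enumerate(text.split("\n"), start=1):
        for col, ch in enumerate(line, start=1):
            if ch in UNUSUAL:
                report.append("")
                report.append(line)
                report.append(f"    Line {line_no} column {col} - {UNUSUAL[ch]}")
    return report


sample = (
    "All wrapt in flames, behold our craft/\n"
    "\tShe'll plough the main no more!\n"
    "Her galla~t crew may well shed tears----*\n"
)
print("\n".join(check_unusual(sample)))
```

Unlike this sketch, a production checker would probably not report every repeated occurrence of the same character on one line; the test input avoids that case, so the difference does not show here.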
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/test/compatibility/windows-1252.tst	Tue Feb 14 19:31:55 2012 +0000
@@ -0,0 +1,29 @@
+**************** ENCODING ****************
+WINDOWS-1252
+**************** INPUT ****************
+gutcheck has only a very limited support for windows-1252, but it does
+recognise some characters as letters.
+
+Žal at the start of a paragraph would throw a warning if its first letter
+wasn't recognised since the paragraph would then appear to start with
+something other than a capital letter. Æsop likewise proves that ash is
+seen as a letter (otherwise a warning would be given for a period not
+followed by a capital letter). Œcolampadius does the same for œthel.
+
+Ÿ-decay is something I don't even pretend to understand, but I'm quite
+happy to abuse it to test that strange letter.
+
+Contrawise, we can prove that some characters are _not_ seen as letters
+since neither 2×2=4 nor 4÷2=2 produce a warning (if they had been seen
+as letters, we would expect ‘Query digit’ warnings).
+
+The trademark symbol ™ and œthel might,for whatever reason, confuse the
+column numbers in warnings.
+
+**************** EXPECTED ****************
+
+gutcheck has only a very limited support for windows-1252, but it does
+    Line 1 column 1 - Paragraph starts with lower-case
+
+The trademark symbol ™ and œthel might,for whatever reason, confuse the
+    Line 17 column 39 - Missing space?
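
The three new .tst files share one layout: a header line made of asterisks around an upper-case section name (ENCODING, INPUT, EXPECTED), followed by that section's text. As an illustration of the layout only, here is a minimal reader sketch. The names `HEADER` and `parse_tst` are hypothetical, and the project's real test harness may parse these files differently.

```python
import re

# A section header: a run of asterisks, a space, an upper-case name,
# a space, and another run of asterisks (as seen in the .tst files above).
HEADER = re.compile(r"^\*+ ([A-Z]+) \*+$")


def parse_tst(text):
    """Split a .tst file into {section name: section text}."""
    # split("\n") is deliberate: splitlines() would also split at a form
    # feed (control character 12), which control-characters.tst relies on.
    lines = text.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # ignore the empty piece after a final newline
    sections = {}
    current = None
    for line in lines:
        match = HEADER.match(line)
        if match:
            current = match.group(1)
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    return {name: "\n".join(body) for name, body in sections.items()}


# A cut-down example in the same layout; an EXPECTED section would be
# picked up in exactly the same way.
example = (
    "**************** ENCODING ****************\n"
    "WINDOWS-1252\n"
    "**************** INPUT ****************\n"
    "talk had to come to an end.\n"
    "\n"
    "\x0cCHAPTER V\n"
)
sections = parse_tst(example)
print(sorted(sections))
```

Keeping the section bodies as raw strings leaves any control characters in the INPUT untouched, so they can be passed to the checker exactly as written.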