# HG changeset patch # User ali # Date 1329247915 0 # Node ID 52e61d5db08cc1b2e68a17d6ea0c015c1609b223 # Parent e58bd1b0b60f956a4577c56b326c5bc7142ae21d Testcases for individual character checks diff -r e58bd1b0b60f -r 52e61d5db08c test/compatibility/Makefile.am --- a/test/compatibility/Makefile.am Tue Feb 14 09:50:08 2012 +0000 +++ b/test/compatibility/Makefile.am Tue Feb 14 19:31:55 2012 +0000 @@ -4,6 +4,7 @@ extra-period.tst ellipsis.tst short-line.tst abbreviation.tst \ example.tst non-ascii.tst embedded-lf.tst markup.tst \ user-defined-typo.tst brackets.tst single-quotes.tst grave-quotes.tst \ - dashes.tst + dashes.tst control-characters.tst unusual-characters.tst \ + windows-1252.tst dist_pkgdata_DATA=$(TESTS) diff -r e58bd1b0b60f -r 52e61d5db08c test/compatibility/control-characters.tst --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test/compatibility/control-characters.tst Tue Feb 14 19:31:55 2012 +0000 @@ -0,0 +1,12 @@ +**************** ENCODING **************** +WINDOWS-1252 +**************** INPUT **************** +“I will—if I can,” murmured Dudd Flockley. +And then, as some other students approached, the +talk had to come to an end. + + CHAPTER V +**************** EXPECTED **************** + + CHAPTER V + Line 5 column 1 - Control character 12 diff -r e58bd1b0b60f -r 52e61d5db08c test/compatibility/unusual-characters.tst --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test/compatibility/unusual-characters.tst Tue Feb 14 19:31:55 2012 +0000 @@ -0,0 +1,18 @@ +**************** INPUT **************** +All wrapt in flames, behold our craft/ + She'll plough the main no more! +Her galla~t crew may well shed tears----* + +**************** EXPECTED **************** + +All wrapt in flames, behold our craft/ + Line 1 column 38 - Forward slash? + + She'll plough the main no more! + Line 2 column 1 - Tab character? + +Her galla~t crew may well shed tears----* + Line 3 column 10 - Tilde character? + +Her galla~t crew may well shed tears----* + Line 3 column 41 - Asterisk? diff -r e58bd1b0b60f -r 52e61d5db08c test/compatibility/windows-1252.tst --- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test/compatibility/windows-1252.tst Tue Feb 14 19:31:55 2012 +0000 @@ -0,0 +1,29 @@ +**************** ENCODING **************** +WINDOWS-1252 +**************** INPUT **************** +gutcheck has only a very limited support for windows-1252, but it does +recognise some characters as letters. + +Žal at the start of a paragraph would throw a warning if its first letter +wasn't recognised since the paragraph would then appear to start with +something other than a capital letter. Æsop likewise proves that ash is +seen as a letter (otherwise a warning would be given for a period not +followed by a capital letter). Œcolampadius does the same for œthel. + +Ÿ-decay is something I don't even pretend to understand, but I'm quite +happy to abuse it to test that strange letter. + +Contrawise, we can prove that some characters are _not_ seen as letters +since neither 2×2=4 nor 4÷2=2 produce a warning (if they had been seen +as letters, we would expect ‘Query digit’ warnings). + +The trademark symbol ™ and œthel might,for whatever reason, confuse the +column numbers in warnings. + +**************** EXPECTED **************** + +gutcheck has only a very limited support for windows-1252, but it does + Line 1 column 1 - Paragraph starts with lower-case + +The trademark symbol ™ and œthel might,for whatever reason, confuse the + Line 17 column 39 - Missing space?