# HG changeset patch # User Peter Hosey # Date 1294478864 28800 # Node ID 5f17911c4fe6daf862c15b2712cb17983ff10136 # Parent 3b33b1c488809a681ffdb92c3fc81211cba2de48 Expanded this comment to acknowledge that the same sub-expression covers both decimal and octal (beginning with 0) number literals. diff --git a/diff-colorize.py b/diff-colorize.py --- a/diff-colorize.py +++ b/diff-colorize.py @@ -174,7 +174,7 @@ def common_and_distinct_substrings(a, b) def tokenize(a): "Each token is an identifier, a number, or a single character." import re - # Word in identifier, word in macro name (MACRO_NAME), binary number, hex number, decimal number, operator, other punctuation. + # Word in identifier, word in macro name (MACRO_NAME), binary number, hex number, decimal or octal number, operator, other punctuation. token_exp = re.compile('[_A-Z]*[_a-z0-9]+:?|_??[A-Z0-9]+:?|0b[01]+|0[xX][0-9A-Fa-f]+|[0-9]+|[-+*|&^/%\[\]<=>,]|[()\\\\;`{}]') start = 0 for match in token_exp.finditer(a):