module documentation

Undocumented

Function tidy_xml Read in file, screen out unsafe unicode characters, write back file in utf-8.
Constant RE_XML_ILLEGAL Undocumented
Constant _SAFE_XML_REGEX Undocumented
def tidy_xml(filename):

Read in file, screen out unsafe unicode characters, write back file in utf-8.

Parameters
filenamestr
Returns
False if unable to read from file
RE_XML_ILLEGAL =

Undocumented

Value
('([%s-%s%s-%s%s-%s%s-%s])'+'|'+'([%s-%s][^%s-%s])|([^%s-%s][%s-%s])|([%s-%s]$)|(
^[%s-%s])')%(char(0),
    char(8),
    char(11),
    char(12),
    char(14),
    char(31),
    char(65534),
...
_SAFE_XML_REGEX =

Undocumented

Value
re.compile(RE_XML_ILLEGAL)