Randomize the text content of the sample documents
But maintain the XML structures
Let's supposed you've discovered a bug in Saxon, but the document that demonstrates the bug is really embarrassing