Printing, photocopying and scanning processes degrade the image quality of a document. Statistical models of these degradation processes are crucial for document image understanding research. Models allow us to predict system performance; conduct controlled experiments to study the break-down points of the systems; create large multi-lingual data sets with ground truth for training classifiers; design optimal noise removal algorithms; choose values for the free parameters of the algorithms; and so ...