Generating XML by OCRing PDF: it’s called innovation.