Categories
Data capture Jisc digitisation programmes OCR Projects 2006-2009

The challenges of “useful” OCR

The National Archive’s digitisation project, British Governance in the 20th century – Cabinet Papers, 1914-1975, has been grappling with issues of “useful” OCR. It might be stating the obvious, but accurate OCR is as useful as the search results it produces. If OCRd text consistently misspells particularly relevant key words for retrieving certain documents, than […]