Category Archives: OCR

GW4 Archives: exploring UK Medical Heritage Library and Historical Texts as data

In recent years hack-days have been all the rage and have proved a good vehicle for interactions between people who normally might not work together. In academia there has been a trend towards running so-called ‘labs’. The word implies experimentation; hack-day tends to imply coding (it can be experimental!), whereas ‘lab’ suggests that it can … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

New Jisc commission to Visualise Medical History

Jisc will shortly be commissioning a project entitled Visualising Medical History as part of the wider work around the UK Medical Heritage Library. This post explains the rationale behind the project and provides practical details. The UK Medical Heritage Library is making 15 million pages of 19th Century medical texts available digitally in one searchable collection for the … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

Final IMPACT Conference on OCR, October 2011, The Hague

The final conference of the IMPACT project will take place on 24-25 October 2011 at the British Library in London, with the title: “Digitisation & OCR: Better, faster, cheaper. Solutions of the IMPACT Centre of Competence and future challenges” The IMPACT Project (Improving Access to Text) started on 1st January 2008 with the aim to … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

Challenging our understanding of Digitisation

At the forthcoming Developer Happiness Days one of the sessions planned to take place will be exploring a DIY digitisation workflow: Taking you from the act of scanning images and objects, learning how to process and edit them with software like ocrupus, blender and OpenCV, storing and manipulating them online and finally, through to printing … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

Workshop: Optical Character Recognition (OCR) for the mass digitisation of textual materials: Improving Access to Text

24 September 2009 – UKOLN, University of Bath http://www.ukoln.ac.uk/events/ocr-2009/ FREE one-day workshop for * Collection holders in HE and Cultural Heritage organisations * Users of digitised content for teaching, learning and research This workshop is funded by the Joint Information Systems Committee (JISC) as part of a series of workshops & seminars on Achievements & … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

European Conference on OCR and Mass Digitisation

From the IMPACT project, a European Union project which is aiming to create a centre of excellent for the digitisation of textual cultural heritage Introduction On 6 and 7 April 2009 the IMPACT project will organise a conference on OCR in mass digitisation projects. This conference will focus on exchanging views with other researchers and … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS

The challenges of “useful” OCR

The National Archive’s digitisation project, British Governance in the 20th century – Cabinet Papers, 1914-1975, has been grappling with issues of “useful” OCR. It might be stating the obvious, but accurate OCR is as useful as the search results it produces. If OCRd text consistently misspells particularly relevant key words for retrieving certain documents, than … Read more

Share and Enjoy

  • Facebook
  • Twitter
  • Delicious
  • LinkedIn
  • StumbleUpon
  • Add to favorites
  • Email
  • RSS