This user has not added any information to their profile yet.
Senior research scientist, Natural Language Processing
- Document conversion
- Typography, book design
I am currently involved in the EU H2020 READ project on providing access to historical documents.
If you need to transcribe old handwritten documents check out here the READ Transkribus platform!
For pdftoxml, there's the open source project https://sourceforge.net/projects/pdf2xml/