Index talk:Het Koninkrijk Deel 01 Voorspel (1969).djvu

From Wikisource

Something is going wrong here

From page 14 onward something goes completely wrong. The OCR no longer lines up with the scans. I am stopping here for now, to see how this can be solved. --Dick Bos (talk) 3 Feb 2016 21:45 (CET)


Request for Help

The following message was posted in the Scriptorium of en-ws:

Request for help with a problem on Dutch Wikisource.

Please help me with a problem on Dutch Wikisource. It is a problem with the OCR layer of the text of this book: s:nl:Index:Het_Koninkrijk_Deel_01_Voorspel_(1969).djvu

The pdf is on Commons: commons:File:L._de_Jong_-_Het_Koninkrijk_der_Nederlanden_in_de_Tweede_Wereldoorlog_1939-1945_Deel_1_Voorspel.pdf

This pdf was uploaded to IA (cf. this procedure).

https://archive.org/details/L.DeJongHetKoninkrijkDerNederlandenInDeTweedeWereldoorlog19391945Deel1Voorspel

The djvu thus created was uploaded to commons again: commons:File:Het_Koninkrijk_Deel_01_Voorspel_(1969).djvu

This djvu is used in Dutch Wikisource.

The problem is this: from DjVu page 25 (printed page 14) onward, a page is missing from the OCR layer, so the OCR text corresponds not to the scanned image it is attached to but to the next page. Further on, more pages are missing; I don't know exactly which ones. I suppose 24 pages are missing in total: DjVu page 744 is the last one with OCR text, and after that there are 24 more scanned pages in the book.

Can anyone explain what's gone wrong here? And how could I solve this problem?

After discovering this problem, I uploaded a new version (directly from NIOD) to IA, and again to Commons, and created this index: s:nl:Index:De_Jong_-_Koninkrijk_Deel_01_Voorspel_(1969).djvu. This file shows exactly the same problem, so it looks like something is corrupt in the original PDF.

--Dick Bos (talk) 4 Feb 2016 21:14 (CET)
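
The misalignment described above (OCR text for a page dropped, so every later text sits one scan too early) can be illustrated with a small sketch. This is only an illustration under assumptions, not a fix that was applied to the file: `realign_ocr` and its parameters are hypothetical names, the per-page OCR texts are assumed to be available as a plain list in file order, and the scan pages whose OCR was dropped are assumed to be already known (in the real file only the first one, DjVu page 25, has been identified).

```python
def realign_ocr(ocr_texts, missing_scan_pages):
    """Return one text entry per scan page (1-based page order).

    ocr_texts          -- OCR texts in the order they appear in the file
                          (hypothetical input; extraction is not shown here)
    missing_scan_pages -- 1-based scan page numbers whose OCR was dropped
                          (assumed to be known)
    Pages without OCR get an empty string, so later texts shift back
    onto the scans they actually belong to.
    """
    missing = set(missing_scan_pages)
    total = len(ocr_texts) + len(missing)
    out_of_range = [p for p in missing if not 1 <= p <= total]
    if out_of_range:
        raise ValueError(f"page numbers outside 1..{total}: {sorted(out_of_range)}")

    texts = iter(ocr_texts)
    aligned = []
    for page in range(1, total + 1):
        # Leave a gap where the OCR is missing, otherwise take the next text.
        aligned.append("" if page in missing else next(texts))
    return aligned


# Toy example: five scans, OCR for scan 3 was dropped, so the file
# shows the text of page 4 next to the image of page 3.
shifted = ["text p1", "text p2", "text p4", "text p5"]
print(realign_ocr(shifted, [3]))
```

With the figures from the message, 744 pages carrying OCR text plus 24 dropped pages gives 744 + 24 = 768 scans in total, which matches the 24 scanned pages that follow DjVu page 744 without any text.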