Old Weather Forum

Old Weather: Arctic => Notes from OW5 => Topic started by: Randi on April 11, 2016, 09:58:34 pm

Title: OCR testing
Post by: Randi on April 11, 2016, 09:58:34 pm
Please note: Philip has specifically asked us to work on Farragut.
The team working on OCR is ready for some test data - and our experts are needed to provide that test data!

https://www.oldweather.org/#/ships

We have a typed log: Farragut 1942, Part 1

The logbooks are available online at: U.S. Navy Logbooks | Arctic Rediscovery
Jan 1942 = https://catalog.archives.gov/id/7795039
...
Dec 1942 = https://catalog.archives.gov/id/7795050
Title: Re: OCR testing
Post by: Randi on April 11, 2016, 10:57:06 pm
Kevin says:
Quote
please emphasize centering characters and not the ruled lines when marking tables.

I have asked him for more details and/or an example.

Quote from: Kevin
My concern is if people do it how I did at first (by the lines of the printed table) there will be truncated characters when numbers are not typed in the center of the box.

Kevin says that this Eastwind example is good:
(http://i.imgur.com/8NNZj5f.jpg)
(I suspect that all those "Calm" entries in the middle of two separate boxes will be a learning experience ;))

(http://i.imgur.com/iABZerL.jpg)
Title: Re: OCR testing
Post by: Randi on April 12, 2016, 10:43:28 pm
Farragut - Questions and Comments (https://www.zooniverse.org/projects/zooniverse/old-weather/talk/161/12848?comment=25182)
 ;)

Thanks Randi!!  ;)
Title: Re: OCR testing
Post by: Randi on April 13, 2016, 01:23:23 pm
Reminder: Philip has asked us to include the State of the Sea column(s) for OW5.