Author Topic: Faulty scanned and duplicate pages  (Read 101408 times)

Randi

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 12060
    • View Profile
Faulty scanned and duplicate pages
« on: October 27, 2012, 11:33:29 pm »
Faulty Scanned and Missing Pages
If there is a problem reading a log page because of the quality of the scan, please post a description of the problem and a link to the page in this topic.
If a page is missing*, please post a description of the problem and a link to the page before the missing page in this topic.



Duplicate Pages
You may see two (or more) pages from a ship's log with the same date. There are two main causes: duplicate scans and duplicate copies. Duplicated dates may also occur when ships cross the International Date Line*.

Duplicate scans of log pages appear when the same day has been scanned twice.
This can happen because the page was accidentally scanned twice (e.g., http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4775_0.jpg and http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_0.jpg) or because the page was rescanned due to the poor quality of the first scan but the first scan was not removed. If you transcribed the first scan, there is no need to transcribe the second scan. Just click on "I've finished with this page". If the first scan is unreadable, please transcribe the second scan. Occasionally you will have to transcribe some data from the first scan and some from the second. If you are not sure how to transcribe the data, don't hesitate to post a question in this topic or the ship's Discussion topic. You can also do a search to find out what other transcribers have done.
Duplicates can also be caused by inserts. See Inserts in bound logbooks below.

Duplicate copies of log pages appear when the same day has been logged twice.
This could have happened if the log keeper accidentally recorded, for example, March 1 in the February log book and then correctly recorded it in the March log book. It could also have occurred if someone copied the log book and accidentally copied a page twice. Even if the data on the two pages are exactly the same, you will be able to see differences between the pages. On some duplicated pages the data are NOT the same (see 7am weather codes in http://s3.amazonaws.com/oldweather/ADM53-40562/ADM%2053-40562-187_0.jpg and http://s3.amazonaws.com/oldweather/ADM53-40563/ADM%2053-40563-007_0.jpg). In such cases, the scientists would very much like to have both pages transcribed, in order to see how accurate the original log keepers/copiers were.
NOTE: We have both the original logbook and the copy of some US logs, so there are occasionally several months of duplicates. The scanners/programmers tried to remove the duplicates earlier, but some were missed. Please post in this topic when you find such duplicates. They are not worth transcribing, so just keep clicking "I've finished with this page" without entering any data until you've worked through them.



Inserts in bound logbooks
In most cases the left- and right-hand pages of the log book are included in a single image which is then split into two. If an insert covers the left-hand page, two scans will be required: one with the insert visible and one with the log page visible. This will generate two copies of the right-hand page.
Whenever you find a log page covered by another sheet, check its JPEG link and the next couple of JPEG links for a view of what's happening.
There are variations, depending on the size and placement of the insert, but here is an example:
It's often necessary to look at the JPEG link for the other half of the image to read the beginnings or ends of lines, because the insert's placement may cause it to cross the point where the image is split into two pages. It is acceptable to put any transcribing you want to do entirely on one of the two pages. Page joining tool - for when the text you are trying to read is split over two pages

Note: You do not need to report duplicate scans when they are the result of inserts.





* Ships sometimes cross the International Date Line. If the ship is going west to east, this may lead to two weather and events pages having the same date. Please transcribe these pages as written, since these are technically for two different days and they will have different values.
If the ship is going east to west, this may lead to a date being skipped; for example, going from May 10 to May 12 with no May 11. Please transcribe these pages as written. In both cases, we (and the scientists) would appreciate it if you could post a note here, or in the ship's Discussion topic, stating which pages have been affected.
« Last Edit: May 07, 2015, 10:24:05 pm by Randi »

Randi

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 12060
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #1 on: October 27, 2012, 11:39:25 pm »
Patterson:

For the whole log book, on most of the events pages the extreme left side of the page (about 2 letters) is not visible.
30 July 1884 - http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4770_1.jpg
to
4 Feb. 1885 - http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4972_1.jpg

Randi

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 12060
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #2 on: October 27, 2012, 11:39:57 pm »
Copied from: Listing of faulty scanned pages Phase 3

I just noticed an "hour" field has been added and it shows up in the cloud cover picture so you can keep track of where you are! I can no longer see all the air pressure digits but it's worth it.

I am struggling with the split pages of the Manning so it's still not easy, but Mann-agable.  ;D

There are more Weather fields in the capture screen than in the Manning log. I guess she has non-standard log because there is no temp. with the pressure. There are now 4 "Air" fields (is that new?). There are only 3 on the Manning log.

With the split pages the first 2 digits of the barometer reading are sometimes lost but I can deduce what they are so I am entering them.

The splitting of Manning's pages has been fully repaired. :)
« Last Edit: October 27, 2012, 11:41:50 pm by randi_2 »

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #3 on: October 28, 2012, 12:01:16 am »
Patterson:

For the whole log book, on most of the events pages the extreme left side of the page (about 2 letters) is not visible.
30 July 1884 - http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4770_1.jpg
to
4 Feb. 1885 - http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4972_1.jpg

And on the right side, the curve and tilt of the page makes lining up magnifier boxes difficult.

I must also note, these early logs were rushed on what the American team calls "archaic" machines - once again OW transcribers were too good and fast for our own good, and the team had to start scanning ahead of schedule.  Once we work our way through the first launch, all of the scans will be of better quality.


Craig

  • Shipherd
  • Hero Member
  • *****
  • Posts: 2972
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #4 on: October 28, 2012, 12:04:44 pm »
« Last Edit: October 28, 2012, 12:06:23 pm by gastcra »

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #5 on: October 28, 2012, 02:11:36 pm »
USRS Rush
Oct. 11 1890 remarks: page folded http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1698_1.jpg
Oct. 12 1890 weather observations obscured by report http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1699_0.jpg

Very unusual.  The folded extra paper is a total miss by the scanners.  I hope we eventually get another, because this includes the full readings and log for the 13th, as well as the captain's summary of something for the whole book.

Kevin/Philip/team member:
the log body for Oct.12th can be fully transcribed from these, no problem:
http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1698_0.jpg
http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1699_1.jpg

The log body for Oct.13th and then the captain's summary for the entire book should have been 4 half-pages; what there is of it can be seen here:
http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1698_1.jpg
http://oldweather.s3.amazonaws.com/ow3/USS%20Rush/Rush_1890/IMG_1699_0.jpg

I don't know if you want to redo just the extra paper inserted.
« Last Edit: October 28, 2012, 02:22:38 pm by Janet Jaguar »


AvastMH

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 7049
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #7 on: October 30, 2012, 09:58:30 pm »
I've just gone to transcribe - and been given a page I've already done:
http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_1.jpg
It was easy to recognize because of the 'small area of sea weed' mentioned. Is this a known problem please?
Joan

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #8 on: October 30, 2012, 10:09:56 pm »
That's distressing, because I found I'd done July 30th and 31st on the Patterson twice last night.  And didn't recognize it, because I'd done them on different days.  But it was the comments the cued me in also.

I'm going to ask Kevin, cc to Philip, how far this happens and what to do with the duplicates.  The curve of the weather pages is bad enough, twice or three times each page is a bit much.

and I'm moving both of our posts to "bad scans" - this is truly not an interface issue for Stuart.
« Last Edit: October 30, 2012, 11:23:55 pm by Janet Jaguar »

AvastMH

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 7049
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #9 on: October 30, 2012, 10:26:01 pm »
Thanks Janet. I thought I was gong mad on Sunday when it happened - and didn't quite believe myself. Luckily having sea weed on the brain has helped my memory this time. ;D ;D

Right - hammock time zzzzzzzzzzzzzzzzzzzzzzz
Sweet dreams all....

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #10 on: October 30, 2012, 11:23:20 pm »
Okay, cancel call to the PTB.  It is not as bad as I thought.

Lots of the intro pages were duplicated, including the lists of officers and ratings. (both _1 left blank)

http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4762_0.jpg
http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4763_0.jpg

http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4764_0.jpg
http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4765_0.jpg

Also these 3 dates - I say transcribe the first of each and skip the second.

July 30 (also _0 for each)
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4770_1.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4771_1.jpg

Aug. 3 (also _0 for each)
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4775_1.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_1.jpg

Aug. 8 (also _0 for each)
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4781_1.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4782_1.jpg


I checked the whole of August, and everything else is fine.  I'm saying, skip the following:

   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4771_0.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4771_1.jpg

   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_0.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_1.jpg

   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4782_0.jpg
   http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4782_1.jpg

I'm thinking the whole thing was a scanner-operator learning curve.  I'm just curious why it is that the duplicated pages are spaced that way.

Randi

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 12060
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #11 on: October 30, 2012, 11:49:38 pm »
I've just gone to transcribe - and been given a page I've already done:
http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_1.jpg
It was easy to recognize because of the 'small area of sea weed' mentioned. Is this a known problem please?
Joan

Here is the other image: http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4775_1.jpg

Note that on the corresponding weather pages (http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4775_0.jpg and http://oldweather.s3.amazonaws.com/ow3/final/USCS%20Patterson/Book%201/IMG_4776_0.jpg) the scanner's thumb is in a different place ;D

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #12 on: October 31, 2012, 12:07:10 am »
I know, they are 2 different scans of the same page. Sigh.

Randi, I need something on this put in the Reference sheet for 1884 Patterson:
Quote
Comments on the log: after "This log was processed when the scanning technique ..."
  • The following pages have been scanned twice; transcribe the first copy of each and skip the second.  The pages involved are:
    • The book covers, List of Officers, and List of Crew
    • July 30, August 3, and August 8, 1884

« Last Edit: October 31, 2012, 03:24:20 am by Janet Jaguar »

Randi

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 12060
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #13 on: October 31, 2012, 08:52:57 am »
I'll add a note, but I think it is better to keep it general rather than to specify just 2 pages.

I can also add a link to this thread and put some explanation in the first post of duplicate scan vs duplicate copy - and probably modify the name of this thread.

What do you think?

Janet Jaguar

  • Global Moderator
  • Hero Member
  • *****
  • Posts: 9655
  • Smell the sea, feel the sky, & fly into the mystic
    • View Profile
Re: Faulty scanned and duplicate pages
« Reply #14 on: October 31, 2012, 01:39:25 pm »
Your idea is more expansive, Randi, if the problem reoccurs past August.  Go with it. :)