Professional Documents
Culture Documents
“We did everything we said we were gonna do and nobody can take that away from
us …”
Zack to Sheridan in Babylon 5: “Sleeping in Light”
Like written in the previous posting, I wanted to digitize the contents of my book
shelf … and I did it. Last weekend, I cut and scanned 109 books. Yes, one hundred
and nine books. OCR is also finished, as is reducing the file size (my MacBook was
busy at night).
The reduced file size is still readable, although in some cases the compression is
noticeable if you look closely at the text. But for reading them on the screen the
quality is very well and turning the virtual pages is snappy with the reduced file size
versions. However, I will keep the original scans. Color is compressed here as well
(medium quality), but the quality is better than the reduced file size versions, and I
might need them.
Why did I cut (and thus destroyed the physical version of) my books? There are a
few reasons:
1. You end up with one virtual page equals one printed page.
Two book pages on one scanned page (which you would get if you open the
book and put it on a flatbed scanner or copier with a scan unit) might save
paper if you want to print it, but printing is not the reason for scanning. The
reason is to read it on screen and easily make notes with the same medium
and for this one page on one scanned page is ideal. If you want to print it
some day you can still use the driver of the printer to put two scanned pages
on one printed page.
2. No black areas.
If you are using a scanner with automatic size detection you get scans that
have no black areas which cost a lot of ink/toner and look atrocious.
3. It’s fricking fast.
I am really fast in scanning books with a normal copier with a scan unit,
where you have to put on each page spread and press a button. I did it during
my time at the university when I was working as a student assistant, and I did
it afterwards. I am a squirrel in many ways and I like to keep what I find. But I
also want to work unencumbered by stuff that slows me down. Digital is the
way to go and over time I really got fast in digitizing stuff. But this way
surprised even me. It’s just so … fast.
Here are some tips for digitizing books (by gutting them):
Strategy
1. Go in brackets.
For example, first scan all books about climbing, then all books about
photography, and so on, so you ahve specific blocks of books that you have
digital. Dealing with categories is more motivating than having 12 of 100
books scanned.
2. Start with the most precious book you want to scan but leave the most
important category for last.
Afterwards, you have no qualms about scanning the “lesser” books. It’s
kinda like dealing with pain, the worst pain comes first, then you know
whatever comes next won’t be as worse and it’s easier to bear. Then again,
keep the most important category for last. This will motivate you to go to the
end.
3. Don’t try to compete with the scanner.
First I tried to cut books while the scanner was busy with the 25-30 pages it
was scanning. Only the scanner beat me each and every time. I tried to work
faster and this nearly destroyed some books (destroyed in the sense that I
cut into the text). So I stopped competing with the scanner, which also
meant stopping to parallelize the work. I first cut all books belonging to one
category and then I did scan them (in some cases, I turned the pages of the
block that was scanned next while the scanner was working to ensure that
the pages are really separated from each other and do not cause a paper
jam). But competing with the scanner … nope, the scanner is just to fricking
fast.
Important note: Working with a cutter/blade is dangerous. If you are too stupid to
realize that, don’t do it. I do not take any responsibility to what happens to you,
your fingers or skin or whatever if you try to cut your books.
1. Do not cut where you scan.
It might be nice to scan and at the same time (and place) cut the next book.
Don’t. The cutting creates dust which will gravitate to the scanner and you
have to clean it more often. Cut away from the scanner. I did the cutting in
the “kitchen” while the scanner was on my desk about five meters away.
2. If you hurry a book dies needlessly.
Cutting is delicate work and it’s very easy to cut into the letters on the
page. So, take your time and cut carefully — many strokes with small
advances cut the book.
3. Remove the cover first.
Even with paperbacks it’s more easy to cut if you remove the cover first.
This often includes the first page, which is glued to the cover differently than
the rest and can give you the impression of cutting farer away from the text
than you really are (i.e., a book will die). It’s much easier to cut if the cover
(including the back cover) is out of the way and you are only dealing with the
book block.
In retrospect I get dizzy when I compare the time it took me to digitalize the books
this way compared to the time it would have taken me with my old flatbed scanner.
Instead of cutting the books, putting them in and going zzz, zzz, zzz … finished, it
would have been a long process even to get a single page scanned (open the book,
put the correct spread on the flatbed scanner, press the button each time, waiting
until the scanner heats up, waiting the painful seconds it takes to scan the page,
correct the scanned page with the software, etc. pp. And it wouldn’t have been that
much better for the book. Sure, the book would still be a book, but opening the book
far enough to scan the whole spread (and avoid black bars in the middle where the
paper is in some distance from the glass) would also have damaged the spine.
I guess it’s another strong reminder to make damn sure you use the right tools and
the right technique to do the job, or to quote someone: “If you have eight hours to
cut down a tree, it is best to spend six hours sharpening your axe and then two
hours cutting down the tree.” And despite the high costs of destroying the books,
this way worked admirably.