The British Library is handing 250,000 books to Google for scanning into the Google Books project. The Library had previously partnered with Microsoft which digitised books from the 19th century and Leonardo da Vinci's notebooks. The Library only owned one of the notebooks - the second was from Bill Gates' own collection. The …
Regardless of who gets the job of scanning in those documents, I'm pleased to hear that the British Library is taking steps to convert their libraries into digital files. Hopefully there will be something of use to researchers who were otherwise unable to locate specific texts or travel to the library to do so.
Also help them to last longer
Paper degrades with time and is costly to restore or keep in good condition. Bytes (in open, documented formats) should last much, much longer.
Don't be so hasty
Digital storage has only been around for 50 years or so, and most digital content available now is much younger than that, whereas the books that are being digitized are sometimes ten times as old and are clearly in a robust enough state to be handled by Google.
The effects and problems associated with long-term digital archiving have yet to be seen. Not that I'm against such archving, quite the opposite, but it should not be assumed that digital is "better" in some regard. Thankfully there has been lots of research, contingency plans, and strategies drawn up to manage digital archiving, but only time will tell if it stands the test of time (to paraphrase Van Halen)
I have documents on my computer that I can't read. They were created in an early version of some Apple word processonf 22 years ago. They are intact, but unreadable in any modern programme I have available.
I have tapes (35 years old), on 6250 BPI reels, that I cant read now. Bye Bye University final year project.
My DAT tapes may well survive the century the company claims. Doubt the DAT reader will (I have been through three in 10 years), and they are now becoming like hens teeth for the tapes I have. Yes, its fine to upgrade, but what about the tapes I have in the safe stpre?
As the BL shows, documents last several 100 years and are still readable. Stone tablets even longer. Now thats technology built to last.
Mine's the one with the stolen Rosetta Stone in the pocket, yes the heavy one.
Where have you been?
There's been a huge scandal in the library world for some time now about the destruction (throwing away) or paper copies and keeping only the microfilm/microfiche and now digital versions. But paper lasts for thousands of years. We are still finding paper in archaeological sites in Egypt with lost ancient Greek plays and other writing on them, all still viable, all still readable. Paper and vellum are both tough materials. I have read medieval parchment documents that are still whole. entire, and clear, while my microfilm version of the same documents is already degrading, due to not being able to store it in optimal conditions.
We have already seen formats of the last fifty years change, become obsolete, and become unreadable. I am sure other Reg readers will name them. I found a 3.5 inch disk the other month but my all-card reader, which I retained for just such a possibility, gave up trying to read it, and so all those data are lost.
I am all for copies of material. I would rather students read Austen online or on a Kindle than to keep requesting the original edition at the British Library, where they do not say no and are thus allowing these books to be thumbed to pieces by the merely curious. The idea of a single copy of anything makes my blood run cold.
I do share the disquiet of other Reg readers on who is doing this digitising. Already in the USA and Europe there is deep concern about handing the only digital version to a company that exists to make profit. France is digitising its own patrimony itself, while the UK merrily hands it to an international company whose HQ is in a country that interprets copyright to its own benefit. The British Library was recently exposed as actually throwing out books. It seems that we cannot trust these librarians to understand that they are the custodians of all materials in their collection. I see these latest deals as the little lambs of the BL wagging their tails as they skip to the slaughter.
The good thing about digital copies is that you can move them to a new medium on a yearly basis.
an early version of some Apple word processor...
Going by my experience with doc files from old versions of MS Word, I think there's a very high probability that you'd be able to read the text (if not the formatting) by opening the files with a text editor/hex editor/binary editor like the old DOS Edit.
> unreadable in any modern program[me] I have available
strings? C? or are they not "modern"?
I'll bet your 22-year old program is easier to reverse-engineer than PDF is.
"We are still finding paper in archaeological sites in Egypt"
Nope we are finding papyrus in Egypt, a completely different technology. Paper was a _very_ long way off, even for the Chinese.
Correct, but point still valid
Ms Handcart, you are quite right: I typed in haste. But I have handled books from pre-1500 and newspapers from the 18th century and they were holding up well. Old rag-based paper is amazing. I have seen early 'Advertisers' in better shape than crumbling Penguin paperbacks from the 1930s, but the better acid-free paper we're seeing should last for many centuries.
Papyrus and vellum and so on have staying power. They live even when the electric sub-station is dead. Digital requires the juice be flowing somewhere, simply to retrieve.
I still think that national or international consortia digitising material as an inheritance and resource for their citizens, or for the whole human race, is a wonderful thing, and applaud the initiatives that exist. I don't applaud handling over this rare and precious stuff for the 'Don't do Evil' guys.
Paper is no good
Yeah, yeah, I've heard all the guff about this new-fangled paper-and-ink stuff, but I ask you: Who's going to trust that? If you really want your magnum opus to survive down the millennia, you'll do what serious professional writers have done for centuries: carve runes into stone. Runes don't fade and discolour, the way that ink does, and a well-manufactured stone tablet will survive even a conflagration. Backups are barely necessary, but if you want to be sure, a few dozen slave scribes will be able to make a complete copy of all your data in only a few short years. Why anybody wastes their time on these nonsensical modern fads is beyond me.
You are kidding, right?
"Paper degrades with time and is costly to restore or keep in good condition. Bytes (in open, documented formats) should last much, much longer."
Paper will undoubtedly last infinitely longer than any currently stored digital media. In a century from now no one will know a damn thing about any of us because most of it (digitally and magnetically stored as it is) will have gone. Even optical media -- its not stable.
This is good news, my weekends will be filled for a long time
This is good news. Especially the fact that it can be republished also on a non-Google, non-BL platform like Europeana. Google seems to get more permissive with the license arrangements it offers to libraries and this is a good thing.
Does the BL get any money from this?
After all, its my Tax Pound that has gone in to maintaining this library for the good of Fun Factory, it seems.
>>After all, its my Tax Pound that has gone in to maintaining this library for the good of Fun Factory, it seems.
"His rage passes description - the sort of rage that is only seen when rich folk who have more than they can enjoy suddenly lose something that they have long had but have never before used or wanted."
So, how often do you use the British Library, Adrian?
We're gonna need a bigger Kindle.
> We're gonna need a bigger Kindle.
I don't know if you meant it like that, but what guarantees the survival of books is the existence of many copies. The books that have survived up to now are the ones of which at least one copy survived fires/floods/bombings/dry rot/religious zealots, the chance of which was proportional to the number of copies in different places.
If the Kindle Mark 15 in 2025 comes with the out-of-copyright section of the British Library preloaded on 10% of its storage capacity, and people can't be bothered deleting it but just keep copying the complete contents to their next device, the books will survive somewhere.
I can see only two developments invalidating my theory:
- The move to "cloud storage" means there aren't any actual copies.
- Disney and friends keep stopping old content falling out of copyright.
Let's not hope so...
Will these digitised out of copyright books ...
suddenly acquire a 'copyright google' watermark?
Clearly the collection is unique and thus very valuable. I do hope that the BL has not given away our heritage too cheaply or have entered into a restrictive deal.
"A spokesman for the British Library said the deal with Microsoft was simply different and there was no question of scale problems with its technology"
No just problems using it with software other than Microsoft's. At least partnering with Google will increase the number of folks that are able to use the digital archive.
Shame on the British Library
They were hijacked by Microsoft to act as an ad for Vista - at one point their website was trumpeting material that could only be viewed with that OS. I complained to no avail - obviously they were in Microsoft's pockets. Now it seems that has gone the way of Vista and now I expect they are in Google's pockets, but you're right - at least more people will be able to access the stuff. I'm not surprised to hear about their newspaper deal - they are either corrupt or very stupid.
While we're on about Goggle and copyrighting stuff, what about Goggle Maps?
I can see a God's-eye view of my house, in a little window that says 'Copyright2011 Google' but it shows an empty field next door where, actually, houses were built in March 2006.
If the pictures were taken in (say) 2005, how come Goggle can update its copyright this year (and every year ad infinitum)?
Maybe we should see this as building infrastructure in cyberspace.
I seem to recall that ownership of the Channel Tunnel passes from Eurotunnel to the State after some period of time during which they get to recoup their costs. Old bridges that we now walk or drive over for nothing used to have tolls. That's how you get the private sector to do things that we need.
So long as the BL and its readers get to use the digital version for nothing after ten years or whatever this seems to be ok. What we don't want is for Google to own the BL indefinitely.
Been across the Humber Bridge recently?
You can just see the Library of Congress handing centuries of historical texts to a British or even European company to make sh*t loads from adverts - jesus ****ing wept!
this reminds me the following article
If it costs the public nothing to scan, and nothing to access the scans, then it looks like a good deal. Google will offer the option to produce on-demand books, and if you don't like, print your own, or look at the originals. Looks good all round.
- Review Is it an iPad? Is it a MacBook Air? No, it's a Surface Pro 3
- Game Theory The agony and ecstasy of SteamOS: WHERE ARE MY GAMES?
- Hello, police, El Reg here. Are we a bunch of terrorists now?
- Microsoft and HTC are M8s again: New One mobe sports WinPhone
- Worstall on Wednesday Wall Street woes: Oh noes, tech titans aren't using bankers