Computer algorithm to decipher ancient texts
Wednesday 02 September 2009
Researchers in Israel say they have developed a computer program that can decipher previously unreadable ancient texts and possibly lead the way to a Google-like search engine for historical documents.
The program uses a pattern recognition algorithm similar to those law enforcement agencies have adopted to identify and compare fingerprints.
But in this case, the program identifies letters, words and even handwriting styles, saving historians and liturgists hours of sitting and studying each manuscript.
By recognizing such patterns, the computer can recreate with high accuracy portions of texts that have faded over time or were even written over by later scribes, said Itay Bar-Yosef, one of the researchers from Ben-Gurion University of the Negev.
"The more texts the program analyses, the smarter and more accurate it gets," Bar-Yosef said.
The computer works with digital copies of the texts, assigning number values to each pixel of writing depending on how dark it is. It separates the writing from the background and then identifies individual lines, letters and words.
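The first two steps described here, giving each pixel a darkness value and separating writing from background before finding lines, can be illustrated with a minimal sketch. This is not the researchers' program. It assumes a greyscale page stored as rows of integers from 0 (black) to 255 (white), a fixed cut-off of 128 chosen only for illustration, and hypothetical function names `binarize` and `find_text_lines`.

```python
def binarize(image, threshold=128):
    """Mark pixels darker than the threshold as ink (1), the rest as background (0).

    The fixed threshold is an assumption made for this sketch.
    """
    return [[1 if px < threshold else 0 for px in row] for row in image]


def find_text_lines(ink, min_ink=1):
    """Return (start_row, end_row) pairs, end exclusive, for each run of
    consecutive pixel rows that contain at least min_ink ink pixels."""
    lines = []
    start = None
    for y, row in enumerate(ink):
        has_ink = sum(row) >= min_ink
        if has_ink and start is None:
            start = y                  # a new line of writing begins
        elif not has_ink and start is not None:
            lines.append((start, y))   # a blank row ends the current line
            start = None
    if start is not None:              # writing runs to the bottom edge
        lines.append((start, len(ink)))
    return lines


# A tiny made-up "page": two lines of writing separated by a blank row.
page = [
    [255, 255, 255, 255],
    [250,  30,  40, 250],
    [245,  20, 255, 250],
    [255, 255, 255, 255],
    [255,  60,  50,  45],
    [255, 255, 255, 255],
]

print(find_text_lines(binarize(page)))  # [(1, 3), (4, 5)]
```

A faded or stained manuscript would likely need a threshold that adapts to each region of the page rather than a single fixed value, and splitting lines into letters and words is a further step not shown here.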
It also analyses the handwriting and writing style, so it can "fill in the blanks" of smeared or faded characters that are otherwise indiscernible, Bar-Yosef said.
The team has focused its work on ancient Hebrew texts but says the program can be used with other languages as well.
The team's work, which is still being developed, was most recently published in the academic journal Pattern Recognition, in an issue due out in December but already available online.
A program for all academics could be ready in two years, Bar-Yosef said.
And as libraries across the world move to digitize their collections, the researchers say the program can drive an engine that instantaneously searches any digital database of handwritten documents.
Uri Ehrlich, an expert in ancient prayer texts who works with Bar-Yosef's team of computer scientists, said that with the help of the program, years of research could be done within a matter of minutes.
"When enough texts have been digitized, it will manage to combine fragments of books that have been scattered all over the world," Ehrlich said.