Fast and easy way to cut a long story short
Summarising a text is a key human skill. But can software do it? Roderick Neil Kay looks at progress
Tuesday 29 July 1997
The ability to summarise a text is founded on core intellectual skills, so much so that the trials in the task still appear in numerous IQ and recruitment tests, including those run for the elite of the Civil Service.
Like many other products involving natural language, the new summarising technology doesn't quite live up to its billing. Summarisers have now been developed by a host of companies, including Oracle, InXight and BT, all based on similar statistical techniques. The simplest such method runs as follows. First, find the most frequently occurring words in a text, excluding trivial words. Second, locate the sentences in which groups of these words occur. Third, extract these sentences and compile them in chronological order. Then describe the compilation as a summary, without blinking.
One of the most surprising things about the statistical techniques at the centre of the new software is that the ideas have been around for a long time, almost before the dawn of AI, in fact. As early as 1958, while working for IBM, Luhn ran some experiments on a corpus of technical articles, using the algorithm just described. He was enthusiastic about the results: "The auto-abstract is perhaps the first example of a machine- generated equivalent of a completely intellectual task in the field of literature evaluation."
But at the time, text retrieval wasn't the hot topic it is today, and his idea was never marketed in the form of software. The view within the AI community, which has always aimed at getting computers to understand language, has been that statistical techniques are OK as far they go, but if you consider what a human can do, that isn't very far. The emergence at this point of the new summarisers probably owes as much to bumped-up demand as it does to advancement in the field.
While the recent crop of summarisers fall reassuringly short of human performance, their wide availability should generate the kind of interest which leads to improvement. And anyway, enough of human condescension; let us allow the computer to speak - or rather to summarise - for itself. The following extract is a 20 per cent summary of this article, produced by BT's Netsumm.
"Automatic summarising has long been considered one of the most prized goals in artificial intelligence, but working summarisers have now finally appeared based on far more superficial techniques. Summarisers have now been developed by a host of companies not normally associated with text processing: Oracle, InXight and British Telecom, all based on similar statistical heuristics. The view within the AI community, which has always aimed at getting computers to understand language, has been that statistical techniques are OK as far they go, but if you consider what a human can do, it isn't very far"n
Life & Style blogs
NHS struggling to monitor the safety and efficacy of its services outsourced to private providers
Airline food across the classes: Ever wondered what the other half are eating?
Coachella Festival 2015: from Kendall Jenner to Alexa Chung, stars and festival-goers parade their boho best
What do the emoji on Snapchat mean?
Huawei P8 review: best phones nobody's seen from the biggest company nobody's heard
If I’m being racially abused I don’t need a stranger with a saviour complex to rescue me
The only black face in the Ukip manifesto is on the page about overseas aid
Ukip is the only main political party to not address LGBT rights in its manifesto
Food banks: One million Britons will soon be using them, according to Trussell Trust
Religion isn't growing, it is becoming vigorous in its demise, says philosopher AC Grayling
BBC election debate: The one photo that summed up the whole 90-minute leaders debate
- 1 Rarest Beanie Baby of them all could be sold for £62,500 on eBay
- 2 Ben Affleck asked TV chiefs to hide slave-owning ancestry, new hacked Sony emails published by Wikileaks claim
- 3 Driving while dehydrated can be just as dangerous as drink driving, study suggests
- 4 Farmer told to tear down mock-Tudor castle after hiding construction behind hay bales
- 5 One Direction: Louis Tomlinson launching his own record label, has already 'signed two acts'
£18000 - £23000 per annum: Recruitment Genius: They work with major vehicle ma...
£16500 per annum: Recruitment Genius: A Chiropractic Assistant is needed in a ...
£18000 - £26000 per annum: Recruitment Genius: They work with major vehicle ma...
£28000 - £30000 per annum: Recruitment Genius: This company provides coaching ...