Encyclopædia Britannica, Eleventh Edition
Encyclopædia Britannica, Eleventh Edition

Want to write a best-seller? Scientists claim this algorithm will tell you how

'Statsitical stylometry' looks at vast amounts of data in order to sift out the stylistic tropes that define a popular novel

Sophie Murray-Morris
Friday 10 January 2014 15:31
Comments

Ever wondered what the secret is to a novel’s success? Computer scientists from the US think they might have discovered the secret.

The new technique, with an accuracy rate of 84%, can tell aspiring writers whether their book will shoot to fame or be a total slump even before it is published.

Researchers at New York based Stony Brook University analysed over 40,000 books from a broad range of genres, as well as film scripts, to collate the findings. Notable titles included A Tale of Two Cities by Charles Dickens and The Lost Symbol by Dan Brown.

The technique, called statistical stylometry, differentiates between highly successful literature and less prosperous literary works by using vast amounts of data to define variations in literary style between one writer or genre and another.

The researched defined a book’s success by looking at its download figures and Amazon sales records.

A high percentage of verbs, adverbs and foreign words could be the reason why some books are failing, according to the research. They may also rely on verbs that more explicitly describe actions and emotions, including words such as “wanted”, “took”, “promised”, “cried”, and “cheered”. These books may also depend on overused words, such as cliché terms like “love” and their settings may be common geographical settings.

In contrast, more successful books use more conjunctions such as “and”, “but”, and “or”. They also included more thought-processing verbs such as “recognised” and “remembered”, the research revealed.

Yejin Choi, assistant professor at Stony Brook University, said: “Predicting the success of literary works poses a massive dilemma for publishers and aspiring writers alike.”

She added: “Based on novels across different genres, we investigated the predictive power of statistical stylometry in discriminating successful literary works, and identified the stylistic elements that are more prominent in successful writings.”

“Our work is the first that provides quantitative insights into the connection between the writing style and the success of literary works.”

Register for free to continue reading

Registration is a free and easy way to support our truly independent journalism

By registering, you will also enjoy limited access to Premium articles, exclusive newsletters, commenting, and virtual events with our leading journalists

Please enter a valid email
Please enter a valid email
Must be at least 6 characters, include an upper and lower case character and a number
Must be at least 6 characters, include an upper and lower case character and a number
Must be at least 6 characters, include an upper and lower case character and a number
Please enter your first name
Special characters aren’t allowed
Please enter a name between 1 and 40 characters
Please enter your last name
Special characters aren’t allowed
Please enter a name between 1 and 40 characters
You must be over 18 years old to register
You must be over 18 years old to register
Opt-out-policy
You can opt-out at any time by signing in to your account to manage your preferences. Each email has a link to unsubscribe.

Already have an account? sign in

By clicking ‘Register’ you confirm that your data has been entered correctly and you have read and agree to our Terms of use, Cookie policy and Privacy notice.

This site is protected by reCAPTCHA and the Google Privacy policy and Terms of service apply.

Join our new commenting forum

Join thought-provoking conversations, follow other Independent readers and see their replies

Comments

Thank you for registering

Please refresh the page or navigate to another page on the site to be automatically logged in