Categories: Social Media

Researchers develop ‘sarcasm detector’ for social media

Researchers have developed algorithms that detect sarcasm on social media platforms.

When trying to convey irony or sarcasm in writing, many social cues are lost, and the intent can be completely misunderstood.

Luckily, researchers have developed a “sarcasm detector” that relies on contextual evidence to determine whether or not someone is being sarcastic in writing rather than depending on eye contact and body language for cues.

According to research by David Bamman and Noah A. Smith at Carnegie Mellon University, “Most computational approaches to sarcasm detection … treat it as a purely linguistic matter, using information such as lexical cues and their corresponding sentiment as predictive features.”

Unfortunately, lexical cues alone cannot decipher the overall intended meaning of a linguistic utterance or speech act. Cultural differences, interpersonal relationships, and other anthropological background information is required to fully understand the context.

The sarcasm professors have shown that “by including extra-linguistic information from the context of an utterance on Twitter – such as properties of the author, the audience and the immediate communicative environment – we are able to achieve gains in accuracy compared to purely linguistic features in the detection of this complex phenomenon.”

By collecting data from Twitter posts, Bamman and Smith were able to analyze factors such as keywords, common phrases, the density of hashtags, and the use of intensifiers such as dare, shocked, clearly, so, very, and too.

Analyzing these “Tweet Features” have helped the researchers to develop an overall picture of when someone is being sarcastic. Among the most informative of the above has been the use of hashtags in determining sarcasm.

Apart from the obvious #sarcasm, the more hashtags someone uses, combined with emoticons, the more the sarcastic meaning can be fully grasped.

Interestingly, using #sarcasm has been found to be reserved for audiences that are unknown to the writer while utterances among family, friends, and acquaintances are usually more understood and the hashtag is not required.

The researchers point out that “in the absence of shared common ground required for their interpretation, explicit illocutionary markers are often necessary to communicate intent” – such as the hashtag or emoticon.

However, even with a maximum success rate of 85.1% sarcasm detection in certain tests, more research is still needed. As anyone who has had a frustrating or confusing text with family members can personally attest, using sarcasm in writing with a loved one oftentimes can lead to awkward misinterpretations.

“Studying sarcasm that does rely on common ground (and does not require such explicit markers) will likely need to rely on other forms of supervision,” the paper concludes.

Tim Hinchliffe

The Sociable editor Tim Hinchliffe covers tech and society, with perspectives on public and private policies proposed by governments, unelected globalists, think tanks, big tech companies, defense departments, and intelligence agencies. Previously, Tim was a reporter for the Ghanaian Chronicle in West Africa and an editor at Colombia Reports in South America. These days, he is only responsible for articles he writes and publishes in his own name. tim@sociable.co

View Comments

Recent Posts

Ethical Imperatives: Should We Embrace AI?

Five years ago, Frank Chen posed a question that has stuck with me every day…

4 days ago

The Tech Company Brief by HackerNoon: A Clash with the Mainstream Media

What happens when the world's richest man gets caught in the crosshairs of one of…

4 days ago

New Synop app provides Managed Access Charging functionality to EV fleets

As companies that operate large vehicle fleets make the switch to electric vehicles (EVs), a…

5 days ago

‘Predictive government’ is key to ‘govtech utopia’: Saudi official to IMF

A predictive government utopia would be a dystopian nightmare for constitutional republics: perspective Predictive government…

6 days ago

Nilekani, Carstens propose digital ID, CBDC-powered ‘Finternet’ to be ‘the future financial system’: BIS report

The finternet will merge into digital public infrastructure where anonymity is abolished, money is programmable…

2 weeks ago

Upwork’s Mystery Suspensions: Why Are High-Earning Clients Affected?

After more than ten years on Elance / oDesk / Upwork, I dare to say…

2 weeks ago