
IARPA seeks AI teams to both attribute & anonymize authorship by identifying, removing writers’ ‘linguistic fingerprints’ from text

January 4, 2022

The Intelligence Advanced Research Projects Activity (IARPA) is holding a proposers’ day for an upcoming research program aimed at both identifying authors by their writing style and making authors anonymous by removing their “linguistic fingerprints.”

IARPA will hold a virtual proposers’ day on January 19 for its Human Interpretable Attribution of Text Using Underlying Structure (HIATUS) research program.

According to the HIATUS program description:

HIATUS seeks to develop novel human-useable AI systems for attributing authorship and protecting author privacy through identification and leveraging of explainable linguistic fingerprints.

The program will develop novel techniques to generate representations that capture author-level linguistic variation and will use these representations to build human-interpretable algorithms to perform authorship attribution and ensure author privacy (i.e., via removal of author-identifying characteristics from text).

In other words, IARPA’s HIATUS program has two mirroring parts — attribution and anonymization.

Part one is to identify an author with the help of artificial intelligence that analyzes patterns in their personal writing style, their “linguistic fingerprints,” which can include distinctive spelling, syntax, vocabulary, phrasing, punctuation, rhythm, and formatting.

They call this “attributing authorship.”
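To make the idea of a “linguistic fingerprint” concrete, here is a minimal Python sketch of one classic stylometric approach: count how often a writer uses common function words and punctuation marks, then compare the resulting vectors with cosine similarity. This is an illustration only, not IARPA’s or any HIATUS performer’s method. The feature lists, the function names (`fingerprint`, `cosine`, `attribute`), and the sample texts are all assumptions invented for this example.

```python
import math
import re
from collections import Counter

# Hypothetical feature set: a few common English function words and
# punctuation marks. Real attribution systems use far richer features.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in", "that", "is", "it", "but"]
PUNCTUATION = [",", ";", ":", "!", "?", "-"]


def fingerprint(text):
    """Return per-word usage rates for each feature in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    total = max(len(words), 1)  # avoid division by zero on empty text
    word_counts = Counter(words)
    vector = [word_counts[w] / total for w in FUNCTION_WORDS]
    vector += [text.count(p) / total for p in PUNCTUATION]
    return vector


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def attribute(unknown, samples):
    """Return the name of the sample author most similar to the unknown text."""
    target = fingerprint(unknown)
    return max(samples, key=lambda name: cosine(target, fingerprint(samples[name])))


# Made-up writing samples for two hypothetical authors.
samples = {
    "alice": "It is true; but the rain, the wind, and the cold: all of it is hard; it is grim.",
    "bob": "Wow! Can you believe that? That is a win! Is that a joke? No!",
}
unknown = "The road is long; but the end of it, and the hills: it is near."

print(attribute(unknown, samples))  # alice
```

The unknown text shares the first sample’s habit of semicolons, colons, and frequent “the,” “it,” and “is,” so it scores much closer to that author. Anonymization is the reverse problem: rewriting the text so that vectors like these no longer point back to one writer.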

The other part is to reverse the process by removing an author’s linguistic fingerprint — their “identifying characteristics from text” — thus making them anonymous.

They call this “protecting author privacy.”

The HIATUS program contact point is Dr. Timothy McKinnon, who was also a program manager for IARPA’s Better Extraction from Text Towards Enhanced Retrieval (BETTER) program, which in 2019 awarded a contract to Raytheon BBN — a company that harvests the text of social media postings and other data.

To get an idea of the type of experts IARPA’s HIATUS program is looking for, the teaming form asks for expertise in:

Explainable NLP
Software integration
Forensic linguistics
Text generation
Authorship attribution
Human computation
Authorship privacy
Other

According to the HIATUS program description, “Successful technical approaches will be scalable across diverse topic domains, genres and languages.”

While the research funding arm of the US spying apparatus doesn’t give any specifics on practical applications or real-world use cases for the technology, the fruits of the HIATUS program could be used across multiple scenarios.

In a research and analysis setting, the attribution aspect could be applied to something as simple as matching the correct authors to a cache of unattributed documents for a variety of intel purposes and/or in-house record keeping.

In the real world, the anonymization aspect could add another layer of protection for journalists, whistleblowers, refugees, or spies in the field by revealing which linguistic features adversaries could use to identify them. At the same time, the government could use the attribution component to identify an adversary’s communications by their distinct linguistic fingerprints (e.g., forensic linguistics, code breaking).

Through authentic authorship attribution, the flow of information can become more transparent, or at the very least, more organized.

On the flip side, the flow of information can become even more distorted by anonymizing the source, concealing its origins, and adding more noise to the channel — a tactic used by spy agencies in which, “You create so much noise in the channel that people start to have overall doubts on all information that’s available in the media, social media, and other places,” as one former NSA foreign surveillance agent told The Sociable.

Whatever real-world applications come out of IARPA’s HIATUS program, they will involve one or more of the following: analyzing text for linguistic fingerprints, attributing authorship, removing author-identifying characteristics, and enlisting artificial intelligence.

This opens the door for the US intelligence community to know more accurately who wrote what, and also how to better conceal or fake the source.

Another tool for the kit.

Spies will be spies.


Tim Hinchliffe

The Sociable editor Tim Hinchliffe covers tech and society, with perspectives on public and private policies proposed by governments, unelected globalists, think tanks, big tech companies, defense departments, and intelligence agencies. Previously, Tim was a reporter for the Ghanaian Chronicle in West Africa and an editor at Colombia Reports in South America. These days, he is only responsible for articles he writes and publishes in his own name.