Teaching computers to understand human language

Despite big advances in machine learning, making computers understand language is still a big challenge. Daniel Varab and his two classmates from Software Development trained a computer program to detect contradictions in texts – a technology that might eventually help us keep track of statements made by politicians and contradictions in the law.

Computer Science Department Education artificial intelligence algorithms ITU thesis

Written 13 December, 2017 08:23 by Vibeke Arildsen

What was your thesis about?

Inspired by the presidential election in the United States, we thought it would be fun if a computer program could automatically find contradictions in the things politicians say during the election race. For example, Donald Trump stated in 1999:

"Look, I’m very pro-choice. I hate the concept of abortion. I hate it. I hate everything it stands for. I cringe when I listen to people debating the subject, but you still — I just believe in choice."

In August 2015, however, he said: "I am very, very proud to say that I'm pro-life."

It would be great if a computer could help us find such contradictions.

So we immersed ourselves in Natural Language Processing (NLP), a field concerned with getting computers to understand human language. NLP is used for instance in the iPhone’s Siri, Google Translate and in Word’s spell check. It is also used to analyze whether texts are positively or negatively charged.

More specifically, we worked with contradiction detection – that is, a method of getting computers to assess whether two sentences contradict each other.

How do you teach a computer to find contradictions?

By feeding it a ton of examples of sentence pairs that contradict each other and sentence pairs that do not. We trained a machine learning algorithm with a data set from Stanford University with 500,000 sentence pairs and then tested it on sentences it had never seen before.

We found that the model worked best when we provided it with information about how linguists define a contradiction. For example, two sentences probably contradict each other if they contain antonyms. There is much hype about machine learning algorithms finding patterns in information all by themselves, but in practice, you get much further if you help them.

In the end, our model could detect with an accuracy of 86 percent whether two sentences contradicted each other. Funny enough, only 87 percent of a control group of humans could agree on the same sentences.

Human language is largely about interpretation, and this is one of the reasons why teaching it to computers is so difficult.
Daniel Varab, MSc in Software Development

Human language is largely about interpretation, and this is one of the reasons why teaching it to computers is so difficult.

What can we use it for?

There is a huge amount of information out there, and it’s impossible for people to have an overview of everything that is said and written by, for example, media and politicians.

It would be useful to have a tool that could automatically find contradictions, for instance between what a politician said two months ago and what he is saying today. Such a tool could also be used to detect contradictions in legal texts or to spot fake news.

There are many exciting perspectives, but still some way to go before computer’s understanding of languages is sophisticated enough.

Further information

Vibeke Arildsen, Press Officer, phone 2555 0447, email viar@itu.dk

News

ITU receives two Danish Data Science Academy Fellowships

26 June, 2025

Each year, the DDSA awards a total of 10 PhDs, and 6 postdocs. This year, ITU has secured two – Nils Grünefeld who will undertake a PhD in Machine Learning and Natural Language Processing, and Ola Rønning will begin a postdoc project in Probabilistic Programming.

ITU researcher wants to improve statistics models

26 June, 2025

Professor Andrzej Wasowski has been granted DKK 6.1 million from the Independent Research Fund Denmark. The grant is given for a project that is looking into how probabilistic models can become more reliable.

ITU researcher receives grant for project on verification of reflective programs

24 June, 2025

Assistant Professor at the IT University of Copenhagen, Eduard Kamburjan, has received a Sapere Aude grant of almost DKK 6.2 million from Independent Research Fund Denmark. The grant will fund a project that will investigate how to verify reflective programs.

Morten Hjelholt appointed head of research

20 June, 2025

Professor Morten Hjelholt has served as interim head of research since January and is highlighted for his “commitment, conviction, and a management philosophy”. Starting 1 August, he will take on the position permanently.

ITU researchers want to bring classical music to you

17 June, 2025

Is it possible to use technology to bring arts and music closer to people? This is one of the purposes of the research project XTREME, which is investigating how mixed reality can be used to bring music and art experiences to audiences that otherwise have some barriers to experience them.

Jonas Juul has been accepted into the Young Academy

10 June, 2025

The Young Academy has revealed which talented young researchers have been admitted this year. Among them is Assistant Professor Jonas Juul from the It University of Copenhagen.

Professor portrait: Thomas Binder's research connects to a changing world

2 June, 2025

On 19 June 2025 at 14:30, Professor Thomas Binder will give his inaugural lecture in Auditorium 0 at the IT University of Copenhagen. The lecture is entitled: “What design can do and how it matters”.

Professor portrait: Veronika Cheplygina improves the field of machine learning through meta-research

26 May, 2025

On 10 June 2025 at 14:30, Professor Veronika Cheplygina will present her inaugural lecture in Auditorium 0 at the IT University of Copenhagen. The lecture is entitled: “Not real research”.

"The aim is our trust"

6 May, 2025

As part of the Danish Science Festival, the IT University and the newspaper Dagbladet Information gathered a number of experts to discuss cyber warfare in Denmark and how prepared we are for it. The Minister of Resilience and Preparedness, Thorsten Schack Pedersen, also participated in the talk.

Professor portrait: Nutan Limaye is pushing the boundaries of complexity theory

1 May, 2025

On 22 May 2025 at 14:30, Professor Nutan Limaye from the section Theoretical Computer Science will present her inaugural lecture in Auditorium 0 at the IT University of Copenhagen. The lecture is entitled “My reflections on the last two decades and Complexity Theory”.

Professor portrait Anna Vallgårda challenges the design of care technology

24 April, 2025

On 9 May 2025 at 14:30, Professor Anna Vallgårda will give her inaugural lecture in Auditorium 0 at the IT University of Copenhagen. The lecture is entitled: ”Radical Redesign of Care Technologies”.

Is Denmark prepared for cyberwarfare?

8 April, 2025

A group of researchers from the IT University of Copenhagen is investigating what Denmark can learn from Ukraine in terms of preparing for cyberwarfare. Cyberwarfare does not just affect governments and companies, but also civilians, and the researchers ask what should be done if we come under attack.

Researchers aim to teach math students critical thinking with data science

31 March, 2025

In a new research project at the IT University of Copenhagen and the University of Copenhagen, a group of researchers will investigate how data science can become part of high school mathematics education to provide students with a better foundation for critical thinking and the ability to illuminate and nuance claims they encounter in their daily lives.

ITU researcher secures grant to improve safety of AI systems

19 March, 2025

At Advanced Institute of Science and Technology in Japan, Associate Professor Alessandro Bruni from ITU is currently conducting research on the mathematical foundation for developing verifiably correct machine learning frameworks. The project is supported by the Carlsberg Foundation.

Professor portrait: Vasilis Galis found his way in research on the Athens metro

13 March, 2025

On 28 March 2025 at 14:30, Professor Vasilis Galis from the section Technologies in Practice will present his inaugural lecture in Auditorium 0 at the IT University of Copenhagen. The lecture is entitled “Research against dead time”.

ITU researcher investigates elections in Greenland

11 March, 2025

On 11 March 2025, the election for Inatsisartut (Greenland's parliament) will take place. For several years, researchers from ITU, led by Professor Carsten Schürmann and Center for Information Security and Trust, have been investigating election and the possibility of internet elections in Greenland, and the election today is no exception.

IRFD funded ITU project to develop theoretical foundation for probabilistic session types

6 March, 2025

The increasing technological complexity makes probabilistic understanding and management of critical computing systems a necessity. A new research project, led by Associate Professor Marco Carbone, aims to develop the foundation for probabilistic session types to that end.

Urban highways are barriers to social connections

5 March, 2025

Researchers from IT University of Copenhagen have proved that urban highways limit social connections in the 50 largest cities in the US. It is the first ever quantitative evaluation of the barrier effect of urban highways in reducing social connections across neighborhoods.

New research to find efficient strategies for prevention of epidemics

26 February, 2025

Assistant Professor at ITU, Jonas Juul, receives a Novo Nordisk Foundation Data Science Investigator grant of DKK 6.5 million for a project that aims to improve statistical methods for predicting outbreaks of infections.

Within Limits – an exhibition on computation and constraint

24 February, 2025

On 7 March, join Artist Jacob Remin, Associate Professor James Maguire and Postdoc Frauke Mennes from the Center for Climate IT at ITU for the launch of Within Limits – an art installation that questions and reimagines the scalar logics inherent in computational worlds.