10.07.2025

What LLMs in programming can and cannot do

Do large language models indicate the end of traditional programming? A pro and con presented by Professor Gordon Fraser from the University of Passau and his guest from the industry, Marko Ivankovic.

Do large language models indicate the end of traditional programming? A pro and con presented by Professor Gordon Fraser from the University of Passau and his guest from the industry, Marko Ivankovic.

Programming languages exhibit a high degree of regularity. One might think that large language models would therefore be particularly well suited to automatically generating source code. But does this really spell the end for human programmers? No, says Professor Gordon Fraser. He holds the Chair of Software Engineering II at the University of Passau and conducts research into software quality. Testing code is an important part of the software development process.

As part of the lecture series ‘Artificial Intelligence – Between Hype and Reality,’ Professor Fraser substantiated his position with, among other things, a study from last year that examined how EvoSuite performed in comparison with the GPT-4o model. EvoSuite is a tool developed by Professor Fraser 15 years ago to check the quality of Java software. The tool automatically generates unit tests that are designed to check the code as close to 100 percent as possible. It works with a so-called evolutionary algorithm, an optimisation process inspired by biological or physical models. The study shows that even a current GPT model cannot match the precision of the 15-year-old tool when it comes to testing software code – for Professor Fraser, this is clear evidence that LLMs can generate syntactically correct programming text, but still lack semantic understanding.

Another point is that software engineering is not just about programming and testing. ‘It's also about designing intelligent software systems.’ Customer requirements must be recorded and analysed. This is the basis for a sustainable system, and in this respect, humans are superior to machines, at least in the long term. His thesis is that software generated by large language models may be superior at the beginning. But over time, designs developed by humans would prove to be significantly more robust and durable. He summed up his thesis with the following deliberately exaggerated graphic:

A view from the field – where LLMs are already replacing humans

Contradiction came from the industry: Marko Ivankovic from the London-based software company Cogna Ltd. was a guest at the lecture series. It has already fully automated the software development process for specific and small programs – across the entire cascade of the so-called waterfall model. This process model is used in software development and describes various project phases, from requirements to design and implementation to the maintenance phase.

Ivankovic explained that the company relies on LLMs for all phases, even when gathering requirements. ‘We invite customers in and let them talk about their requirements for 45 minutes. The artificial intelligence listens and structures what is said afterwards.’ In his view, there is no reason why AI cannot also design the software system. The large language models are also ‘extremely helpful’ during testing. However, he admitted that even in his company, it would not be possible to do without humans entirely: ‘Humans are still in the loop, checking and refining the software.’ But the language model can also be consulted when errors are discovered. His conclusion: ‘Humans supported by language models are the best of both worlds.’

In his opinion, language models would take over many steps in the software development process that have previously been done by humans. Classically trained programmers could therefore face real problems in the future job market. Nevertheless, language models would lead to an increase in productivity, as they would enable more smaller software projects to be implemented than before. The demand for good software developers who can work with LLMs and know their strengths and weaknesses remains unabated.

This text was machine-translated from German.

Professor Gordon Fraser

researches software engineering

How can we find and prevent software errors?

Professor Gordon Fraser has held the Chair of Software Engineering II at the University of Passau since 2017. After completing his doctorate at Graz University of Technology, he conducted research at Saarland University and the University of Sheffield. His research and teaching focusses on issues relating to software analysis, software development and the didactics of programming.

Focus page

Interdisciplinary research on large language models at the University of Passau

Large language models have disruptive effects. Researchers at the University of Passau are investigating the technical, social, ethical and legal consequences…

The AI knowledge trap: How artificial intelligence can cause businesses to lose their knowledge

A new study shows that over time, the loss of human expertise caused by AI use can impair the quality of that very AI – in the worst case, insidiously and unnoticed.

Who bears responsibility for AI-generated child pornography?

A study by the University of Passau shows that tech companies can also be prosecuted under German law if they tolerate abuse.

Quiz show ‘5 against AI’ receives huge response on YouTube

A hundred thousand views, thousands of likes and hundreds of comments: the German quiz show in which a team of professors competes against AI is causing lively discussion on YouTube. Answers to some of the questions.

Information for...

Information for...

Current students

Prospective students

Academics

Early career researchers

Businesses

Alumni and friends

Staff

Media representatives

Faculties & facilities

Administration

Central facilities

Faculties

Faculty of Law

Faculty of Social and Educational Sciences

Faculty of Humanities and Cultural Studies

School of Business, Economics and Information Systems

Faculty of Computer Science and Mathematics