Improving the Sample Efficiency of Pre-training Language Models
by Gábor Berend (University of Szeged)
The use of transformer-based pre-trained language models (PLMs) arguably dominates the natural language processing (NLP) landscape. The pre-training of such models, however, is notoriously data- and resource-hungry, which hinders their creation in low-resource settings and makes it a privilege of the few (mostly corporate) actors who have access to sufficient computational resources and/or pre-training data. The main goal of our research is to develop a novel, sample-efficient pre-training paradigm for PLMs that makes them usable in the low-data and/or low-compute regime, helping to democratise this disruptive technology beyond the current status quo.