Eingabe löschen

Kopfbereich

Hauptnavigation

Reliable Multi-lingual and Cross-lingual Open Data Exploration in Natural Language (MuLi)

Beschreibung

As data consolidates into relational databases,the need for text-to-SQL systems grows, democratizing access to information by allowing natural language queries. The potential for improvements brought byLarge Language Models (LLMs) in text-to-SQL systems is mostly assessed on monolingual English datasets. However, their performance and robustness with respect to other languages remain vastly unexplored. In this project, we wouldlike to concentrate on a multi-lingual and cross-lingual benchmark for evaluating text-to-SQL systems based on real-world applications. This dataset will comprise natural language/SQL-pairs over big databases with varying levels of complexity for English, German, French, and Italian. We will assess robustness of current state-of-the-art text-to-SQL models, particularly in handling diverse languages such as German, French, and Italian. We aim to address the challenges of achieving fairness in multiple languages, initiating discussions to develop more reliable text-to-SQL systems tailored for multi-lingual environments.

Eckdaten

Projektleitung

Co-Projektleitung

Projektstatus

laufend, gestartet 08/2024

Institut/Zentrum

Institut für Informatik (InIT)

Drittmittelgeber

Hasler Stiftung

Projektvolumen

48'000 CHF