Intelligent Information Systems
We Derive Value from Data and Information
- How to leverage information?
- How to find new topics and trends?
- How to derive insight from heterogeneous/unstructured data and information?
- How to allow a «natural» access to data?
- How can software link data automatically?
These are but a few of the questions that the Intelligent Information Systems (IIS) group of the InIT is working to answer. While the “data and information flood” is often discussed negatively, we see a great opportunity to leverage data and information using the right approaches – both at search-time, as well as during analysis.
The research group transfers insights derived from research and development into teaching for students of the computer science curricula. It offers modules such as “Information Engineering 1 (Information Retrieval)”, “Information Engineering 2 (Data Warehousing & Big Data)” and "Databases". The group is active in both national and international research projects of the EU framework programs.
Research Topics
The Intelligent Information Systems group develops solutions for a changing, data-driven world. It performs research at the intersection of databases (DB), information retrieval (IR), data engineering (DE), natural language processing (NLP) and machine learning (ML)
The group covers two main research lines:
Big Data and Nano Data
We solve challenging problems when working with a range of datasets from very small (nano data) to very large (big data), where the nature of the problems change drastically as we work on different scales:
Current research:
- Information retrieval for small document collections
- Machine learning for query optimization
- Artificial intelligence for data integration and cleaning
- Quantum databases and quantum machine learning
Data Understanding
As we strive for "intelligent" solutions to data-driven problems, classical information systems need to process data at a different level, interpreting it to gain important information. Both structured and unstructured data must be processed not on a mechanical, but on a semantic level - e.g. by using natural language processing and understanding. Data is ultimately connected through graph structures or made accessible via semantic search.
Current research:
- Natural language interfaces for databases
- Semantic search on entities
- Knowledge graph construction
- Question answering over knowledge graphs
- Stream analytics and event detection
- Information retrieval evaluation
-
GraphQueryML – Using Machine Learning to Optimize Queries in Graph Databases (SNSF/DFG)
Optimizing the brain of databases with machine learning:Query optimization is one of the hardest problems of database systems research. A query optimizer can be considered as the “brain” of the system that makes sure that queries are executed efficiently. Even after several decades of research, many sub-problems of ...
-
DOSSMA – Detection of Suspicious Social Media Activities
The DOSSMA project will investigate suspicious and malicious behaviour on social media platforms. In a first phase, we will compile an extensive survey report on the areas that are currently being researched, including the respective state-of-the-art, existing solutions and initiatives. This report will serve as a ...
-
Accessible Scientific PDFs for All
PDF is the most popular document format to provide and distribute information on the internet. It was developed by Adobe 1996 but has been an open format since 2008. It was estimated in 2015 that more than 2.5 trillion PDF documents exist on the internet, covering all aspects of life and research, and their number ...
-
Schmitt-Koopmann, Felix; Huang, Elaine M.; Hutter, Hans-Peter; Stadelmann, Thilo; Darvishy, Alireza,
2024.
MathNet : a data-centric approach for printed mathematical expression recognition.
IEEE Access.
12, pp. 76963-76974.
Available from: https://doi.org/10.1109/ACCESS.2024.3404834
-
Gerber, Jonathan; Saxer, Jasmin S.; B. Kreiner, Bruno; Weiler, Andreas,
2024.
DIGILOG : towards a monitoring platform for digital transformation of European communities.
In:
Joint Proceedings of RCIS 2024 Workshops and Research Projects Track.
18th International Conference on Research Challenges in Information Science (RCIS), Guimarães, Portugal, 14-17 May 2024.
RWTH Aachen University.
Available from: https://doi.org/10.21256/zhaw-30792
-
Chen, Yaxuan; Vergara, Ana Fernandez; Hamilton, Angus; Stockinger, Kurt,
2024.
Digital public infrastructure for environmental sustainability.
United Nations Environment Programme.
ISBN 978-92-807-4157-5.
Available from: https://doi.org/10.21256/zhaw-30874
-
Appel, Jan; Weiler, Andreas,
2024.
XCrowd : a realistic crowd simulation tool for efficient movement management.
In:
Proceedings of the Workshops of the EDBT/ICDT 2024 Joint Conference co-located with the EDBT/ICDT 2024 Joint Conference.
6th International Workshop on Big Mobility Data Analytics (BMDA) during EDBT/ICDT Joint Conference, Paestum, Italy, 25 March - 28 March 2024.
Aachen:
RWTH Aachen University.
Available from: https://doi.org/10.21256/zhaw-30720
-
Smith, Ellery; Paloots, Rahel; Giagkos, Dimitris; Baudis, Michael; Stockinger, Kurt,
2024.
Data-driven information extraction and enrichment of molecular profiling data for cancer cell lines.
Bioinformatics Advances.
4(1), pp. vbae045.
Available from: https://doi.org/10.1093/bioadv/vbae045