Dr. Jan Milan Deriu
Dr. Jan Milan Deriu
ZHAW
School of Engineering
Centre for Artificial Intelligence
Technikumstrasse 71
8400 Winterthur
Network
ORCID digital identifier
Social media
Projects
- Critical Science Without Borders: LLMs for Translation of Scientific Knowledge in Multilingual Contexts / Project leader / ongoing
- Unified Model for Evaluation of Text Generation Systems / Deputy project leader / ongoing
- Holistic Analysis of Organised Misinformation Activity in Social Networks / Project leader / completed
- End-to-End Low-Resource Speech Translation for Swiss German Dialects / Deputy project leader / completed
- Pre-Study on Generation of Hockey News / Deputy project leader / completed
- Call-E – Virtual Call Agent / Team member / completed
- LIHLITH – Learning to Interact with Humans by Lifelong Interaction with Humans / Team member / completed
- DeepText: Intelligent Text Analysis with Deep Learning / Deputy project leader / completed
Publications
Articles in scientific journal, peer-reviewed
- Sager, P. J., Deriu, J. M., Grewe, B. F., Stadelmann, T., & von der Malsburg, C. (2026). The cooperative network architecture : learning structured networks as representation of sensory patterns. Neural Computation. https://doi.org/10.1162/neco.a.1505
- Zhang, Y., Deriu, J. M., Katsogiannis-Meimarakis, G., Kosten, C., Koutrika, G., & Stockinger, K. (2024). ScienceBenchmark : a complex real-world benchmark for evaluating natural language to SQL systems. Proceedings of the VLDB Endowment, 17(4), 685–698. https://doi.org/10.14778/3636218.3636225
- Deriu, J. M., Rodrigo, A., Otegi, A., Echegoyen, G., Rosset, S., Agirre, E., & Cieliebak, M. (2020). Survey on evaluation methods for dialogue systems. Artificial Intelligence Review, 54(1), 755–810. https://doi.org/10.1007/s10462-020-09866-x
Written conference contributions, peer-reviewed
- Giedemann, P., von Däniken, P., Deriu, J. M., Rodrigo, A., Peñas, A., & Cieliebak, M. (2025). ViClaim : a multilingual multilabel dataset for automatic claim detection in videos [Conference paper]. In C. Christodoulopoulos, T. Chakraborty, C. Rose, & V. Peng (Eds.), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (pp. 397–413). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.emnlp-main.21
- Stucki, S., Deriu, J., & Cieliebak, M. (2025). Voice adaptation for Swiss German [Conference paper]. Proceedings Interspeech 2025, 4143–4147. https://doi.org/10.21437/interspeech.2025-432
- von Däniken, P., Deriu, J. M., & Cieliebak, M. (2025). A measure of the system dependence of automated metrics [Conference paper]. In W. Che, J. Nabende, E. Shutova, & M. T. Pilehvar (Eds.), Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) (pp. 87–99). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.acl-short.8
- Michot, J., Hürlimann, M., Deriu, J. M., Sauer, L., Mlynchyk, K., & Cieliebak, M. (2024). Error-preserving automatic speech recognition of young English learners’ language [Conference paper]. In L.-W. Ku, A. Martins, & V. Srikumar (Eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 6444–6454). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.acl-long.348
- von Däniken, P., Deriu, J. M., Tuggener, D., & Cieliebak, M. (2024). Favi-Score : a measure for favoritism in automated preference ratings for generative AI evaluation [Conference paper]. In L.-W. Ku, A. Martins, & V. Srikumar (eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 4437–4454). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.acl-long.243
- von Däniken, P., Deriu, J. M., Rodrigo, A., & Cieliebak, M. (2024). Improving quantification with minimal in-domain annotations : beyond classify and count [Conference paper]. Proceedings of the International AAAI Conference on Web and Social Media, 18(1), 1585–1598. https://doi.org/10.1609/icwsm.v18i1.31411
- Plüss, M., Deriu, J. M., Schraner, Y., Paonessa, C., Hartmann, J., Schmidt, L., Scheller, C., Hürlimann, M., Samardžic, T., Vogel, M., & Cieliebak, M. (2023). STT4SG-350 : a speech corpus for all Swiss German dialect regions [Conference paper]. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 1763–1772. https://doi.org/10.18653/v1/2023.acl-short.150
- Peñas, A., Deriu, J., Sharma, R., Valentin, G., & Reyes-Montesinos, J. (2023). Holistic analysis of organised misinformation activity in social networks [Conference paper]. In D. Ceolin, T. Caselli, & M. Tulin (Eds.), Disinformation in Open Online Media (pp. 132–143). Springer. https://doi.org/10.1007/978-3-031-47896-3_10
- Deriu, J., von Däniken, P., Tuggener, D., & Cieliebak, M. (2023). Correction of errors in preference ratings from automated metrics for text generation [Conference paper]. In A. Rogers, R. Boyd-Graber, & N. Okazaki (Eds.), Findings of the Association for Computational Linguistics: ACL 2023 (pp. 6456–6474). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.404
- von Däniken, P., Deriu, J. M., & Cieliebak, M. (2023). ZHAW-CAI at CheckThat! 2023 : ensembling using kernel averaging [Conference paper]. In M. Aliannejadi, G. Faggioli, N. Ferro, & M. Vlachos (Eds.), Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023) (pp. 534–545). CEUR Workshop Proceedings. https://doi.org/10.21256/zhaw-29046
- Luley, P.-P., Deriu, J. M., Yan, P., Schatte, G. A., & Stadelmann, T. (2023). From concept to implementation : the data-centric development process for AI in industry [Conference paper]. 2023 10th IEEE Swiss Conference on Data Science (SDS), 73–76. https://doi.org/10.1109/SDS57534.2023.00017
- Bollinger, T., Deriu, J. M., & Vogel, M. (2023, June). Text-to-speech pipeline for Swiss German : a comparison. 8th Swiss Text Analytics Conference – SwissText 2023, Neuchâtel, Switzerland, 12-14 June 2023. https://doi.org/10.48550/arXiv.2305.19750
- von Däniken, P., Deriu, J. M., Agirre, E., Brunner, U., Cieliebak, M., & Stockinger, K. (2022). Improving NL-to-Query systems through re-ranking of semantic hypothesis [Conference paper]. In M. Abbas & A. A. Freihat (Eds.), Proceedings of the 5th International Conference on Natural Language and Speech Processing (ICNLSP 2022) (pp. 57–67). Association for Computational Linguistics. https://doi.org/10.21256/zhaw-26147
- Plüss, M., Hürlimann, M., Cuny, M., Stöckli, A., Kapotis, N., Hartmann, J., Ulasik, M. A., Scheller, C., Schraner, Y., Jain, A., Deriu, J. M., Cieliebak, M., & Vogel, M. (2022). SDS-200 : a Swiss German speech to Standard German text corpus [Conference paper]. Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), 3250–3256. https://doi.org/10.21256/zhaw-26131
- Deriu, J. M., Tuggener, D., von Däniken, P., & Cieliebak, M. (2022). Probing the robustness of trained metrics for conversational dialogue systems [Conference paper]. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2, 750–761. https://doi.org/10.18653/v1/2022.acl-short.85
- Ulasik, M. A., Hürlimann, M., Dubel, B., Kaufmann, Y., Rudolf, S., Deriu, J. M., Mlynchyk, K., Hutter, H.-P., & Cieliebak, M. (2021). ZHAW-CAI : ensemble method for Swiss German speech to Standard German text [Conference paper]. In F. Benites de Azevedo e Souza, D. Tuggener, M. Hürlimann, M. Cieliebak, & M. Vogel (Eds.), Proceedings of the Swiss Text Analytics Conference 2021. CEUR Workshop Proceedings. https://doi.org/10.21256/zhaw-23889
- Tuggener, D., Mieskes, M., Deriu, J. M., & Cieliebak, M. (2021). Are we summarizing the right way? : a survey of dialogue summarization data sets [Conference paper]. Proceedings of the Third Workshop on New Frontiers in Summarization, 107–118. https://doi.org/10.21256/zhaw-23506
- Campos, J. A., Otegi, A., Soroa, A., Deriu, J. M., Cieliebak, M., & Agirre, E. (2020). DoQA : accessing domain-specific FAQs via conversational QA [Conference paper]. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 7302–7314. https://doi.org/10.18653/v1/2020.acl-main.652
- Deriu, J. M., Tuggener, D., von Däniken, P., Campos, J. A., Rodrigo, A., Belkacem, T., Soroa, A., Agirre, E., & Cieliebak, M. (2020). Spot The Bot : a robust and efficient framework for the evaluation of conversational dialogue systems [Conference paper]. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 3971–3984. https://doi.org/10.18653/v1/2020.emnlp-main.326
- Deriu, J. M., Mlynchyk, K., Schläpfer, P., Rodrigo, A., von Grünigen, D., Kaiser, N., Stockinger, K., Agirre, E., & Cieliebak, M. (2020). A methodology for creating question answering corpora using inverse data annotation [Conference paper]. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 897–911. https://doi.org/10.18653/v1/2020.acl-main.84
- Sileo, D., Pradel, C., Peñas, A., Echegoyen, G., Otegi, A., Deriu, J. M., Cieliebak, M., Barrena, A., & Agirre, E. (2019). Matching words and knowledge graph entities with meta-embeddings [Conference paper]. Proceedings of CAp2019, 34–39.
- Cieliebak, M., Galibert, O., & Deriu, J. M. (2019). Towards understanding lifelong learning for dialogue systems. IWSDS 2019 Proceedings.
- Deriu, J. M., & Cieliebak, M. (2019). Towards a metric for automated conversational dialogue system evaluation and improvement. 2th International Conference on Natural Language Generation (INLG 2019), Tokyo, Japan, October 29 - November 1, 2019. https://www.inlg2019.com/assets/papers/132_Paper.pdf
- Deriu, J. M., & Cieliebak, M. (2018). Syntactic manipulation for generating more diverse and interesting texts [Conference paper]. Proceedings of the 11th International Conference on Natural Language Generation, 22–34. https://doi.org/10.18653/v1/W18-6503
- Benites de Azevedo e Souza, F., Grubenmann, R., von Däniken, P., von Grünigen, D., Deriu, J. M., & Cieliebak, M. (2018). Twist Bytes : German dialect identification with data mining optimization [Conference paper]. Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), 218–227. https://doi.org/10.21256/zhaw-4850
- Grubenmann, R., Tuggener, D., von Däniken, P., Deriu, J. M., & Cieliebak, M. (2018). SB-CH : a Swiss German corpus with sentiment annotations. Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018.
- Cieliebak, M., Deriu, J. M., Egger, D., & Uzdilli, F. (2017). A Twitter corpus and benchmark resources for german sentiment analysis [Conference paper]. 5th International Workshop on Natural Language Processing for Social Media, Boston MA, USA, 11 December 2017, 45–51. https://doi.org/10.18653/v1/W17-1106
- Graf, H. D., Koc, Y., Panighetti, S., Togni, M., von Grünigen, D., Weilenmann, M., Xhoxhaj, E., Zürrer, D., Benites de Azevedo e Souza, F., Deriu, J. M., Neureiter, N., von Däniken, P., Cieliebak, M., Eich, W., Neuhaus, S., & Stockinger, K. (2017). Four different ways to build a chatbot about movies. SwissText 2017: 2nd Swiss Text Analytics Conference, Winterthur, 9. Juni 2017.
- von Grünigen, D., Weilenmann, M., Deriu, J. M., & Cieliebak, M. (2017). Potential and limitations of cross-domain sentiment classification [Conference paper]. Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, 17–24. https://doi.org/10.18653/v1/W17-1103
- Müller, S., Huonder, T., Deriu, J. M., & Cieliebak, M. (2017). TopicThunder at SemEval-2017 Task 4 : sentiment classification using a convolutional neural network with distant supervision [Conference paper]. Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 766–771. https://doi.org/10.21256/zhaw-1529
- Deriu, J. M., & Cieliebak, M. (2016). Sentiment analysis using convolutional neural networks with multi-task training and distant supervision on italian tweets [Conference paper]. In R. Basili & S. Montemagni (Eds.), Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016). Italian Journal of Computational Linguistics. https://doi.org/10.21256/zhaw-1527
Other publications
- Paonessa, C., Schraner, Y., Deriu, J. M., Hürlimann, M., Vogel, M., & Cieliebak, M. (2023). Dialect transfer for Swiss German speech translation. arXiv. https://doi.org/10.48550/arXiv.2310.09088
- von Däniken, P., Deriu, J. M., Tuggener, D., & Cieliebak, M. (2022). On the effectiveness of automated metrics for text generation systems [Conference paper]. Findings of the Association for Computational Linguistics: EMNLP 2022, 1503–1522. https://doi.org/10.21256/zhaw-27042
- Venzin, V., Deriu, J. M., Didier, O., & Cieliebak, M. (2019). Fact-aware abstractive text summarization using a pointer-generator network. 4th Swiss Text Analytics Conference (SwissText 2019), Winterthur, June 18-19 2019. https://doi.org/10.21256/zhaw-18988
- Deriu, J. M., Rodrigo, A., Otegi, A., Guillermo, E., Rosset, S., Agirre, E., & Cieliebak, M. (2019). Survey on evaluation methods for dialogue. ZHAW Zürcher Hochschule für Angewandte Wissenschaften. https://doi.org/10.21256/zhaw-18985
- Deriu, J. M., & Cieliebak, M. (2017). SwissAlps at SemEval-2017 Task 3 : attention-based convolutional neural network for community question answering [Conference paper]. Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), 11, 334–338. https://doi.org/10.18653/v1/S17-2054
- Deriu, J. M., Lucchi, A., De Luca, V., Severyn, A., Müller, S., Cieliebak, M., Hofmann, T., & Jaggi, M. (2017). Leveraging large amounts of weakly supervised data for multi-language sentiment classification [Conference paper]. Proceedings of the 26th International Conference on World Wide Web, 1045–1052. https://doi.org/10.1145/3038912.3052611
- Deriu, J. M., & Cieliebak, M. (2017). End-to-end trainable system for enhancing diversity in natural language generation. End-to-End Natural Language Generation Challenge (E2E NLG), 2017. https://doi.org/10.21256/zhaw-4889