Data Linking Infrastructure – Foundations and Architecture
Funded by DFG
Duration: 01.01.2019 - 31.12.2025
Principal Investigator:
- Ralf Möller (Universität Hamburg)
Research Associates:
- Thomas Asselborn, M.Sc. (Universität Hamburg)
- Dr. Marcel Gehrke, M.Sc. (Universität Hamburg)
- Dr. Sylvia Melzer, Dipl.-Ing. (University of Lübeck)
- Simon Schiff, M.Sc. (University of Lübeck)
Project Description
A data linking infrastructure is envisioned to support humanities scholars from all research fields of the Cluster of Excellence "Understanding Written Artefacts" so that various kinds of data can be combined easily and systematically to foster scientific progress. On the one hand, there are images and videos of written artefacts, in some cases associated with text data that make parts of the image (or video) content explicit, e.g., using optical character recognition techniques. On the other hand, different kinds of chemistry and materials science data are collected to further describe the written artefacts under investigation, almost always in combination with descriptive temporal and spatial data. Data of this kind must be made available to humanities scholars in a way that best supports their scientific work. Publications from humanities projects will refer to artefact data of the kind described above, so that over time artefact data are referenced in a considerable number of natural language publications, e.g., journal articles, conference papers, and PhD theses. Publications are provided as documents, which are represented, e.g., as PDF data. Further natural language data come from existing humanities research databases. All data can be described using suitable metadata formalisms (date of creation, author, etc.). In addition to, and distinct from, metadata, all kinds of base data (also called raw data) can be extended with derived data that make certain features explicit (e.g., for supporting visualization, information retrieval, or other research efforts).
Link to Project Details
https://www.csmc.uni-hamburg.de/research/cluster-projects/field-f/rff01.html
Activities
Editorial
- S. Melzer, J. Gippert, S. Thiemann, H. Peukert: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2021), CEUR Workshop Proceedings, 2022 (proceedings)
- S. Melzer, S. Thiemann, H. Peukert: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022 (proceedings)
- S. Melzer, H. Peukert, S. Thiemann: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023), CEUR Workshop Proceedings, 2023 (proceedings)
Organisation
- R. Möller, S. Melzer: Data Linking Study Day 2021, Universität Hamburg, online, 15.06.2021, Organiser
- S. Melzer: 44th German Conference on Artificial Intelligence, September 27-October 1, 2021, Berlin, Germany (KI2021), Junior Research Chair (abstracts)
- S. Melzer, S. Thiemann, J. Gippert: Humanities-Centred Artificial Intelligence (CHAI), 44th German Conference on Artificial Intelligence, September 27-October 1, 2021, Berlin, Germany (KI2021), Workshop Organiser and Chair (proceedings, abstracts)
- S. Melzer, S. Thiemann, H. Peukert: 2nd Workshop on Humanities-Centred Artificial Intelligence (CHAI), 45th German Conference on Artificial Intelligence, September 19-September 23, 2022, Trier, Germany (KI2022), Workshop Organiser
- R. Möller, S. Melzer: Doctoral Symposium, 25th International Symposium on Formal Methods (FM 2023), 06.03.2023, Lübeck, PC member and mentor
- S. Melzer, H. Hu-von Hinüber: Data Linking Workshop 2023: Computer Vision and Natural Language Processing – Challenges in the Humanities, 27-28 June 2023, Hamburg, Germany, Workshop Organiser and Chair
- S. Melzer, S. Thiemann, H. Peukert: 3rd Workshop on Humanities-Centred Artificial Intelligence (CHAI), 46th German Conference on Artificial Intelligence, September 26, 2023, Berlin, Germany (KI2023), Workshop Organiser
Publications
2023
Poster: Digital Data Handling at UWA, in Digital Total - Computing & Data Science an der Universität Hamburg und in der Wissenschaftsmetropole Hamburg, 2023.
Bibtex: | @INPROCEEDINGS {MetThMo:2023, author={Sylvia Melzer and Stefan Thiemann and Ralf Möller}, doi={}, booktitle={Digital Total - Computing & Data Science an der Universität Hamburg und in der Wissenschaftsmetropole Hamburg}, title={Poster: Digital Data Handling at UWA}, year={2023}, month={October}, volume={}, pages={}, url = {https://www.conferences.uni-hamburg.de/event/387/contributions/1502/attachments/559/1055/Poster_DDH@UWA_A0_FINAL.pdf} } |
Query Transformation for Processing Streams in Decision-making Agents, in The International FLAIRS Conference Proceedings, 2023.
DOI: | 10.32473/flairs.36.133104 |
Bibtex: | @InProceedings{schiff2023transformation, author = {Simon Schiff and Mena Leemhuis and {\"{O}}zg{\"{u}}r L{\"{u}}tf{\"{u}} {\"{O}}z{\c{c}}ep and Ralf Möller}, title = {Query Transformation for Processing Streams in Decision-making Agents}, booktitle = {The International FLAIRS Conference Proceedings}, date = {2023-05-15}, language = {en}, pubstate = {to appear}, journaltitle = {The Thirty-Six International Flairs Conference} } |
Simulation of Database Interactions for Early Validation of Digitized Enterprise Processes, Procedia Computer Science, Elsevier, vol. 219, pp. 658--665, 2023.
DOI: | https://doi.org/10.1016/j.procs.2023.01.336 |
File: | S1877050923003459 |
Bibtex: | @article{Melzer2023658, author = {Sylvia Melzer and Oliver C. Eichmann and Hongxu Wang and Ralf God}, title = {Simulation of Database Interactions for Early Validation of Digitized Enterprise Processes}, journal = {Procedia Computer Science, Elsevier}, volume = {219}, pages = {658--665}, year = {2023}, issn = {1877-0509}, doi = {https://doi.org/10.1016/j.procs.2023.01.336}, url = {https://www.sciencedirect.com/science/article/pii/S1877050923003459}, note = {CENTERIS – International Conference on ENTERprise Information Systems / ProjMAN – International Conference on Project MANagement / HCist – International Conference on Health and Social Care Information Systems and Technologies 2022}, keywords = {Entity-Relationship Modeling, Relational Databases, Enterprise Process Digitization, Model-based Systems Engineering}, abstract = {Digitized enterprise processes often encompass interaction with relational databases. Describing and simulating large-scale and complex processes on different abstraction levels lead to the use of tools and methods of Model-based Systems Engineering. In practice, current entity-relationship modeling approaches solely enable modeling relational database structure without simulation of database interactions at an early development stage. However, in general, it is known that early validation improves common understanding and communication in the development team and reduces the risk of design flaws. This paper presents an approach for model-based enterprise process digitization and a previously developed and now enhanced broker-based SysML Toolbox for integrating real relational databases into SysML simulations. The approach comprises status quo documentation concerning enterprise processes, development of digitized processes and required relational database structures as well as validation of digitized processes using the SysML Toolbox.} } |
Unsupervised Estimation of Subjective Content Descriptions, in 17th IEEE International Conference on Semantic Computing, (ICSC 2023), February 1-3, IEEE, 2023.
DOI: | 10.1109/ICSC56153.2023.00052 |
Bibtex: | @INPROCEEDINGS{BeBrMoGe, author ={Magnus Bender and Tanya Braun and Ralf M\"oller and Marcel Gehrke}, title ={Unsupervised Estimation of Subjective Content Descriptions}, booktitle ={17th {IEEE} International Conference on Semantic Computing, ({ICSC} 2023), February 1-3}, year ={2023}, pages = {}, publisher = {{IEEE}}, doi = {https://dx.doi.org/10.1109/ICSC56153.2023.00052}, keywords ={Subjective Content Descriptions; Text Mining;Text Annotation;Sentence clustering}, } |
2022
Simulation of Ordering Processes across different Supply Chain Tiers in the Aviation Industry, in 2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022), Montreal, Canada, 2022.
Lifting in Multi-agent Systems under Uncertainty, in 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), 2022. pp. 233--243.
File: | braun22a.html |
Bibtex: | @inproceedings{BraGeLaMo22, author ={Tanya Braun and Marcel Gehrke and Florian Lau and Ralf M\"oller}, title ={Lifting in Multi-agent Systems under Uncertainty}, booktitle ={38th Conference on Uncertainty in Artificial Intelligence (UAI 2022)}, year ={2022}, pages = {233--243}, url = {https://proceedings.mlr.press/v180/braun22a.html} } |
Towards a Model-based and Variant-oriented Development of a System of Systems, Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 3, pp. 19--31, 2022.
On the Awakening of the Buddhological Epigraphy and Philology from the AI, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022. pp. 38-45.
TEI-based Interactive Critical Editions, in 15th IAPR International Workshop on Document Analysis Systems, Springer, 2022.
Introduction to the Second Workshop on Humanities-Centred Artificial Intelligence, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), Sylvia Melzer, Hagen Peukert, and Stefan Thiemann, Eds. CEUR Workshop Proceedings, 2022. pp. 1-3.
Bibtex: | @inproceedings{melzer2022introduction, title = "Introduction to the Second Workshop on Humanities-Centred Artificial Intelligence", author = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann", year = "2022", booktitle = "Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022)", editor = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann", publisher = "CEUR Workshop Proceedings", volume = "3301", pages = "1-3", url = "https://ceur-ws.org/Vol-3301/preface.pdf" } |
Embodiment of an Agent by a Pepper Robot for Explaining Retrieval Results, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022. pp. 29--37.
Bibtex: | @InProceedings{schiff2022embodiment, author = {Simon Schiff and Magnus Bender and Ralf Möller}, booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022)}, date = {2022-09-19}, title = {Embodiment of an Agent by a Pepper Robot for Explaining Retrieval Results}, pages = {29--37}, publisher = {CEUR Workshop Proceedings}, url = {https://ceur-ws.org/Vol-3301/paper4.pdf}, } |
Databasing on demand for research data repositories explained with a large epidoc dataset, Book of Industry Papers, Poster Papers and Abstracts of the CENTERIS 2022 - Conference on ENTERprise Information Systems / ProjMAN 2022 - International Conference on Project MANagement / HCist 2022 - International Conference on Health and Social Care Information Systems and Technologies, pp. 150--153, 2022. SciKA, Portugal.
ISBN: | 978-989-54617-4-5 |
Bibtex: | @article{melzer2022databasing, author = {Melzer, S. and Schiff, S. and Weise, F. and Harter, K. and Möller, R.}, title = {Databasing on demand for research data repositories explained with a large epidoc dataset}, journal = {Book of Industry Papers, Poster Papers and Abstracts of the CENTERIS 2022 - Conference on ENTERprise Information Systems / ProjMAN 2022 - International Conference on Project MANagement / HCist 2022 - International Conference on Health and Social Care Information Systems and Technologies (eds.: Cruz-Cunha, M. M., Martinho, R., Rijo, R., Domongos, D., Peres, E.)}, pages = {150--153}, year = {2022}, url = {https://www.scika.org/centeris/2022/CONTENTS/downloads/boa2022.pdf}, note= {SciKA} } |
Model-based Development of a Federated Database Infrastructure to support the Usability of Cross-Domain Information Systems, in 2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022), Montreal, Canada, 2022.
Explainable and Explorable Decision Support, in Proceedings of the 27th International Conference on Conceptual Structures (ICCS 2022), 2022.
DOI: | https://dx.doi.org/10.1007/978-3-031-16663-1_8 |
File: | 978-3-031-16663-1_8 |
Bibtex: | @inproceedings{BraGe22, author ={Tanya Braun and Marcel Gehrke}, title ={Explainable and Explorable Decision Support}, booktitle ={Proceedings of the 27th International Conference on Conceptual Structures (ICCS 2022)}, year ={2022}, url = {https://link.springer.com/chapter/10.1007/978-3-031-16663-1_8}, doi = {https://dx.doi.org/10.1007/978-3-031-16663-1_8} } |
2021
Complementary Document Representations for Information Retrieval, in 34th International FLAIRS Conference (FLAIRS-34), North Miami Beach, Florida, USA, May 17-19, 2021.
DOI: | https://doi.org/10.25592/uhhfdm.9569 |
Bibtex: | @inproceedings{sylvia_melzer_2021_9569, author = {Sylvia Melzer and Simon Schiff and Ralf Möller}, title = {{Complementary Document Representations for Information Retrieval}}, booktitle = {34th International FLAIRS Conference (FLAIRS-34), North Miami Beach, Florida, USA, May 17-19}, year = {2021}, pages = {}, publisher = {}, doi = {https://doi.org/10.25592/uhhfdm.9569} } |
ICCS-21 Proceedings of the 26th International Conference on Conceptual Structures, Springer, 2021.
DOI: | https://doi.org/10.1007/978-3-030-86982-3 |
Bibtex: | @book{BraGeHaHe21, author = {Tanya Braun and Marcel Gehrke and Tom Hanika and Nathalie Hernandez (Eds.)}, title = {ICCS-21 Proceedings of the 26th International Conference on Conceptual Structures}, year = {2021}, publisher = {Springer}, doi = {https://doi.org/10.1007/978-3-030-86982-3} } |
Identifying and Translating Subjective Content Descriptions Among Texts, International Journal of Semantic Computing, vol. 15, no. 4, pp. 461--485, 2021.
DOI: | 10.1142/S1793351X21400122 |
Bibtex: | @article{BenBrGeKuScMo21b, author={Magnus Bender and Tanya Braun and Marcel Gehrke and Felix Kuhr and Ralf M\"oller and Simon Schiff}, title={Identifying and Translating Subjective Content Descriptions Among Texts}, journal = {International Journal of Semantic Computing}, volume= {15}, number={4}, pages= {461--485}, year={2021}, doi = {https://dx.doi.org/10.1142/S1793351X21400122} } |
Identifying Subjective Content Descriptions Among Texts, in 15th IEEE International Conference on Semantic Computing, (ICSC 2021), Laguna Hills, CA, USA, January 27-29, IEEE, 2021. pp. 9--16.
DOI: | 10.1109/ICSC50631.2021.00008 |
Bibtex: | @inproceedings{BenBrGeKuScMo21b, author={Magnus Bender and Tanya Braun and Marcel Gehrke and Felix Kuhr and Ralf M\"oller and Simon Schiff}, title={{Identifying Subjective Content Descriptions Among Texts}}, booktitle = {15th {IEEE} International Conference on Semantic Computing, (ICSC 2021), Laguna Hills, CA, USA, January 27-29}, year={2021}, pages= {9--16}, publisher = {IEEE}, doi = {https://doi.org/10.1109/ICSC50631.2021.00008}, keywords={Subjective Content Descriptions; Text Mining} } |
Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox, in IEEE International Systems Conference (SysCon 2021), Vancouver, BC, Canada, April 15 - May 15, 2021, IEEE, 2021. pp. 1--6.
DOI: | https://doi.org/10.1109/SysCon48628.2021.9447055 |
Bibtex: | @inproceedings{MelzerTM21, author = {Sylvia Melzer and Stefan Thiemann and Ralf Möller}, title = {Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox}, booktitle = {{IEEE} International Systems Conference (SysCon 2021), Vancouver, BC, Canada, April 15 - May 15, 2021}, year = {2021}, pages = {1--6}, publisher = {IEEE}, doi = {https://doi.org/10.1109/SysCon48628.2021.9447055} } |
Modeling and Simulation of Database Interactions, 2021. GfSE.
DOI: | 10.25592/uhhfdm.9696 |
Bibtex: | @misc{sylvia_melzer_2021_9696, author = {Sylvia Melzer and Oliver C. Eichmann and Hongxu Wang and Ralf God}, title = {Modeling and Simulation of Database Interactions}, month = {November}, year = {2021}, publisher = {GfSE}, doi = {10.25592/uhhfdm.9696} } |
On Human-Aware Information Seeking, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2021), CEUR Workshop Proceedings, 2021. pp. 31--39.
On the Complexity and Completeness of the Lifted Dynamic Junction Tree Algorithm, in 10th International Workshop on Statistical Relational AI at the 1st International Joint Conference on Learning and Reasoning, 2021.
File: | 2110.09197 |
Bibtex: | @inproceedings{Geh21, author = {Marcel Gehrke}, title = {{On the Complexity and Completeness of the Lifted Dynamic Junction Tree Algorithm}}, booktitle = {10th International Workshop on Statistical Relational AI at the 1st International Joint Conference on Learning and Reasoning}, year = {2021}, url = {https://arxiv.org/abs/2110.09197} } |
Taming Exact Inference in Temporal Probabilistic Relational Models, University of Lübeck, 2021.
2020
Restricting the Maximum Number of Actions for Decision Support under Uncertainty, in Proceedings of the 25th International Conference on Conceptual Structures (ICCS 2020), 2020.
DOI: | https://doi.org/10.1007/978-3-030-57855-8_11 |
Bibtex: | @inproceedings{GehBrPo20, author = {Marcel Gehrke and Tanya Braun and Simon Polovina}, title = {{Restricting the Maximum Number of Actions for Decision Support under Uncertainty}}, booktitle = {Proceedings of the 25th International Conference on Conceptual Structures (ICCS 2020)}, year = {2020}, doi = {https://doi.org/10.1007/978-3-030-57855-8_11} } |
Lifted Marginal Filtering for Asymmetric Models by Clustering-based Merging, in Proceedings of the 24th European Conference on Artificial Intelligence (ECAI 2020), IOS Press, 2020. pp. 2608--2615.
DOI: | https://doi.org/10.3233/FAIA200397 |
Bibtex: | @inproceedings{LueGeBrMoKi20, author = {Stefan L\"udtke and Marcel Gehrke and Tanya Braun and Ralf M\"oller and Thomas Kirste}, title = {{Lifted Marginal Filtering for Asymmetric Models by Clustering-based Merging}}, booktitle = {Proceedings of the 24th European Conference on Artificial Intelligence (ECAI 2020)}, year = {2020}, doi = {https://doi.org/10.3233/FAIA200397} } |