Data Linking Infrastructure – Foundations and Architecture

Funded by DFG

Duration: 01.01.2019 - 31.12.2025

Principal Investigator: 

Research Associates: 

Project Description

A data linking infrastructure is envisioned to support humanities scholars from all research fields of the Cluster of Excellence "Understanding Written Artefacts" so that various kinds of data can be combined easily and systematically to foster scientific progress. On the one hand, there are images and videos of written artefacts, in some cases associated with text data that make parts of the image (or video) content explicit, e.g., obtained using optical character recognition techniques. On the other hand, different kinds of chemistry and materials science data are collected to further describe the written artefacts under investigation, almost always in combination with descriptive temporal and spatial data. Data of this kind must be made available to humanities scholars in a way that best supports their scientific work. Publications from humanities projects will refer to artefact data of the kind described above, so that over time artefact data are referenced in a considerable number of natural language publications resulting from scientific work in humanities projects, e.g., journal articles, conference papers, and PhD theses. Publications are provided as documents, which are represented, e.g., as PDF data. Further natural language data come from existing humanities research databases. All data can be described using suitable metadata formalisms (date of creation, author, etc.). In addition, and in contrast to metadata, all kinds of base data (also called raw data) may be extended with derived data that make certain features explicit (e.g., to support visualization, information retrieval, or other research efforts).
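The distinction between base data, metadata, derived data, and the publications that reference artefact data can be illustrated with a minimal sketch. The class names, field names, and example records below are assumptions made purely for illustration; they do not describe the project's actual data model or implementation.

```python
from dataclasses import dataclass, field


@dataclass
class BaseData:
    """A raw data item, e.g. an image, a video, or a materials science measurement."""
    identifier: str
    kind: str  # illustrative labels such as "image", "video", "spectrum"
    metadata: dict = field(default_factory=dict)  # e.g. author, date of creation
    derived: dict = field(default_factory=dict)   # features made explicit, e.g. OCR text


@dataclass
class Publication:
    """A natural language document (e.g. a PDF) that refers to artefact data."""
    identifier: str
    title: str
    references: list = field(default_factory=list)  # identifiers of referenced BaseData


def publications_citing(data_id, publications):
    """Return the titles of all publications that reference the given data item."""
    return [p.title for p in publications if data_id in p.references]


# Hypothetical example records; none of these identifiers come from the project.
scan = BaseData("img-001", "image", metadata={"author": "unknown", "created": "2020-05-01"})
scan.derived["ocr_text"] = "transcribed text would go here"

papers = [
    Publication("pub-1", "A study of artefact img-001", references=["img-001"]),
    Publication("pub-2", "An unrelated study", references=["img-002"]),
]

print(publications_citing("img-001", papers))
```

In this sketch, metadata and derived data are kept in separate fields of the same record, mirroring the statement that derived data are different from metadata, and a link is simply an identifier stored in a publication's reference list.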

Link to Project Details

https://www.csmc.uni-hamburg.de/research/cluster-projects/field-f/rff01.html

Activities

Editorial

  • S. Melzer, J. Gippert, S. Thiemann, H. Peukert: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2021), CEUR Workshop Proceedings, 2022 (proceedings)
  • S. Melzer, S. Thiemann, H. Peukert: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022 (proceedings)
  • S. Melzer, H. Peukert, S. Thiemann: Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023), CEUR Workshop Proceedings, 2023 (proceedings)

Organisation

Publications

2023

Sylvia Melzer, Stefan Thiemann, and Ralf Möller,
Poster: Digital Data Handling at UWA, in Digital Total - Computing & Data Science an der Universität Hamburg und in der Wissenschaftsmetropole Hamburg, 2023.
File: Poster_DDH@UWA_A0_FINAL.pdf
Bibtex: BibTeX
@INPROCEEDINGS {MetThMo:2023,
author={Sylvia Melzer and Stefan Thiemann and Ralf Möller},
doi={},
booktitle={Digital Total - Computing \& Data Science an der Universität Hamburg und in der Wissenschaftsmetropole Hamburg},
title={Poster: Digital Data Handling at UWA},
year={2023},
month={October},
volume={},
pages={},
url = {https://www.conferences.uni-hamburg.de/event/387/contributions/1502/attachments/559/1055/Poster_DDH@UWA_A0_FINAL.pdf} 
}
Florian Andreas Marwitz, Ralf Möller, and Marcel Gehrke,
PETS: Predicting Efficiently using Temporal Symmetries in Temporal PGMs, in Proceedings of the Seventeenth European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU-23), Springer, 2023.
Bibtex: BibTeX
@inproceedings{MaMoGe23,
    author    = {Florian Andreas Marwitz and Ralf M\"oller and Marcel Gehrke},
    title     = {{PETS: Predicting Efficiently using Temporal Symmetries in Temporal PGMs}},
    booktitle = {Proceedings of the Seventeenth European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU-23)},
    year      = {2023},
    pages     = {},
    publisher = {Springer},
}
Nadja Redzuan, Marcel Gehrke, Ralf Möller, and Tanya Braun,
On Domain-specific Topic Modelling Using the Case of a Humanities Journal, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023), CEUR Workshop Proceedings, 2023.
Bibtex: BibTeX
@InProceedings{ReGeMoBr,
  author    = {Nadja Redzuan and Marcel Gehrke and Ralf M\"oller and Tanya Braun},
  booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023)},
  date      = {2023-09-26},
  title     = {On Domain-specific Topic Modelling Using the Case of a Humanities Journal},
  pages     = {},
  publisher = {CEUR Workshop Proceedings},
  url = {},
}
Magnus Bender, Tanya Braun, Ralf Möller, and Marcel Gehrke,
LESS is More: LEan Computing for Selective Summaries, in KI 2023: Advances in Artificial Intelligence, Springer Nature Switzerland, 2023. pp. 1–14.
DOI:10.1007/978-3-031-42608-7_1
Bibtex: BibTeX
@InProceedings{BeBrMoGe23c,
author={Magnus Bender and Tanya Braun and Ralf M\"oller and Marcel Gehrke},
title={LESS is More: LEan Computing for Selective Summaries},
booktitle= {KI 2023: Advances in Artificial Intelligence},
publisher= {Springer Nature Switzerland},
year={2023},
doi ={10.1007/978-3-031-42608-7_1},
pages={1--14},
}
Sylvia Melzer, Hagen Peukert, and Stefan Thiemann,
Introduction to the Third Workshop on Humanities-Centred Artificial Intelligence, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023), Sylvia Melzer, Hagen Peukert, and Stefan Thiemann, Eds. CEUR Workshop Proceedings, 2023. pp. 1–3.
File: preface.pdf
Bibtex: BibTeX
@inproceedings{melzer2023introduction,
  title        = "Introduction to the Third Workshop on Humanities-Centred Artificial Intelligence",
  author       = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann",
  year         = "2023",
  booktitle    = "Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023)",
  editor       = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann",
  publisher    = "CEUR Workshop Proceedings",
  volume       = "3580",
  pages        = "1-3",
  url          = "https://ceur-ws.org/Vol-3580/preface.pdf"
}
Sylvia Melzer, Stefan Thiemann, Simon Schiff, and Ralf Möller,
Implementation of a Federated Information System by means of Reuse of Research Data archived in Research Data Repositories, Data Science Journal, 2023.
DOI:10.5334/dsj-2023-039
File: dsj-2023-039
Bibtex: BibTeX
@article{MeThScMo:2023,
  author  = {Sylvia Melzer and Stefan Thiemann and Simon Schiff and Ralf Möller},
  title   = {Implementation of a Federated Information System by means of Reuse of Research Data archived in Research Data Repositories},
  journal = {Data Science Journal},
  year    = 2023,
  pages   = {},
doi = {10.5334/dsj-2023-039},
url = {https://doi.org/10.5334/dsj-2023-039}
}
Magnus Bender, Kira Schwandt, Ralf Möller, and Marcel Gehrke,
FrESH – Feedback-reliant Enhancement of Subjective Content Descriptions by Humans, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023), CEUR Workshop Proceedings, 2023. pp. 15–24.
File: paper3.pdf
Bibtex: BibTeX
@InProceedings{BeSchMoGe,
  author    = {Magnus Bender and Kira Schwandt and Ralf M\"oller and Marcel Gehrke},
  booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2023)},
  date      = {2023-09-26},
  title     = {FrESH – Feedback-reliant Enhancement of Subjective Content Descriptions by Humans},
  pages     = {15--24},
  publisher = {CEUR Workshop Proceedings},
  url ={https://ceur-ws.org/Vol-3580/paper3.pdf},
}

2022

Sylvia Melzer, Stefan Thiemann, Hagen Peukert, and Ralf Möller,
Towards a Model-based and Variant-oriented Development of a System of Systems, Advances in Science, Technology and Engineering Systems Journal, vol. 7, no. 3, pp. 19–31, 2022.
Bibtex: BibTeX
@article{Melzer2022,
author = {Melzer, Sylvia and Thiemann, Stefan and Peukert, Hagen and Möller, Ralf},
title = {{Towards a Model-based and Variant-oriented Development of a System of Systems}},
journal = {Advances in Science, Technology and Engineering Systems Journal},
volume = {7},
number = {3},
pages = {19--31},
year = {2022},
url = {https://www.astesj.com/v07/i03/p03/}
}
Simon Schiff, Sylvia Melzer, Eva Wilden, and Ralf Möller,
TEI-based Interactive Critical Editions, in 15th IAPR International Workshop on Document Analysis Systems, Springer, 2022.
Bibtex: BibTeX
@inproceedings{SchMeMo2022,
author = {Simon Schiff and Sylvia Melzer and Eva Wilden and Ralf Möller}, 
title = {{TEI-based Interactive Critical Editions}},
booktitle = {15th IAPR International Workshop on Document Analysis Systems},
series={Lecture Notes in Computer Science (LNCS)},
volume={},
year = {2022},
month = {May},
publisher={Springer},
pages={},
doi={}
}
Hongxu Wang and Sylvia Melzer,
Simulation of Ordering Processes across different Supply Chain Tiers in the Aviation Industry, in 2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022), Montreal, Canada, 2022.
Bibtex: BibTeX
@INPROCEEDINGS{Melz2204:Simulation,
AUTHOR={Hongxu Wang and Sylvia Melzer},
TITLE={{Simulation of Ordering Processes across different Supply Chain Tiers in the Aviation Industry}},
BOOKTITLE={2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022)},
ADDRESS={Montreal, Canada},
YEAR={2022},
MONTH={25. April},
KEYWORDS={simulation; aircraft supply chain; aviation; federation; database system},
ABSTRACT={There is increasing concern that previous system development, characterized by the design of an isolated system with a limited number of interfaces to other systems, will be disadvantaged and have a detrimental effect on competition because recent rapid developments in the global supply chain have increased the need for digital services built on multiple interacting
systems and their communication. Platforms are one of the most widely used
services for sharing information by integrating multiple interacting
systems into a network. The current platform integrates these systems with
tight coupling to meet a new level of customer requirements.

With the rapid increase in the information traffic along the global supply
chain, many companies in the aviation industry have been confronted with
these issues of processing and storing data within and across the
enterprise. More and more companies are adopting an Enterprise Resource
Planning (ERP) system, which consists of a set of fully integrated modules
to support a company's business processes that run from a single database.
Nowadays, the database structure in the ERP system is customized for each
company by external vendors. The aviation industry is characterized by a
large network and long supply chains with the few Original Equipment
Manufacturers (OEMs) and many suppliers (Tier 1, Tier 2, Tier 3).
Individually created databases with own terms within the company make an
inter-company exchange only possible with high effort.

In this paper, we show how to model and simulate ordering processes across
different supply chain tiers in the aviation industry rapidly to validate
the use cases concerning the communication structure during a
requirements-based engineering process.}
}
Tanya Braun, Marcel Gehrke, Florian Lau, and Ralf Möller,
Lifting in Multi-agent Systems under Uncertainty, in 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022), 2022. pp. 233–243.
File: braun22a.html
Bibtex: BibTeX
@inproceedings{BraGeLaMo22,
author ={Tanya Braun and Marcel Gehrke and Florian Lau and Ralf M\"oller},
title ={Lifting in Multi-agent Systems under Uncertainty},
booktitle ={38th Conference on Uncertainty in Artificial Intelligence (UAI 2022)},
year ={2022},
pages = {233--243},
url = {https://proceedings.mlr.press/v180/braun22a.html}
}
Haiyan Hu-von Hinüber and Sylvia Melzer,
On the Awakening of the Buddhological Epigraphy and Philology from the AI, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022. pp. 38–45.
Bibtex: BibTeX
@InProceedings{hinuber2022awakening,
  author    = {Haiyan Hu-von Hinüber and Sylvia Melzer},
  booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022)},
  date      = {2022-09-19},
  title     = {On the Awakening of the Buddhological Epigraphy and Philology from the AI},
  pages     = {38-45},
  publisher = {CEUR Workshop Proceedings},
}
Sylvia Melzer, Hagen Peukert, and Stefan Thiemann,
Introduction to the Second Workshop on Humanities-Centred Artificial Intelligence, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), Sylvia Melzer, Hagen Peukert, and Stefan Thiemann, Eds. CEUR Workshop Proceedings, 2022. pp. 1–3.
File: preface.pdf
Bibtex: BibTeX
@inproceedings{melzer2022introduction,
  title        = "Introduction to the Second Workshop on Humanities-Centred Artificial Intelligence",
  author       = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann",
  year         = "2022",
  booktitle    = "Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022)",
  editor       = "Sylvia Melzer and Hagen Peukert and Stefan Thiemann",
  publisher    = "CEUR Workshop Proceedings",
  volume       = "3301",
  pages        = "1-3",
  url          = "https://ceur-ws.org/Vol-3301/preface.pdf"
}
Simon Schiff, Magnus Bender, and Ralf Möller,
Embodiment of an Agent by a Pepper Robot for Explaining Retrieval Results, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022), CEUR Workshop Proceedings, 2022. pp. 29–37.
File: paper4.pdf
Bibtex: BibTeX
@InProceedings{schiff2022embodiment,
  author    = {Simon Schiff and Magnus Bender and Ralf Möller},
  booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2022)},
  date      = {2022-09-19},
  title     = {Embodiment of an Agent by a Pepper Robot for Explaining Retrieval Results},
  pages     = {29--37},
  publisher = {CEUR Workshop Proceedings},
  url = {https://ceur-ws.org/Vol-3301/paper4.pdf},
}
Sylvia Melzer, S. Schiff, F. Weise, K. Harter, and R. Möller,
Databasing on demand for research data repositories explained with a large epidoc dataset, Book of Industry Papers, Poster Papers and Abstracts of the CENTERIS 2022 - Conference on ENTERprise Information Systems / ProjMAN 2022 - International Conference on Project MANagement / HCist 2022 - International Conference on Health and Social Care Information Systems and Technologies, pp. 150–153, 2022. SciKA, Portugal.
ISBN:978-989-54617-4-5
File: boa2022.pdf
Bibtex: BibTeX
@article{melzer2022databasing,
author = {Melzer, S. and Schiff, S. and Weise, F. and Harter, K. and Möller, R.},
title = {Databasing on demand for research data repositories explained with a large epidoc dataset},
journal = {Book of Industry Papers, Poster Papers and Abstracts of the CENTERIS 2022 - Conference on ENTERprise Information Systems / ProjMAN 2022 - International Conference on Project MANagement / HCist 2022 - International Conference on Health and Social Care Information Systems and Technologies (eds.: Cruz-Cunha, M. M., Martinho, R., Rijo, R., Domingos, D., Peres, E.)},
pages = {150--153},
year = {2022},
url = {https://www.scika.org/centeris/2022/CONTENTS/downloads/boa2022.pdf},
note= {SciKA}
}
Sylvia Melzer, Hagen Peukert, Hongxu Wang, and Stefan Thiemann,
Model-based Development of a Federated Database Infrastructure to support the Usability of Cross-Domain Information Systems, in 2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022), Montreal, Canada, 2022.
Bibtex: BibTeX
@INPROCEEDINGS{Melz2204:Model,
AUTHOR={Sylvia Melzer and Hagen Peukert and Hongxu Wang and Stefan Thiemann},
TITLE={{Model-based Development of a Federated Database Infrastructure to support the Usability of {Cross-Domain} Information Systems}},
BOOKTITLE={2022 IEEE International Systems Conference (SysCon) (IEEE SysCon 2022)},
ADDRESS={Montreal, Canada},
YEAR={2022},
MONTH={April},
DAYS={25},
PUBLISHER={},
PAGES={},
URL={},
KEYWORDS={SysML; federated search; schema mapping; cross-domain information system; V-model; VAMOS}
}
Tanya Braun and Marcel Gehrke,
Explainable and Explorable Decision Support, in Proceedings of the 27th International Conference on Conceptual Structures (ICCS 2022), 2022.
DOI:10.1007/978-3-031-16663-1_8
File: 978-3-031-16663-1_8
Bibtex: BibTeX
@inproceedings{BraGe22,
author ={Tanya Braun and Marcel Gehrke},
title ={Explainable and Explorable Decision Support},
booktitle ={Proceedings of the 27th International Conference on Conceptual Structures (ICCS 2022)},
year ={2022},
url = {https://link.springer.com/chapter/10.1007/978-3-031-16663-1_8},
doi = {10.1007/978-3-031-16663-1_8}
}

2021

Sylvia Melzer, Simon Schiff, and Ralf Möller,
Complementary Document Representations for Information Retrieval, in 34th International FLAIRS Conference (FLAIRS-34), North Miami Beach, Florida, USA, May 17-19, 2021.
DOI:10.25592/uhhfdm.9569
Bibtex: BibTeX
@inproceedings{sylvia_melzer_2021_9569,
  author    = {Sylvia Melzer and Simon Schiff and Ralf Möller},
  title     = {{Complementary Document Representations for Information Retrieval}}, 
  booktitle = {34th International FLAIRS Conference (FLAIRS-34), North Miami Beach, Florida, USA, May 17-19},
  year      = {2021},
  pages     = {},
  publisher = {},
  doi       = {10.25592/uhhfdm.9569}
}
Tanya Braun, Marcel Gehrke, Tom Hanika, and Nathalie Hernandez (Eds.),
ICCS-21 Proceedings of the 26th International Conference on Conceptual Structures. Springer, 2021.
DOI:10.1007/978-3-030-86982-3
Bibtex: BibTeX
@book{BraGeHaHe21, 
  editor = {Tanya Braun and Marcel Gehrke and Tom Hanika and Nathalie Hernandez},
  title = {ICCS-21 Proceedings of the 26th International Conference on Conceptual Structures},
  year = {2021},
  publisher = {Springer},
  doi = {10.1007/978-3-030-86982-3}
}
Magnus Bender, Tanya Braun, Marcel Gehrke, Felix Kuhr, Ralf Möller, and Simon Schiff,
Identifying and Translating Subjective Content Descriptions Among Texts, International Journal of Semantic Computing, vol. 15, no. 4, pp. 461–485, 2021.
DOI:10.1142/S1793351X21400122
Bibtex: BibTeX
@article{BenBrGeKuScMo21b,
author={Magnus Bender and Tanya Braun and Marcel Gehrke and Felix Kuhr and Ralf M\"oller and Simon Schiff},
title={Identifying and Translating Subjective Content Descriptions Among Texts},
journal = {International Journal of Semantic Computing},
volume= {15},
number={4},
pages= {461--485},
year={2021},
doi  = {10.1142/S1793351X21400122}
}
Magnus Bender, Tanya Braun, Marcel Gehrke, Felix Kuhr, Ralf Möller, and Simon Schiff,
Identifying Subjective Content Descriptions Among Texts, in 15th IEEE International Conference on Semantic Computing (ICSC 2021), Laguna Hills, CA, USA, January 27-29, IEEE, 2021. pp. 9–16.
DOI:10.1109/ICSC50631.2021.00008
Bibtex: BibTeX
@inproceedings{BenBrGeKuScMo21b,
author={Magnus Bender and Tanya Braun and Marcel Gehrke and Felix Kuhr and Ralf M\"oller and Simon Schiff},
title={{Identifying Subjective Content Descriptions Among Texts}},
booktitle = {15th {IEEE} International Conference on Semantic Computing, (ICSC 2021), Laguna Hills, CA, USA, January 27-29},
year={2021},
pages= {9--16},
publisher = {IEEE},
doi  = {10.1109/ICSC50631.2021.00008},
keywords={Subjective Content Descriptions; Text Mining}
}
Sylvia Melzer, Stefan Thiemann, and Ralf Möller,
Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox, in IEEE International Systems Conference (SysCon 2021), Vancouver, BC, Canada, April 15 - May 15, 2021, IEEE, 2021. pp. 1–6.
DOI:10.1109/SysCon48628.2021.9447055
Bibtex: BibTeX
@inproceedings{MelzerTM21,
  author    = {Sylvia Melzer and Stefan Thiemann and Ralf Möller},
  title     = {Modeling and Simulating Federated Databases for early Validation of Federated Searches using the Broker-based SysML Toolbox},
  booktitle = {{IEEE} International Systems Conference (SysCon 2021), Vancouver, BC, Canada, April 15 - May 15, 2021},
  year      = {2021},
  pages     = {1--6},
  publisher = {IEEE},
  doi       = {10.1109/SysCon48628.2021.9447055}
}
Sylvia Melzer, Oliver C. Eichmann, Hongxu Wang, and Ralf God,
Modeling and Simulation of Database Interactions, 2021. GfSE.
DOI:10.25592/uhhfdm.9696
Bibtex: BibTeX
@misc{sylvia_melzer_2021_9696,
  author       = {Sylvia Melzer and Oliver C. Eichmann and Hongxu Wang and Ralf God},
  title        = {Modeling and Simulation of Database Interactions},
  month        = {November},
  year         = {2021},
  publisher   = {GfSE},
  doi          = {10.25592/uhhfdm.9696}
}
Simon Schiff and Ralf Möller,
On Human-Aware Information Seeking, in Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2021), CEUR Workshop Proceedings, 2021. pp. 31–39.
Bibtex: BibTeX
@InProceedings{schiff2021human,
  author    = {Simon Schiff and Ralf Möller},
  booktitle = {Proceedings of the Workshop on Humanities-Centred Artificial Intelligence (CHAI 2021)},
  date      = {2021-09-28},
  title     = {On Human-Aware Information Seeking},
  language  = {en},
  pages     = {31--39},
  publisher = {CEUR Workshop Proceedings},
}
Marcel Gehrke,
On the Complexity and Completeness of the Lifted Dynamic Junction Tree Algorithm, in 10th International Workshop on Statistical Relational AI at the 1st International Joint Conference on Learning and Reasoning, 2021.
File: 2110.09197
Bibtex: BibTeX
@inproceedings{Geh21,
  author = {Marcel Gehrke}, 
  title = {{On the Complexity and Completeness of the Lifted Dynamic Junction Tree Algorithm}}, 
  booktitle = {10th International Workshop on Statistical Relational AI at the 1st International Joint Conference on Learning and Reasoning}, 
  year = {2021},
  url = {https://arxiv.org/abs/2110.09197}
}