Semantic Big Data-Workshop: IMPORTANT-DATES

The International Workshop on Semantic Big Data (SBD 2018)

In conjunction with ACM SIGMOD 2018

Program Committee

Our members of the Program Committee are coming from all around the world!

More...

Semantic Big Data

Do you want to know why we believe that Semantic Big Data is a hot topic in research?

More...

Questions

If you have any questions, please do not hesitate to contact the workshop chairs!

More...

Types of Papers

We accept five types of papers

Research Papers
Experiments and Analysis Papers
Application Papers
Vision Papers
Demo Papers

More...

Evaluation Criteria

We evaluate our submissions according to a set of criteria...

More...

Topics of Interest

We are interested in submissions in any topic related to Semantic Big Data...

More...

Aims of the Workshop

The current World-Wide Web enables an easy, instant access to a vast amount of online information. However, the content in the Web is typically for human consumption, and is not tailored for machine processing. The Semantic Web is hence intended to establish a machine-understandable Web, and is currently also used in many other domains and not only in the Web. The World Wide Web Consortium (W3C) has developed a number of standards around this vision. Among them is the Resource Description Framework (RDF), which is used as the data model of the Semantic Web. The W3C has also defined SPARQL as the RDF query language, RIF as the rule language, and the ontology languages RDFS and OWL to describe schemas of RDF. The usage of common ontologies increases interoperability between heterogeneous data sets, and the proprietary ontologies with the additional abstraction layer facilitate the integration of these data sets. Therefore, we can argue that the Semantic Web is ideally designed to work in heterogeneous Big Data environments.

We define Semantic Big Data as the intersection of Semantic Web data and Big Data. There are masses of Semantic Web data freely available to the public - thanks to the efforts of the linked data initiative. According to http://stats.lod2.eu/ the current freely available Semantic Web data is approximately 150 billion triples in about 3,000 datasets, many of which are accessible via SPARQL query servers called SPARQL endpoints. Everyone can submit SPARQL queries to SPARQL endpoints via a standardized protocol, where the queries are processed on the datasets of the SPARQL endpoints and the query results are sent back in a standardized format. Hence, not only Semantic Big Data is freely available, but also distributed execution environments for Semantic Big Data are freely accessible. This makes the Semantic Web an ideal playground for Big Data research.

The goal of this workshop is to bring together academic researchers and industry practitioners to address the challenges and report and exchange the research findings in Semantic Big Data, including new approaches, techniques and applications, make substantial theoretical and empirical contributions to, and significantly advance the state of the art of Semantic Big Data.

Categories of Papers

The workshop solicits papers of different categories:

Research Papers propose new approaches, theories or techniques related to Semantic Big Data including new data structures, algorithms and whole systems. They should make substantial theoretical and empirical contributions to the research field.
Experiments and Analysis Papers focus on the experimental evaluation of existing approaches including data structures and algorithms for Semantic Big Data and bring new insights through the analysis of these experiments. Results of Experiments and Analysis Papers can be, for example, showing benefits of well-known approaches in new settings and environments, opening new research problems by demonstrating unexpected behavior or phenomena, or comparing a set of traditional approaches in an experimental survey.
Application Papers report practical experiences on applications of Semantic Big Data. Application Papers might describe how to apply Semantic Web technologies to specific application domains with big data demands like social networks, web search, e-business, collaborative environments, e-learning, medical informatics, bioinformatics and geographic information system. Application Papers might describe applications using linked data in a new way.
Vision Papers identify emerging new or future research issues and directions, and describe new research visions having demands for Semantic Big Data. The new visions will potentially have great impacts on society.
Demo Papers deal with innovative systems and applications for Semantic Big Data. These papers describe a showcase of the proposed system/application, but may also explain the novelty of the system's architecture. We are especially interested in demonstrations having a WOW-effect.

For all categories (except for demo papers), we accept two different types of papers: Short and Full papers. The length of full papers cannot exceed 6 pages. The length of all other papers (i.e., short and demo papers) cannot exceed 4 pages. Accepted full and short papers will be presented in oral presentations. Demo papers will be presented as part of a combined demo and poster session. All accepted full and short papers will also be presented as posters in the combined demo and poster session in order to increase interactivity and discussion with the audience.

Topics of Interest

We welcome papers on the following topics:

Semantic Data Management, Query Processing and Optimization in
- Big Data
- Cloud Computing
- Internet of Things
- Graph Databases
- Federations
- Spatial and Spatio-Temporal Data
Evaluation strategies for Semantic Big Data of Rule-based Languages like RIF and SWRL
Ontology-based Approaches for Modeling, Mapping, Evolution and Real-world ontologies in the context of Semantic Big Data
Reasoning Approaches (Real-World Applications, Efficient Algorithms) especially designed for Semantic Big Data environments
Linked Data
- Integration of Heterogeneous Linked Data
- Real-World Applications
- Statistics and Visualizations
- Quality
- Ranking Techniques
- Provenance
- Mining and Consuming Linked Data
Semantic Web stream processing (Dynamic Data, Temporal Semantics)
Semantic Internet of Things
Semantic Smart Homes/Companies/Cities
Performance, Evaluation and Benchmarking of Semantic Web Technologies, Applications and Databases
Semantic Web Services
Semantic Big Data Archives
- Efficient Archiving and Preservation Techniques
- Evolution Representation
- Compression Approaches
- Querying Techniques
Semantic Big Data on Emergent Hardware Technologies
- FPGA
- GPU
- SSD
- Main-Memory Databases

Important Dates

Time Schedule
Submission (extended):	March 17, 2018
Notification:	April 16, 2018
Workshop:	June 10, 2018

Diversity Considerations of the Program Committee

We have currently recruited 34 PC members and chairs listed below who are experts in the topics of interest of our workshop. The current PC members and chairs are selected from 18 nations all over the world as shown also by the map below. While most PC members are from academia, we have 3 experts also from industry (9%). 9 of the PC members and chairs are women (26%).

Legend

Program committee members and chairs: 1 8

Program Committee Chairs

Sven Groppe, University of Lübeck, Germany
Le Gruenwald, University of Oklahoma, USA

Program Committee

Muhammad Intizar Ali, Insight, National University of Ireland, Galway
Carlos Buil Aranda, Universidad Técnica Federico Santa María, Chile
Mithun Balakrishna, Lymba Corporation, USA
Isabel Cruz, University of Illinois at Chicago, USA
Paulo Rupino da Cunha, University of Coimbra, Portugal
Melike Şah Direkoglu, Near East University, North Cyprus
Julian Dolby, IBM Research, USA
Vadim Ermolayev, Zaporizhzhya National University, Ukraine
Javier D. Fernández, Vienna University of Economics and Business, WU Vienna, Austria
Carlos Juiz García, Universitat de les Illes Balears, Spain
Katja Gilly de La Sierra-Llamazares, Miguel Hernandez University, Spain
Ekaterini Ioannou, Open University of Cyprus
Prudhvi Janga, University of Cincinnati and Amazon Web Services, USA
Ioannis Konstantinou, National Technical University of Athens, Greece
Herbert Kuchen, University of Münster, Germany
Isaac Lera, Universitat de les Illes Balears, Spain
Xiang Lian, Kent State University, USA
Qing Liu, CSIRO, Australia
Ioana Manolescu, INRIA and Ecole Polytechnique, France
Daniel Miranker, The University of Texas at Austin, USA
Grażyna Paliwoda-Pękosz, Cracow University of Economics, Poland
Nikolaos Papailiou, National Technical University of Athens, Greece
Alfredo Pulvirenti, University of Catania, Italy
Sherif Sakr, School of Computer Science and Engineering University of New South Wales, Australia
Omair Shafiq, Carleton University, Canada
Marta Tatu, Lymba Corporation, USA
Martin Theobald, University of Luxembourg, Luxembourg
Konstantinos Tserpes, Harokopio University of Athens, Greece
Dimitrios Tsoumakos, Department of Informatics, Ionian University, Greece
Xiang Zhao, National University of Defense Technology, China
Weiguo Zheng, Chinese University of Hong Kong, China
Dimitrios Zissis, University of the Aegean, Greece

Evaluation of Papers

To verify the originality of submissions, we will use Plagiarism Detection Tools to check the content of the submitted manuscripts against previous publications.

Papers will be evaluated according to the following aspects:

Relevance to the Workshop
Novelty and practical impact
Technical soundness
Appropriateness and adequacy of:
- Literature review
- Background discussion
- Analysis of issues
Presentation, including:
- Overall organization and structure
- Correctness of English language
- Readability

Accepted Papers

The proceedings are available here in ACM DL.

Amgad Madkour, Walid G. Aref, Ahmed M. Aly:
SPARTI: Scalable RDF Data Management Using Query-Centric Semantic Partitioning
DOI: 10.1145/3208352.3208356
Dimitris Stripelis, Chrysovalantis Anastasiou, Jose Luis Ambite:
Extending Apache Spark With A Mediation Layer
DOI: 10.1145/3208352.3208354
Libo Gao, Lukasz Golab, M. Tamer Ozsu, Gunes Aluc:
Stream WatDiv - A Streaming RDF Benchmark
DOI: 10.1145/3208352.3208355
Andrew Sutton, Reza Samavi:
Timestamp-based Integrity Proofs for Linked Data
DOI: 10.1145/3208352.3208353
Amgad Madkour, Walid G. Aref, Sunil Prabhakar, Mohamed Ali, Siarhei Bykau:
TrueWeb: A Proposal for Scalable Semantically-Guided Data Management and Truth Finding in Heterogeneous Web Sources
DOI: 10.1145/3208352.3208357

Program

This year we will integrate the programs of SBD and WebDB and co-locate both workshops in the same room. By offering talks as well as interactive sessions with poster and demo presentations, we facilitate interactions and discussions between researchers of both communities, Semantic Big Data and Web Databases.

Session 1 (WebDB)
Time	Type	Description
8:00:	WebDB	WebDB program
10:30:	break	Coffee break
Session 2 (WebDB)
Time	Type	Description
11:00:	WebDB	WebDB program
12:30:	break	Lunch Break
Session 1 (SBD)
Time	Type	Description
14:00:	keynote	Lei Zou (Peking University, China): Graph-based RDF data management Abstract: The increasing size of RDF data requires efficient systems to store and query them. There have been efforts to map RDF data to a relational representation, and a number of systems exist that follow this approach. We have been investigating an alternative approach of maintaining the native graph model to represent RDF data, and utilizing graph database techniques (such as a structure-aware index and a graph matching algorithm) to address RDF data management. Since 2009, we have been developing a set of graph-based RDF data management systems that follow this approach: gStore, gStore-D and gAnswer. The first two are designed to support efficient SPARQL query evaluation in a centralized and distributed/parallel environments, respectively, while the last one aims to provide an easy-to-use interface (natural language question/answering) for users to access a RDF repository. In this talk, we give an overview of these systems and also discuss our design philosophy. Bio: Lei Zou is a Professor in the Institute of Computer Science and Technology (ICST) of Peking University (PKU). He joined PKU in 2009 after receiving his BS degree and Ph.D. degree in Computer Science at Huazhong University of Science and Technology (HUST) in 2003 and 2009, respectively. He received a CCF (China Computer Federation) Doctoral Dissertation Nomination Award in 2009, won Second Class Prize of CCF Natural Science Award in 2014 and Second Class Prize of Natural Science of the Ministry of Education, China in 2017. During his PhD, Lei Zou visited Hong Kong University of Science and Technology in 2007 and University of Waterloo in 2008 as a visiting scholar. His recent research interests include graph databases, knowledge graph, particularly in graph-based RDF data management. He has published more than 40 papers, including more than 30 papers published in reputed journals and major international conferences, such as SIGMOD, VLDB, ICDE, AAAI, TODS, TKDE, VLDB Journal. Slides
15:00:	paper	Amgad Madkour, Walid G. Aref, Ahmed M. Aly: SPARTI: Scalable RDF Data Management Using Query-Centric Semantic Partitioning DOI: 10.1145/3208352.3208356
15:30:	poster	Workshop Poster Session
Session 2 (SBD)
Time	Type	Description
16:30:	paper	Dimitris Stripelis, Chrysovalantis Anastasiou, Jose Luis Ambite: Extending Apache Spark With A Mediation Layer DOI: 10.1145/3208352.3208354
17:00:	paper	Libo Gao, Lukasz Golab, M. Tamer Ozsu, Gunes Aluc: Stream WatDiv - A Streaming RDF Benchmark DOI: 10.1145/3208352.3208355
17:30:	paper	Andrew Sutton, Reza Samavi: Timestamp-based Integrity Proofs for Linked Data DOI: 10.1145/3208352.3208353
18:00:	paper	Amgad Madkour, Walid G. Aref, Sunil Prabhakar, Mohamed Ali, Siarhei Bykau: TrueWeb: A Proposal for Scalable Semantically-Guided Data Management and Truth Finding in Heterogeneous Web Sources DOI: 10.1145/3208352.3208357
18:30:	break	End of Workshop

Manuscript Preparation

Authors are invited to submit original, unpublished research papers that are not being considered for publication in any other forum.

Manuscripts should be submitted electronically as PDF files using this webpage and be formatted using the camera-ready templates in the ACM proceedings double-column format according to the "sigconf" proceedings template. Papers cannot exceed 6 pages in length.

Accepted papers will be published online in the ACM digital library. The papers must include the standard ACM copyright notice on the first page.

The pdf version of your paper should consider the following items:

The pdf be optimized for fast web viewing.
The pdf should apply the ACM Computing Classification categories and terms (CCS concepts). The ACM templates provide space for this indexing and please consider the Computing Classification Scheme.
The pdf should contain the keywords.
The pdf should have the rights management statement and bibliographic strip on the bottom of the first page left column.
Please start numbering your paper with page number 1.
The pdf should have Type 1 fonts (scalable), not Type 3 (bit-mapped). All fonts MUST be embedded within the PDF file (to be corrected in the source files before the PDF is generated according to ACM documentation).

Submission

The submission is currently closed. Please check our Important Dates page.

Contact Program Chairs

Please contact us for any further information:

Editions

Please use the following links for further information on the edition of the given year of the International Workshop on Semantic Big Data (SBD):

2016
2017
2018
2019
2020

The International Workshop on Semantic Big Data (SBD 2018)

Program Committee

Semantic Big Data

Questions

Types of Papers

Evaluation Criteria

Topics of Interest

Aims of the Workshop

Categories of Papers

Topics of Interest

Important Dates

Diversity Considerations of the Program Committee

Legend

Program Committee Chairs

Program Committee

Evaluation of Papers

Accepted Papers

Program

Session 1 (WebDB)

Session 2 (WebDB)

Session 1 (SBD)

Session 2 (SBD)

Manuscript Preparation

Submission

Contact Program Chairs

Editions