Semantic Big Data-Workshop: PROGRAM-COMMITTEE

Important Dates

Time Schedule
Submission (extended):	March 6, 2017
Notification:	March 20, 2017
Workshop:	May 19, 2017

Diversity Considerations of the Program Committee

We have currently recruited 41 PC members and chairs listed below who are experts in the topics of interest of our workshop. The current PC members and chairs are selected from 18 nations all over the world as shown also by the map below. While most PC members are from academia, we have 5 experts also from industry (12%). 8 of the PC members and chairs are women (20%).

Legend

Program committee members and chairs: 1 8

Program Committee Chairs

Program Committee

Muhammad Intizar Ali, Insight, National University of Ireland, Galway
Carlos Buil Aranda, Universidad Técnica Federico Santa María, Chile
Mithun Balakrishna, Lymba Corporation, USA
Isabel Cruz, University of Illinois at Chicago, USA
Paulo Rupino da Cunha, University of Coimbra, Portugal
Melike Şah Direkoglu, Near East University, North Cyprus
Julian Dolby, IBM Research, USA
Vadim Ermolayev, Zaporizhzhya National University, Ukraine
Javier D. Fernández, Vienna University of Economics and Business, WU Vienna, Austria
Carlos Juiz García, Universitat de les Illes Balears, Spain
Katja Gilly de La Sierra-Llamazares, Miguel Hernandez University, Spain
Andreas Harth, Institute AIFB, Karlsruhe Institute of Technology (KIT), Germany
Ekaterini Ioannou, Technical University of Crete, Greece
Prudhvi Janga, University of Cincinnati and Amazon Web Services, USA
Ioannis Konstantinou, National Technical University of Athens, Greece
Nectarios Koziris, National Technical University of Athens, Greece
Herbert Kuchen, University of Münster, Germany
Wookey Lee, Inha University, Korea
Isaac Lera, Universitat de les Illes Balears, Spain
Xiang Lian, Kent State University, USA
Qing Liu, CSIRO, Australia
Nuno Lopes, TopQuadrant
Ioana Manolescu, INRIA and Ecole Polytechnique, France
Daniel Miranker, The University of Texas at Austin, USA
Grażyna Paliwoda-Pękosz, Cracow University of Economics, Poland
Nikolaos Papailiou, National Technical University of Athens, Greece
Alfredo Pulvirenti, University of Catania, Italy
Sherif Sakr, School of Computer Science and Engineering University of New South Wales, Australia
Stephan Seufert, Amazon Machine Learning (Industry), Germany
Omair Shafiq, Carleton University, Canada
Marta Tatu, Lymba Corporation, USA
Martin Theobald, University of Luxembourg, Luxembourg
Dimitrios Tsoumakos, Department of Informatics, Ionian University, Greece
Juergen Umbrich, Vienna University of Economics and Business, Vienna, Austria
Dongyan Zhao, Peking University Beijing, China
Xiang ZHAO, National University of Defense Technology, China
Weiguo Zheng, Chinese University of Hong Kong, China
Dimitrios Zissis, University of the Aegean, Greece
Lei Zou, Peking University, China

Session 1
Time	Type	Description
9:00:	keynote	Martin Theobald (University of Luxembourg, Luxembourg): Scalable RDF Data Management with a Touch of Uncertainty Abstract: The keynote provides an overview of our recent research activities and also highlights a number of research challenges in the context of extracting, indexing and querying large collections of RDF data. A core part of our work focuses on handling uncertain facts obtained from various information-extraction techniques, where we aim to develop efficient algorithms for querying the resulting uncertain RDF knowledge base with the help of a probabilistic database. A second, very recent research focus lies in scaling out these approaches to a distributed setting. Here, we aim to process declarative queries, posed in either SQL or logical query languages such as Datalog, via a proprietary, asynchronous communication protocol based on the Message Passing Interface. Our current RDF engine, coined "TriAD", has proven to be one of the fastest engines over a number of RDF benchmarks with up to 1.8 billion triples. Bio: Martin Theobald has been appointed as a Professor of Computer Science with a focus on "Big Data" by the University of Luxembourg in 2017. He previously held positions as a Professor and Co-Director of the Institute for Databases and Information Systems (DBIS) at the University of Ulm and as a Professor in the Advanced Database Research and Modeling (ADReM) group at the University of Antwerp. He obtained a doctoral degree from the Max Planck Institute for Informatics in Saarbrücken in 2006 and subsequently spent two years as a Postdoctoral Researcher at the Stanford University Infolab. Between 2008 and 2012, Martin led the research group for "Ranking and Uncertain Data Management" at the Max Planck Institute for Informatics. His current research interests are focused at the intersection of information extraction, probabilistic databases and distributed architectures. The "Big Data" group at the University of Luxembourg investigates the whole lifecycle of semantic-data management, beginning with the extraction of entities and relations from textual and semi-structured sources and on to data-cleaning aspects and probabilistic inference. Martin is an area editor for Elsevier’s "Information Systems" since 2013 and frequently serves as a reviewer and PC member of international journals and conferences such as CACM, TODS, TKDE, VLDB, SIGMOD, SIGIR, WSDM, CIKM and ICDE. Slides
10:00:	paper	Daniel Janke, Steffen Staab, Matthias Thimm: On Data Placement Strategies in Distributed RDF Stores DOI: 10.1145/3066911.3066915 Slides
10:30:	break	Coffee Break
Session 2
Time	Type	Description
11:00:	paper	Thomas Hassan, Christophe Cruz, Aurélie Bertaux: Ontology-based approach for unsupervised and adaptive focused crawling DOI: 10.1145/3066911.3066912 Slides
11:30:	paper	Mayank Kejriwal, Pedro Szekely: Supervised Typing of Big Graphs using Semantic Embeddings DOI: 10.1145/3066911.3066918 Extended Version: URN: urn:nbn:de:101:1-2017100112160 URL: Publisher Slides
12:00:	paper	Prashanti Manda, Todd J. Vision: Evolution of anatomical concept usage over time: Mining 200 years of biodiversity literature DOI: 10.1145/3066911.3066919
12:30:	break	Lunch Break
Session 3
Time	Type	Description
14:00:	keynote	Julian Dolby (IBM's Thomas J. Watson Research Center, U.S.A): Toward Scalable Semantic Big Data Abstract: SPARQL is the query language for RDF and linked data, and such data has been a focus of our work for quite a few years. In this talk, I shall start by summarizing some of our older work in the scalable semantics and reasoning space. The most basic is work scaling reasoning using refinement techniques. Built on that is work applying our reasoning to the medical domain, matching patients to clinical trials. Next, I shall discuss our work in scaling SPARQL queries in an RDF store. With this introduction, the main topic will be extending SPARQL to conveniently query across both RDF and non-RDF data. There are now standards to virtualize non-RDF datasets as RDF, such as R2RML, CSV2RDF and XSPARQL; thus SPARQL can be increasingly used to access RDF and non-RDF data. However, there are two chief shortcomings to using SPARQL in such contexts. First, SPARQL has no notion of modularity, and modularity is a key feature in assembling complex queries of the kind that are needed when one integrates very different datasets. Second, its support for query federation over different endpoints is limited: the endpoints all need to be SPARQL and the language does not allow for posting data to an endpoint. To rectify these shortcomings, we propose two simple extensions to the language to rectify these limitations: functions and generalized service. In designing these extensions, we were careful to keep the extensions minimal, to preserve SPARQL's declarative semantics. We define the semantics of each extension, and provide a open source reference implementation of this extended language, to provide processing over both relational and non-relational backends. Bio: Julian Dolby has been a Research Staff Member at IBM's Thomas J. Watson Research Center since 2000. He works on a range of topics, including static program analysis, software testing and the Semantic Web. He was educated at the University of Wisconsin-Madison as an undergraduate, and at the University of Illinois at Urbana-Champaign as a graduate student where he worked with Professor Andrew Chien on programming systems for massively-parallel machines. His work has been included in various IBM products like Rational AppScan and in the RDF support in DB2. Slides
15:00:	paper	Tien Duc Cao, Ioana Manolescu, Xavier Tannier: Extracting Linked Data from statistic spreadsheets DOI: 10.1145/3066911.3066914 Slides
15:30:	break	Coffee Break
Session 4
Time	Type	Description
16:00:	paper	Michael J. Lewis, George K. Thiruvathukal, Venkatram Vishwanath, Michael E. Papka, Andrew Johnson: A Distributed Graph Approach For Pre-processing Linked RDF Data Using Supercomputers DOI: 10.1145/3066911.3066913
16:30:	paper	Yogesh Pandey, Srividya K. Bansal: Safety Check – A Semantic Web Application for Emergency Management DOI: 10.1145/3066911.3066917 Extended Version: URL: Publisher Slides
17:00:	paper	Michelle C. Krzyzanowski, Josh Levy, Grier P. Page, Nathan C. Gaddis, Robert F. Clark: Using Semantic Web Technologies to Power LungMAP, a Molecular Data Repository DOI: 10.1145/3066911.3066916 Slides
17:30:	break	End of Workshop

The International Workshop on Semantic Big Data (SBD 2017)

Program Committee

Semantic Big Data

Questions

Types of Papers

Evaluation Criteria

Topics of Interest

Aims of the Workshop

Types of Papers

Topics of Interest

Important Dates

Diversity Considerations of the Program Committee

Legend

Program Committee Chairs

Program Committee

Evaluation of Papers

Accepted Papers

Program

Session 1

Session 2

Session 3

Session 4

Manuscript Preparation

Submission

Contact Program Chairs

Editions