


Platinum Sponsor: Teradata




National Science Foundation

27th International Conference on Scientific and Statistical Database Management

Conference Schedule

June 29 - July 1, 2015
SDSC Auditorium at UC San Diego

Required Materials
This conference is designed to be hands-on so participants are expected to bring a laptop computer to follow along with demos and hands-on instruction throughout the program.

AGENDA at a glance

Monday, June 29
8:30 AM – 9:00 AM:  Registration
9:00 AM – 5:00 PM:  Conference
6:00 PM – 8:00 PM:  Conference Reception

Tuesday, June 30
9:00 AM – 5:00 PM:  Conference
6:30 PM – 9:00 PM:  Conference Dinner

Wednesday, July 1
9:00 AM – Noon: Conference
1:00 PM – 5:00 PM: Optional Excursion: UCSD Birch Aquarium. Tickets cost $7.50 and can be purchased through registration; a limited number may be available on-site. Transportation is available.

Detailed Agenda

Day 1: MONDAY, June 29

8:30 AM

Continental Breakfast

9:00 AM

SSDBM 2015 Welcome by Chairs

AM Session 1
9:10 AM –
10:10 AM

KEYNOTE: Dan Graham, Teradata, Director
The Data Renaissance: Critical Big Data Technologies

Who knew Kryder’s Law outruns Moore’s Law? Disk areal density doubles every 18 months, faster than silicon wafers' doubling every 24 months. The result: petabyte clusters are becoming commonplace, which has spawned dozens of new technologies to manage and distill the data. In this session we will tour the state of the art in data management, especially data architectures and the mining of large data sets. We will tour trends in SIMD vector processing, self-service data wrangling, the now-famous data lakes, SQL-on-Hadoop, analytic graph engines, hot and cold data, data curation, and late-binding schemas. Real-world examples of large-scale sensor data, eCommerce, and healthcare diagnosis will illustrate key points in the data renaissance. Last, we ask ourselves: what’s next?

Dan Graham has over 30 years of experience in IT. He joined Teradata Corporation in 1989, where he was the senior product manager for the DBC/1012 parallel database computer. He then joined IBM, where he wrote product plans and launched the RS/6000 SP parallel server, and later became Strategy Executive for IBM's Global Business Intelligence Solutions. As Enterprise Systems General Manager at Teradata, Dan was responsible for strategy, go-to-market success, and competitive differentiation for the Active Enterprise Data Warehouse platform. He currently leads Teradata's technical marketing activities.

10:10 – 10:30 AM


AM Session 2
10:30 AM –
11:45 AM

Research Session 1: Stream Processing and Aggregate Queries (25 minutes each)



AM Session 3
11:45 AM –
12:35 PM


Research Session 2: Analytics (25 minutes each)

12:35 – 1:30 PM


PM Session 1
1:30 PM –
2:45 PM

Research Session 3: Text Processing and Keyword Queries (25 minutes each)

2:45 – 3:30 PM


PM Session 2
3:30 PM –
5:10 PM

Short Papers (10 minutes each)



5:30 PM –
8:00 PM

Welcome Reception: The Village, 15th Floor, UC San Diego
See map for location

Day 2: TUESDAY, June 30

8:30 AM

Continental Breakfast

9:00 AM –
10:00 AM

KEYNOTE 2: Michael Carey, UC Irvine, Bren Professor
AsterixDB: A Counter but Intuitive Approach to Big Data Management

We are living in the Big Data era, and we are witnessing a shift in the role of data management systems. Rather than “just” being the systems of record at the heart of traditional enterprises, modern Big Data management systems must model, capture, track, and react to the current state of the world. Doing so requires ingesting event data arriving from a variety of devices, as well as enabling query access to the history of captured data over time. These requirements span a variety of scientific disciplines, including the handling of data produced by a variety of sensors in health care, environmental monitoring applications, traffic monitoring, dynamic social network data, and many other domains.

AsterixDB is an open source Big Data Management System (BDMS) with a feature set that’s very different from those of other platforms in today's Big Data ecosystem. The system was initially co-developed by UC Irvine and UC Riverside, starting in 2009 and leading eventually to its first beta release in mid-2013. It has recently moved to Apache, where AsterixDB is now an active incubating project. Many of the system’s key design decisions relate to the aforementioned shift. This talk will first briefly review AsterixDB’s data model, query language, and scale-out architecture. It will then examine a number of counter-cultural aspects of the AsterixDB system, including where its data lives, its runtime architecture, its approach to streaming data, its view of transactions, and its features for handling time-based data.

Michael J. Carey is a Bren Professor of Information and Computer Sciences at UC Irvine. Before joining UCI in 2008, Carey worked at BEA Systems for seven years and led the development of BEA's AquaLogic Data Services Platform product for virtual data integration. He also spent a dozen years teaching at the University of Wisconsin-Madison, five years at the IBM Almaden Research Center working on object-relational databases, and a year and a half at e-commerce platform startup Propel Software during the infamous 2000-2001 Internet bubble. Carey is an ACM Fellow, a member of the National Academy of Engineering, and a recipient of the ACM SIGMOD E.F. Codd Innovations Award. His current interests all center around data-intensive computing and scalable data management (a.k.a. Big Data).

10:00 – 10:20 AM


AM Session 1
10:20 AM –
10:45 AM

Research Session 4: Application (25 minutes each)

AM Session 2
10:45 AM –
12:00 PM

Research Session 5: Query Processing (1) (25 minutes each)

12:00 – 1:00 PM


PM Session 1
1:00 PM –
2:15 PM

Research Session 6: Query Processing (2) (25 minutes each)

PM Session 2
2:15 PM –
3:55 PM

Research Session 7: Indexing (25 minutes each)

3:55 – 4:15 PM


Announcement About SSDBM 2016 in Budapest, Hungary


PM Session 3
4:15 PM –
5:15 PM

PANEL: Aggregation and Multidimensional Analysis over Big Data
Chair: Alfredo Cuzzocrea, University of Trieste
Alfredo Cuzzocrea, University of Trieste
Lukasz Golab, University of Waterloo
Daniel Graham, Teradata
Amarnath Gupta, UC San Diego
Carlo Zaniolo, UC Los Angeles

6:15 – 9:00 PM

Dinner: La Jolla Shores Hotel & Restaurant
8110 Camino Del Oro, La Jolla, CA 92037
Transportation will depart from SDSC driveway
(Dinner will be on the beach; temperatures can be cool, so a sweater or light jacket is recommended.)


Day 3: WEDNESDAY, July 1

8:30 AM

Continental Breakfast

AM Session 1
9:00 AM –
10:15 AM

Research Session 8: Privacy and Systems (25 minutes each)

AM Session 2
10:15 AM –
11:30 AM

Research Session 9: Graph Queries (25 minutes each)

  • Querying RDF Data with Text Annotated Graphs, Lushan Han, Samsung Research America; Tim Finin, University of Maryland, Baltimore County; Anupam Joshi, University of Maryland, Baltimore County; Doreen Cheng, Samsung Research America
  • Relaxation of Subgraph Queries Delivering Empty Results, Elena Vasilyeva, SAP SE; Maik Thiele; Adrian Mocan; Wolfgang Lehner, TU Dresden
  • GRAPHITE: An Extensible Graph Traversal Framework for Relational Database Management Systems, Marcus Paradies, TU Dresden; Wolfgang Lehner, TU Dresden; Christof Bornhövd, Risk Management Solutions, Inc.

11:30 – 11:40 AM


AM Session 3
11:40 AM –
12:55 PM

Research Session 10: Similarity Techniques (25 minutes each)

  • Shape-Based Similarity Search in Fuzzy Object Databases, Andreas Zuefle, LMU Munich; Klaus Arthur Schmid, Ludwig-Maximilians-Universität München; Thomas Bernecker, Ludwig-Maximilians-Universität München; Tobias Emrich, Ludwig-Maximilians-Universität München; Matthias Renz, Ludwig-Maximilians-Universität München; Diana Uskat, Ludwig-Maximilians-Universität München


12:55 PM

Wrap Up

1:15 PM

Meeting Adjourned

1:15 – 5:00 PM

UCSD’s Birch Aquarium
2300 Expedition Way, La Jolla, CA 92037
Tickets available through registration and at registration desk
Transportation available from SDSC driveway from 1:00 – 5:00 PM
Last pick-up at Birch Aquarium at 4:30 PM.  See handout for details
