SC is the International Conference for
High Performance Computing, Networking,
Storage and Analysis



SCHEDULE: NOV 12-18, 2011

When viewing the Technical Program schedule, on the far righthand side is a column labeled "PLANNER." Use this planner to build your own schedule. Once you select an event and want to add it to your personal schedule, just click on the calendar icon of your choice (outlook calendar, ical calendar or google calendar) and that event will be stored there. As you select events in this manner, you will have your own schedule to guide you through the week.

You can also create your personal schedule on the SC11 app (Boopsie) on your smartphone. Simply select a session you want to attend and "add" it to your plan. Continue in this manner until you have created your own personal schedule. All your events will appear under "My Event Planner" on your smartphone.

ISABELA-QA: Query-driven Data Analytics over ISABELA-compressed Extreme-Scale Scientific Data

SESSION: Querying Large Scale Data

EVENT TYPE: Paper

TIME: 4:30PM - 5:00PM

AUTHOR(S):Sriram Lakshminarasimhan, Jonathan Jenkins, Robert Latham, Robert Ross, Nagiza F. Samatova, Isha Arkatkar, Zhenhuan Gong, Hemanth Kolla, Jackie Chen, Seung-Hoe Ku, C.S. Chang, Stephane Ethier, Scott Klasky

ROOM:TCC 305

ABSTRACT:
We present a query processing engine for scientific data based on ISABELA, a partitioned B-spline-lossy-compression scheme. We optimize spatial region and variable queries on variable and temporal constraints by performing temporal-delimited binning on the range of the variable values, ensuring near-uniform distribution of compressed data across bins. We demonstrate the high, user-controlled accuracy of reconstructed data through several analytic scenarios, and the competitive performance of variable/temporal-constrained query processing, while incurring both a smaller memory and storage footprint compared to both raw data and popular scientific database systems. Finally, we discuss a number of HPC optimizations, such as parallel I/O and multi-node/multi-core query processing parallelized by temporal constraints, that allow extreme scale scalability.

Chair/Author Details:

Sriram Lakshminarasimhan - North Carolina State University

Jonathan Jenkins - North Carolina State University

Robert Latham - Argonne National Laboratory

Robert Ross - Argonne National Laboratory

Nagiza F. Samatova - Oak Ridge National Laboratory

Isha Arkatkar - North Carolina State University

Zhenhuan Gong - North Carolina State University

Hemanth Kolla - Sandia National Laboratories

Jackie Chen - Sandia National Laboratories

Seung-Hoe Ku - New York University

C.S. Chang - New York University

Stephane Ethier - Princeton Plasma Physics Laboratory

Scott Klasky - Oak Ridge National Laboratory

Add to iCal  Click here to download .ics calendar file

Add to Outlook  Click here to download .vcs calendar file

Add to Google Calendarss  Click here to add event to your Google Calendar

The full paper can be found in the ACM Digital Library

   Sponsors    ACM    IEEE