Workshop: International Parallel Data Systems Workshop (PDSW)
Event TypeWorkshop
Registration Categories
Big Data
Data Analytics
Data Management
TimeMonday, 18 November 20199am - 5:30pm
DescriptionWe are pleased to announce that the 4th International Parallel Data Systems Workshop (PDSW’19) will be hosted at SC19: The International Conference for High Performance Computing, Networking, Storage and Analysis. The objectives of this one day workshop are to promote and stimulate researchers’ interactions to address some of the most critical challenges for scientific data storage, management, devices, and processing infrastructure for both traditional compute intensive simulations and data-intensive high performance computing solutions. Special attention will be given to issues in which community collaboration can be crucial for problem identification, workload capture, solution interoperability, standards with community buy-in, and shared tools.

Many scientific problem domains continue to be extremely data intensive. Traditional HPC systems and the programming models for using them such as MPI were designed from a compute-centric perspective with an emphasis on achieving high floating point computation rates. But processing, memory, and storage technologies have not kept pace and there is a widening performance gap between computation and the data management infrastructure. Hence data management has become the performance bottleneck for a significant number of applications targeting HPC systems. Concurrently, there are increasing challenges in meeting the growing demand for analyzing experimental and observational data. In many cases, this is leading new communities to look toward HPC platforms. In addition, the broader computing space has seen a revolution in new tools and frameworks to support Big Data analytics and machine learning.

9:00am - 9:10amInternational Parallel Data Systems Workshop (PDSW)
9:10am - 10:00amAlluxio - Data Orchestration for Analytics and AI in the Cloud
10:00am - 10:30amPDSW Morning Break
10:30am - 10:55amIn Search of a Fast and Efficient Serverless DAG Engine
10:55am - 11:20amEnabling Transparent Asynchronous I/O Using Background Threads
11:20am - 11:40amPDSW Works in Progress I
11:40am - 12:05pmActive Learning-Based Automatic Tuning and Prediction of Parallel I/O Performance
12:05pm - 12:30pmApplying Machine Learning to Understand the Write Performance of Large-Scale Parallel Filesystems
12:30pm - 2:00pmPDSW Lunch Break
2:00pm - 2:40pmA House Divided: Why Don't Cloud Storage and HPC Storage Share More Technology?
2:40pm - 3:00pmPDSW Works in Progress II
3:00pm - 3:30pmPDSW Afternoon Break
3:30pm - 3:55pmTowards Physical Design Management in Storage Systems
3:55pm - 4:20pmA Foundation for Automated Placement of Data
4:20pm - 4:45pmProfiling Platform Storage Using IO500 and Mistral
4:45pm - 5:10pmUnderstanding Data Motion in the Modern HPC Data Center
5:10pm - 5:30pmPDSW Works in Progress III
Back To Top Button