Fox, Geoffrey CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science Slide <div>PI: G. Fox, Co-PIs: Madhav Marathe, Shantenu Jha, Judy Qiu, Fusheng Wang </div><div>Institutions: Arizona State, Indiana (lead), Kansas, Rutgers, Stony Brook, Virginia, and Utah.</div><div><br></div><div>NSF 1443054 “Middleware and High-Performance Analytics Libraries for Scalable Data Science” is a collaboration between 7 collaborating Universities at Arizona State, Indiana (lead), Kansas, Rutgers, Stony Brook, Virginia, and Utah. It addresses the intersection of HPC and Big Data computing with several different application areas or communities driving the requirements for software systems and algorithms. The base architecture includes the HPC-ABDS, High-Performance Computing Enhanced Apache Big Data Stack, and application use cases identifying key features that determine software and algorithm requirements. The middleware includes the Harp-DAAL collective communication layer, Twister2 Big Data toolkit, and RADICAL pilot jobs for batch and streaming applications. The SPIDAL Scalable Parallel Interoperable Data Analytics Library includes core machine-learning, image processing, and the application communities, Network science, Polar Science, Biomolecular Simulations, Pathology, and Spatial systems. Recent work focuses on the integration of ML with HPC in HPCafterML in biomolecular simulations and a broad study of HPCforML.</div><div><br></div> HPC;Cloud Computing; Information Technology; Software as a Service; Infrastructure as a Service; Platform as a Service;Deep Learning Applications;machine Learning;Parallel computing;NSF-CSSI-2020-Talk;Computer Software;Computer System Architecture 2020-02-03
    https://figshare.com/articles/presentation/CIF21_DIBBs_Middleware_and_High_Performance_Analytics_Libraries_for_Scalable_Data_Science_Poster/11788911
10.6084/m9.figshare.11788911.v2