Skip to content

Latest commit

 

History

History
33 lines (16 loc) · 2.13 KB

index.md

File metadata and controls

33 lines (16 loc) · 2.13 KB
layout title description
default
BLASTNet
Bearable Large Accessible Scientific Training Network-of-Datasets

Mission

summary

BLASTNet aims to address gaps in open machine learning (ML) within the sciences, specifically fluid mechanics by providing researchers in reacting and non-reacting flow physics communities with (mostly) externally contributed open-source ML resources.

This data is useful for fluid flows in a wide range of ML applications tied to automotive, propulsion, energy, and the environment. Specifically, scientific engineering tasks related to these domains may include turbulent closure modeling, spatio-temporal modeling, and inverse modeling.

These contributions now include (i) 4.8 TB of high-fidelity simulation datasets that have been processed in a convenient format for ML applications, (ii) >13,000 lines of code that aid the training and evaluating of these models, (iii) >100 pre-trained weights in flow physics problems, and (iv) regular workshop events that disseminate ML for flow physics via seminars and competitions.

Distribution

Our ML resources are shared via github and Kaggle. Specifically, code is shared via github, while data and models are shared via Kaggle.

To circumvent Kaggle storage constraints, we partition our data into a network of <100 GB subsets, with each subset containing a separate simulation configuration. This partitioned data can then be uploaded as separate datasets on Kaggle. To download all cases via Kaggle API, download this bash script. Summary of the data are avalable here!

Our network of datasets approach: approach