OPEN DATA SETS AND PLATFORMS

Many organizations are making data publicly available for different uses and offer a variety of hosting and listing services. Although not an exhaustive list, the following table provides examples of organizations with open data platforms and specific data sets that are relevant to mining. This list aims to supplement the Guideline for Sharing Open Data Sets in Mining (currently a draft under review)The list is sorted alphabetically by organization. 

This list is intended to be a live document and we encourage you to reach out with additions or corrections. Anything missing or that you would like to add to this list? Please contact info@gmggroup.org  

Industry/Organization 

Description 

Related Links 

4TU Research Data

Provides general research data in science, engineering, and design with mining categories

Australian Government

Provides geospatial data sets and other spatial indexes

Australian Research Data Commons (ARDC)

Provides Australian research communities and industries access to data intensive eInfrastructures, platforms, skills, and collections of data

AWS S3 Data sets

Includes data sets provided and maintained by a variety of third parties under a variety of licenses, offers listing services

BDD100K

A Diverse Driving Dataset for Heterogeneous Multitask Learning

Canadian Government

Provides ability to search open data relevant to Canadians

Centre for Open Science

Provides Transparency and Openness Promotion (TOP) Guidelines, 2015, with more than 5000 signatories

Centro Avanzado de Tecnología para la Minería (AMTC)

Provides spatial navigation (3D and 2D readings) and underground mine scanning in Chile

Common Crawl

Provides a general open repository of web crawl data since 2013 that can be accessed and analyzed by anyone

The Dat Foundation

Nonprofit group that supports work in community management, user experience, and technical research and development

EU Open Data

Provides access to European Union open data

Geoscience Australia

Provides geology data set information

Global Tailings Portal

Provides disclosures from mining companies about their tailings storage facilities

Google

Provides public data sets

Google

Provides general Google data set discovery and meta data guidelines

Google Data Search

General Google data search tool

Google Landmarks Dataset 

This is the second version of the Google Landmarks dataset (GLDv2), which contains images annotated with labels representing human-made and natural landmarks.

Government of Canada

Provides data sets of locations for different mining applications in Canada

Homeland Infrastructure Foundation-Level Data (HIFLD)

Provides data sets of locations for different mining applications in the US

IBM

Provides open data sets

Kaggle

Provides a data set about a flotation plant and iron ore concentration process

Kaggle

Provides records of accidents from 12 different plants in three different countries

Microsoft Azure

Provides curated open data sets

MINEDEX

Provides a spatial and textual database with data on mining and exploration sites and projects in WA

Minelib

Shows block models of open pit mines for planning optimization benchmarking

Natural Language Processing Data Set

Provides a comprehensive set of Natural Processing Language data sets

NSW Government

Provides open data sets for the Department of Planning, Industry, and Environment

Ontario Government

Provides government data sets for Ontario

OpenML

Experiment database for machine learning on which users can upload data sets

Process Mining

Provides process mining data sets, real-life event logs, and synthetic events

Queensland Government

Provides Queensland Government Open Data Portal (mining data sets)

Shahid Bahonar University Department of Mechanical Engineering

Shows images of rocks post blasting

ToyADMOS

Provides anomaly detection in machine operating sounds

Unearthed

Community of data scientists, developers, and startups working on challenges within the energy and natural resources industries

University of WA

UWA Prognostics Data Library that provides condition monitoring information for academic use and requires registration

US Government

Provides US government open data, tools, and resources to conduct research, develop web and mobile applications, and design data visualizations

WA DMIRS

Data sets uploaded by the West Australian Dept of Mines, Industry Regulation and Safety

World Bank

Provides free and open access to global development data

X