deleted deleted Manager • over 9 years ago
5 Days to Go! Here are your DATA SETS!
Please comment on this thread if you have additional data sets to share or ideas on how to collect or use the data! This is meant to be a collaborative experience overall, and to use an ocean metaphor: a rising tide lifts all boats. Share!
The International Council for Exploration of the Seas
http://www.ices.dk/marine-data/dataset-collections/Pages/default.aspx
NOAA.gov Dataset search
https://data.noaa.gov/dataset
Fish Watch - Provides the database on seafood counts
http://fishwatch.gov
Economics: National Ocean Watch
https://coast.noaa.gov/dataregistry/search/collection/info/enow
https://coast.noaa.gov/digitalcoast/training/diving-into-the-ocean-economy-with-economics.html
When all else fails, hit up http://data.gov - there's tons and tons of interesting stuff there. You can basically search for anything there and find cool stuff to hack on.
In the end, this event focuses on YOUR resourcefulness, ingenuity, and creativity.

4 comments
Jim Salem • over 9 years ago
Here are some more from a GIS friend of mine:
Massachusetts Ocean Resource Information System
http://www.mass.gov/eea/agencies/czm/program-areas/mapping-and-data-management/moris/
National:
http://marinecadastre.gov/
https://ioos.noaa.gov/data/
Global:
http://www.esri.com/industries/oceans
http://resources.arcgis.com/en/communities/oceans/
http://shipmaps.exactearth.com/
Bill Ostaski • over 9 years ago
I want to get an understanding of the data that will be accumulated from Gloucester Innovation's sensors. What type of data will be collected? How often will it be collected (batch vs streaming)? Then I can determine which Hadoop tools will be best suited for analyzing the data.
The Hadoop ecosystem is typically deployed across a cluster of hundreds to thousands of commodity servers, but I have an Oracle VM on my Win10 box running CentOS with a single-node Hadoop setup that can be used for development. I can download a couple of GB to it from the dataset(s) posted above; I just need to know which dataset(s) are most closely aligned with the actual data that we will see from Gloucester Innovation's sensors.
Kevin Urban • over 9 years ago
The Ocean Data Portal seems to be sound for quasi-real-time feeds:
http://www.oceandataportal.net/portal/
Bill Ostaski • over 9 years ago
Here's a Global Temperature and Salinity Profile Programme (GTSPP) dataset:
http://coastwatch.pfeg.noaa.gov/erddap/tabledap/erdGtsppBest.html?trajectory,org,type,platform,cruise&distinct()
and more locally focused, a Bottom Sediments of Georges Bank dataset:
http://woodshole.er.usgs.gov/openfile/of03-001/data/seddata/wigley61/wigley61.zip
Think I'll chew on these for a while.
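For anyone who wants to pull the GTSPP table into code rather than browse it, here is a minimal Python sketch. It assumes the ERDDAP server still answers at the address above and that swapping the `.html` extension for `.csv` returns the same query as CSV, with column names on line 1 and units on line 2. The helper names `build_tabledap_url` and `parse_erddap_csv` are made up for this example and are not part of any library.

```python
import csv
import io
from urllib.parse import quote

# Base address taken from the GTSPP link above; it may have moved since.
ERDDAP_BASE = "http://coastwatch.pfeg.noaa.gov/erddap/tabledap"


def build_tabledap_url(dataset_id, variables, constraints=(), file_type="csv"):
    """Build a tabledap request URL (hypothetical helper for illustration)."""
    query = ",".join(variables)
    for constraint in constraints:
        # Percent-encode characters such as < and >, but keep = and () literal.
        query += "&" + quote(constraint, safe="=()")
    return f"{ERDDAP_BASE}/{dataset_id}.{file_type}?{query}"


def parse_erddap_csv(text):
    """Split ERDDAP-style CSV text into a units mapping and a list of row dicts.

    Assumes line 1 holds column names and line 2 holds units.
    """
    reader = csv.reader(io.StringIO(text))
    names = next(reader)
    units = next(reader)
    rows = [dict(zip(names, row)) for row in reader if row]
    return dict(zip(names, units)), rows


# Same query as the link above, requested as CSV instead of an HTML page.
url = build_tabledap_url(
    "erdGtsppBest",
    ["trajectory", "org", "type", "platform", "cruise"],
    ["distinct()"],
)
```

To actually download, something like `urllib.request.urlopen(url).read().decode("iso-8859-1")` can be passed to `parse_erddap_csv`. Adding a constraint such as `"time>=2010-01-01"` to the list should narrow the result before it ever reaches your Hadoop box, though check the dataset's variable list first.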