GeoSOM Repository

 

 

In this site you will find a repository of geospatial datasets with special purpose for spatial data mining.

Please feel free to download them…

 

Data Set Name

Number of Instances

Non-spatial attributes

Date

Description

Author

Download

Preview

Squareville

100

1

March 2009

Squareville is an artificial dataset purposely created to use in GeoSOM suite. Squareville is a small-town with square boundaries and an area of 10000 m2.  Squareville has 100 houses evenly spaced with coordinates x in [5, 95] and y in [5, 95]. For each house we know the average salary, which is s in  [900 1000] for 35≤x≤65 and s in [0 100]

Victor Lobo

csv file

shp file

mat file

4 corners

5000

1

March 2009

The points follow a uniform distribution in the geographical coordinate, within the rectangle limited by [(0,0),(20,5)]. In the non-geographical dimension there are three zones of high spatial autocorrelation, where the values of z are very similar among neighbouring points, with a uniform in [90,91] in two zones and [10,11] in another. There is also one area of ‘‘negative autocorrelation’’, where half the data points have z==0 and the other half have z==90. In the rest of the input space z has a uniform distribution in [0,100]. 

Victor Lobo

csv file

shp file

mat file

4corners.jpg

Lisbon_aerial_data

16936

6

March 2009

This dataset is based on a satellite image, relative to Lisbon’s area, from Landsat 5 TM. The image was converted to a small resolution (100x100 m) and a point shapefile was created using the image pixels. Associated to each point are the 6 bands values. 

Roberto Henriques

csv file

shp file

 Lisbon_aerial_data.jpg

Sado_estuary

153

3

March 2009

 Real dataset about sedimentation on Sado Estuary

Sandra Caeiro

csv file

shp file

 Sado_estuary.JPG

Fish in Somland

225

5

March 2009

Artificial dataset about several different fish species monitored at 225 sample points

...

csv file

shp file

FishSomland.JPG