Others

85 Datasets

Datasets


CalIt2 Building People Counts

Observations come from 2 data streams (people flow in and out of the building), over 15 weeks, 48 time slices per day (half hour count aggregates). T...

time-series, multivariate

Japanese Vowels

The data was collected for examining our newly developed classifier for multidimensional curves (multidimensional time series). Nine male speakers utter...

time-series, multivariate, classification

KDD Cup 1998 Data

Please see associated text files in the download folder.

regression, multivariate

Movie

The data is stored in relational form across several files. The central file (MAIN) is a list of movies, each with a unique identifier. These identifier...

multivariate, relational

NSF Research Award Abstract...

The abstracts, one per file, were furnished by the NSF (National Science Foundation). A sample abstract is shown in the next section. The bag-of-word d...

text

Pseudo Periodic Synthetic T...

This data set is designed for testing indexing schemes in time series databases. It is a much larger dataset than has been used in any published study (...

time-series, univariate

Reuters-21578 Text Categori...

From the original readme file (please consult it for more information): ------------------------- The documents in the Reuters-21578 collection appeared...

text, classification

Synthetic Control Chart Tim...

This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are six different c...

time-series, classification, clustering

Statlog (Image Segmentation)

The instances were drawn randomly from a database of 7 outdoor images. The images were handsegmented to create a classification for every pixel. Eac...

multivariate, classification

Statlog (Vehicle Silhouettes)

The purpose is to classify a given silhouette as one of four types of vehicle, using a set of features extracted from the silhouette. The vehicle may b...

multivariate, classification