Datasets
DDM Related Data Sets
UMass Trace Repository The UMass data mining archive.
General Data Sets
UCI KDD Data Mining Archive The UCI KDD data mining archive. Most of its datasets are centralized, but they can be partitioned across sites for distributed mining.
KDnuggets Data Mining Datasets Collection
MovieLens MovieLens is a web-based recommender system for movies. There are two datasets in this repository. It is maintained jointly by the University of Minnesota, Carnegie Mellon University, and the University of Michigan.
Enron E-mail Dataset A centralized dataset containing all the e-mails exchanged between 150 Enron employees.
KDD Cup KDD Cup is the annual Data Mining and Knowledge Discovery competition organized by the ACM Special Interest Group on Knowledge Discovery and Data Mining (ACM SIGKDD), the leading professional organization of data miners. The site allows registered users to download data on a variety of topics, ranging from quantum physics to biology.
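
Several of the general data sets above are centralized, so they have to be split across sites before they can be used for distributed mining. The following is a minimal sketch of one way to do that, a random horizontal partition in which every site receives a disjoint subset of the rows. The function name partition_rows, the num_sites and seed parameters, and the toy rows are illustrative assumptions and do not come from any of the listed archives.

```python
import random

def partition_rows(rows, num_sites, seed=0):
    """Split rows into num_sites disjoint horizontal partitions.

    Hypothetical helper for illustration only; not part of any listed archive.
    """
    rng = random.Random(seed)      # fixed seed so the split is reproducible
    shuffled = list(rows)          # copy so the caller's data is untouched
    rng.shuffle(shuffled)
    # Deal the shuffled rows round-robin; partition sizes differ by at most one.
    return [shuffled[i::num_sites] for i in range(num_sites)]

# Toy stand-in for a centralized dataset: (record id, class label) pairs.
rows = [(i, i % 3) for i in range(10)]
parts = partition_rows(rows, num_sites=3)

for site, part in enumerate(parts):
    print(f"site {site}: {len(part)} rows")
```

A random split gives every site roughly the same data distribution. Experiments that need skewed or non-overlapping sites would instead assign rows by an attribute value, and a vertical partition would split the columns rather than the rows.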