If you have any specific questions, comments, or suggestions regarding the contest, feel free to email us: datacontest@cs.ucsd.edu.
Software Resources
- Weka is an open-source collection of machine learning algorithms for data mining tasks, written in Java.
- R is an open-source computing environment and language with many statistical analysis functions and classifiers (such as neural networks and logistic regression).
- SVM Light is a free implementation of Support Vector Machines, written in C.
- Matlab resources:
  - Netlab is a free neural network package for Matlab written by Christopher Bishop.
  - Bayes Net Toolbox written by Kevin Murphy is a free Matlab package with a plethora of Bayesian and probabilistic algorithms.
Human Resources
Perhaps the best way to learn is through interaction with professors and other experts in the field. We encourage you to seek out these individuals and ask for help. Experts should be willing to provide guidance, but not direct support (e.g., actual code or processed data). The contest forum can also be useful for amateurs and experts alike.
If you have trouble locating faculty members involved in machine learning/data mining at your home university, contact us and we will try to help get you started.