I am new to data mining. From what I understand, most techniques are intended to be used with large data sets, but I am curious to know whether this is a must or just a general rule. In other words, is it OK to use data mining techniques on small data sets? Most examples work on small tables, but are there any limitations? Why?
Most data mining techniques are statistical approaches.
To get significant patterns, you need enough data. Otherwise anything you measure may just as well be a random deviation due to chance. The more data you have, the more reliable your patterns can be.
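To make the "random deviations" point concrete, here is a minimal sketch (my own illustration, not a standard benchmark). It draws pairs of completely unrelated random variables and records the strongest correlation that shows up purely by chance. The function names, the 200 trials, and the sample sizes 10 and 2000 are all arbitrary choices for this example.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def max_spurious_correlation(n_samples, n_trials=200, seed=0):
    """Largest |r| found between pairs of independent noise variables.

    Both variables are drawn independently, so the true correlation is 0
    and any non-zero value is due to chance alone.
    """
    rng = random.Random(seed)
    best = 0.0
    for _ in range(n_trials):
        xs = [rng.gauss(0, 1) for _ in range(n_samples)]
        ys = [rng.gauss(0, 1) for _ in range(n_samples)]
        best = max(best, abs(pearson(xs, ys)))
    return best


small = max_spurious_correlation(10)    # tiny table
large = max_spurious_correlation(2000)  # a few thousand rows
print(f"strongest chance correlation, n=10:   {small:.2f}")
print(f"strongest chance correlation, n=2000: {large:.2f}")
```

With only 10 rows, some of the noise pairs should look strongly correlated, while with 2000 rows the strongest chance correlation should stay close to zero. That is the risk with small data sets: if you test many candidate patterns, some will look convincing even though nothing is there.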
But most data isn't "big" in the sense of "big data", and a lot of methods would not scale to really big data sets anyway. In most cases, you only have a few thousand records (not a few exabytes of data), in particular after preprocessing the data into the desired format.