BK: Data mining, Chapter 2 - getting to know your data
Why: real-world data are typically noisy, enormous in volume, and may originate from a hodgepodge of heterogeneous sources.
mean; median; mode (most common value); distribution
Knowing such basic statistics regarding each attribute
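
Example: a minimal sketch of how the statistics listed above (mean, median, mode, distribution) could be computed for a single attribute. It uses only Python's standard library. The function name `summarize` and the sample `ages` values are hypothetical, made up for illustration, and are not taken from the book.

```python
from collections import Counter
import statistics


def summarize(values):
    """Return basic descriptive statistics for one numeric attribute."""
    return {
        "mean": statistics.mean(values),          # arithmetic average
        "median": statistics.median(values),      # middle value after sorting
        "modes": statistics.multimode(values),    # most common value(s); may be several
        "distribution": dict(Counter(values)),    # frequency count of each distinct value
    }


# Hypothetical attribute values, invented for this example only.
ages = [23, 25, 25, 31, 46]
summary = summarize(ages)

for name, value in summary.items():
    print(f"{name}: {value}")
```

`statistics.multimode` is used rather than `statistics.mode` because an attribute can have more than one most common value, and the list it returns makes that case visible.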