normalize vectors
In data mining tasks is the attributes normalizing usually a prerequisite in order to get a meaningful result. For example, if you want to calculate the euclidean distance, the attributes with relative large numerical values will have more influence of…
Solve the path problem in eclipse pydev
If you copy a bundle of python codes into eclipse, usually the pydev can not find the paths. In this case, you have 2 two ways to deal with this problem. right click the folder -> pydev -> Set as…
Different ways to calculate the euclidean distance in python
There are already many ways to do the euclidean distance in python, you don’t need to do it actually. But it is a very good exercise for programming as long as you do it by yourself. summary: no numpy numpy.dot(vector.T,…
Useful links for Git
git Reference help.github Pro Git (free ebook) GitLab codeschool course for Git Tutorialspoint for Git Egit Icon Decorations
Useful Git Commands
$ git init #initialize an empty Git repoistory $ git status #see the current state of the project untracked file: it is a new file and not be tracked by Git staged:Files are ready to be committed. unstaged: Files with changes that…
Vector calculation in python and in R
There are different ways doing math for vector in python 1. use loop 2. use loop, but in list 3. use numpy 4. use map and lambda functions In R language, the vector math is relative easy Alternative for creating the…