In the latest issue of IEEE Transactions on Dependable and Secure Computing, October-December 2006 (Vol. 3, No. 4), an interesting article titled “Detecting Phishing Web Pages with Visual Similarity Assessment Based on Earth Mover’s Distance (EMD)” has been published. The paper’s abstract says:
An effective approach to phishing Web page detection is proposed, which uses Earth Mover’s Distance (EMD) to measure Web page visual similarity. We first convert the involved Web pages into low resolution images and then use color and coordinate features to represent the image signatures. We use EMD to calculate the signature distances of the images of the Web pages. We train an EMD threshold vector for classifying a Web page as a phishing or a normal one. Large-scale experiments with 10,281 suspected Web pages are carried out to show high classification precision, phishing recall, and applicable time performance for online enterprise solution. We also compare our method with two others to manifest its advantage. We also built up a real system which is already used online and it has caught many real phishing cases.
It is really worth reading. You can find the article here. Note: a subscription is required to read the full article.
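To make the pipeline in the abstract more concrete, here is a minimal Python sketch of the general idea: build a signature from color and coordinate features of a low-resolution image, then compare two signatures with EMD. It is not the authors' implementation. The function names, the color quantization into `levels` steps per channel, the normalized Euclidean ground distance, and the single `threshold` value are all assumptions made for illustration (the paper trains a threshold vector, which is not reproduced here). The sketch also assumes both images have the same number of pixels, and it takes the image as a flat row-major list of `(r, g, b)` tuples, so the page rendering and downscaling step is left out.

```python
from collections import defaultdict
from math import inf, sqrt


def signature(pixels, width, height, levels=4):
    """Return [((r, g, b, cx, cy), pixel_count), ...] for a low-res image.

    Colors are quantized to `levels` steps per channel (assumed to divide 256);
    (cx, cy) is the centroid of each quantized color, normalized to [0, 1].
    """
    step = 256 // levels
    acc = defaultdict(lambda: [0, 0.0, 0.0])  # color -> [count, sum_x, sum_y]
    for i, (r, g, b) in enumerate(pixels):
        entry = acc[(r // step, g // step, b // step)]
        entry[0] += 1
        entry[1] += (i % width) / max(width - 1, 1)
        entry[2] += (i // width) / max(height - 1, 1)
    scale = max(levels - 1, 1)
    return [((c[0] / scale, c[1] / scale, c[2] / scale, sx / n, sy / n), n)
            for c, (n, sx, sy) in sorted(acc.items())]


def ground_distance(fa, fb):
    """Euclidean distance over the 5 features, scaled to [0, 1] (an assumption)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(fa, fb))) / sqrt(len(fa))


def emd(sig_a, sig_b):
    """EMD for signatures of equal integer total weight, via min-cost flow."""
    m, n = len(sig_a), len(sig_b)
    source, sink, size = m + n, m + n + 1, m + n + 2
    total = sum(w for _, w in sig_a)
    if total != sum(w for _, w in sig_b):
        raise ValueError("this sketch assumes equal pixel counts")
    edges = []  # [head, capacity, cost]; edge e and e ^ 1 are a residual pair
    graph = [[] for _ in range(size)]

    def add_edge(u, v, cap, cost):
        graph[u].append(len(edges)); edges.append([v, cap, cost])
        graph[v].append(len(edges)); edges.append([u, 0, -cost])

    for i, (fa, wa) in enumerate(sig_a):
        add_edge(source, i, wa, 0.0)
        for j, (fb, _) in enumerate(sig_b):
            add_edge(i, m + j, wa, ground_distance(fa, fb))
    for j, (_, wb) in enumerate(sig_b):
        add_edge(m + j, sink, wb, 0.0)

    flow, cost = 0, 0.0
    while flow < total:
        # Bellman-Ford: cheapest augmenting path in the residual graph.
        dist, prev = [inf] * size, [-1] * size
        dist[source] = 0.0
        for _ in range(size - 1):
            changed = False
            for u in range(size):
                if dist[u] == inf:
                    continue
                for e in graph[u]:
                    v, cap, c = edges[e]
                    if cap > 0 and dist[u] + c < dist[v] - 1e-12:
                        dist[v], prev[v], changed = dist[u] + c, e, True
            if not changed:
                break
        # Find the bottleneck along the path, then push that much flow.
        push, v = total - flow, sink
        while v != source:
            push = min(push, edges[prev[v]][1])
            v = edges[prev[v] ^ 1][0]
        v = sink
        while v != source:
            edges[prev[v]][1] -= push
            edges[prev[v] ^ 1][1] += push
            v = edges[prev[v] ^ 1][0]
        flow += push
        cost += push * dist[sink]
    return cost / total


def looks_like_target(suspect_sig, target_sig, threshold=0.1):
    """Flag a page whose signature is within a hypothetical EMD threshold."""
    return emd(suspect_sig, target_sig) <= threshold
```

A page would be flagged when its signature is close to that of a protected page, for example `looks_like_target(signature(suspect, w, h), signature(bank, w, h))`. The min-cost flow solver here is only practical for very small images; for real use, an optimized EMD routine from a numerical library would be a better fit.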