<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>experiment, three &#187; phishing</title>
	<atom:link href="http://experimenthree.wordpress.com/tag/phishing/feed/" rel="self" type="application/rss+xml" />
	<link>http://experimenthree.wordpress.com</link>
	<description>The blog you couldn't live without</description>
	<lastBuildDate>Thu, 05 Nov 2009 09:16:26 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='experimenthree.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/7dd08251b23684cd89b03d5604fc5953?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>experiment, three &#187; phishing</title>
		<link>http://experimenthree.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://experimenthree.wordpress.com/osd.xml" title="experiment, three" />
		<item>
		<title>A long day with Unicode</title>
		<link>http://experimenthree.wordpress.com/2008/09/23/a-longday-with-unicode/</link>
		<comments>http://experimenthree.wordpress.com/2008/09/23/a-longday-with-unicode/#comments</comments>
		<pubDate>Tue, 23 Sep 2008 22:07:39 +0000</pubDate>
		<dc:creator>alezzandro</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[idn]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[phishing]]></category>
		<category><![CDATA[report]]></category>
		<category><![CDATA[typosquatting]]></category>
		<category><![CDATA[unicode]]></category>

		<guid isPermaLink="false">http://experimenthree.wordpress.com/?p=48</guid>
		<description><![CDATA[Last week I attended a training course on Unicode by Jukka K. Korpela. It was interesting, though the subject is&#8230; “tough”!  
Following are a few (absolutely non exhaustive) notes I took during the course:
Introduction
Computers just deal with numbers. They store letters and other characters by assigning a number for each one. Before Unicode was [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=experimenthree.wordpress.com&blog=4535657&post=48&subd=experimenthree&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Last week I attended a training course on Unicode by <a href="http://books.google.co.uk/books?id=PcWU2yxc8WkC">Jukka K. Korpela</a>. It was interesting, though the subject is&#8230; “tough”! <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p>Following are a few (absolutely non exhaustive) notes I took during the course:<strong></strong></p>
<p><strong>Introduction</strong></p>
<blockquote><p><em>Computers just deal with numbers. They store letters and other characters by assigning a number for each one. Before Unicode was invented, there were hundreds of different encoding systems for assigning these numbers. No single encoding could contain enough characters: for example, the European Union alone requires several different encodings to cover all its languages. Even for a single language like English no single encoding was adequate for all the letters, punctuation, and technical symbols in common use.</em> (from <a href="http://www.unicode.org/standard/WhatIsUnicode.html">What is Unicode</a>)</p></blockquote>
<p>Unicode is an international standard that, due to its complexity, is still not fully accepted. However, it is the default in several applications (e.g., XML applications).</p>
<p>Unicode is a “unified coding system” that contains more than 100,000 characters. It is dynamic, because new characters are continually added in order  include “all” possible human characters. From a theoretical point of view, we could say that it tries to preserve cultural diversity while giving a universal interpretation of all human languages (it is arguable whether it is successful on this: for example, some Chinese characters are still not part of Unicode).</p>
<p><strong>It’s about encoding, not fonts</strong><br />
There is an important difference between a font and the underlying encoding. Unicode is about “encoding glyphs”: given a sign representing a character in a human language, Unicode describes it univocally. Fonts, on the other hand, are a visual representation (a rendering) of those glyphs.</p>
<p>A font usually supports a small subset of Unicode. Western languages fonts, for example, do not support Chinese characters.</p>
<p>In general, only glyphs can be encoded, not abstract ideas. This simple concept has been and still is a matter of discussion whenever a new character needs to be included.</p>
<p><strong>About characters</strong></p>
<ul>
<li> Unicode is a 32-bits characters set. Each character has only one encoding, with some exceptions (compatibility reasons with older encoding systems)</li>
<li>Some characters are obtained as a composition from other characters. The accented letter “è”, for example, is a composition of two characters: è = e + ` (see <a href="http://www.fileformat.info/info/unicode/char/00e8/index.htm">here</a> for more details)</li>
<li>The name of a character is its identifier: it contains letters, numbers, spaces, hypens.</li>
</ul>
<p>A few definitions:</p>
<ul>
<li><strong>Code point</strong>. A value in the 32-bits space. Each char has a code point, not all code points are assigned to chars. This is the numeric representation (usually hexadecimal) of a character.</li>
<li><strong>Blocks</strong>. Blocks are groups of characters. The assignment to characters to blocks, however, seems a bit confusing: for example, there is a block called “Greek and Coptic”, but it does not include all Greek characters.</li>
<li><strong>Categories</strong>. Each character has a set of properties which can be used for classification. For example, a letter category is anything used to write words in any language. There are properties which distinguish the script to which a character belongs to. There is a math symbols category.</li>
<li><strong>Normalisation</strong>. Technique to translate a complex character in two or more simpler characters.</li>
<ul>
<li>In western languages it is usually used to remove diacritics (accents, etc.) by substituting them with apostrophes</li>
<li>Used for compatibility purposes (eg, to translate to ASCII)</li>
<li>It might create problems if the process has to be reversed</li>
</ul>
</ul>
<p><strong>Unicode in real life and a bit of IDNs</strong></p>
<p>Using Unicode might lead to lots of confusion and extra care should be used when dealing with it:</p>
<ul>
<li> <strong>ASCII punctuation</strong> is different from Unicode punctuation, for example when dealing with quotes: “ ‘ ’ ” &#8216; &#8221; but to many the difference is not clear</li>
<li>Certain <strong>characters are repeated</strong> in different scripts</li>
<ul>
<li>The Latin character A and the Cyrillic character A, for example, look/are the same.</li>
<li>Different sets of numbers are presents in different scripts</li>
</ul>
<li>Sometimes the <strong>same punctuation</strong> characters can be found in <strong>different scripts</strong> with different logical meaning (this is the case of math symbols)</li>
<li><strong>Compatibility characters</strong>: they are used to make Unicode compatible with older encodings, it is a very vague concept that may easily induce in confusion. For example, K (Kelvin symbol) is different from K (letter) but identical in their representation.</li>
<li>To make things worse, <strong>characters do not have a property that identify compatibility chars</strong>. People “know” which they are only from reading the big books containing the standards.</li>
</ul>
<p>We discussed a bit about the problem of Internationalised Domain Names (IDNs), which open the doors to <a href="http://en.wikipedia.org/wiki/Typosquatting">typosquatting</a> and <a href="http://en.wikipedia.org/wiki/Phishing">phishing</a>. One policy might be to <strong>disallow mixing different scripts</strong> when registering IDNs. In certain languages it is common practice to use characters or words from the Latin alphabet as part of the sentence and such a solution would constitute a big limitation.</p>
<p>A partial solution, which might work for the most common cases, is to allow mixing any script with the “common” Latin script.</p>
<p><strong>References</strong></p>
<ul>
<li><a href="http://www.unicode.org">www.unicode.org</a> (hard to read)</li>
<li><a href="http://www.fileformat.info">http://www.fileformat.info</a> (easier to read, you can look up for single characters and see what they are)</li>
<li><a href="http://www.icu-project.org">www.icu-project.org</a></li>
</ul>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/experimenthree.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/experimenthree.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/experimenthree.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/experimenthree.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/experimenthree.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/experimenthree.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/experimenthree.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/experimenthree.wordpress.com/48/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/experimenthree.wordpress.com/48/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/experimenthree.wordpress.com/48/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=experimenthree.wordpress.com&blog=4535657&post=48&subd=experimenthree&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://experimenthree.wordpress.com/2008/09/23/a-longday-with-unicode/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/8204b303bd6f96dd46394785d131ad2e?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">alezzandro</media:title>
		</media:content>
	</item>
		<item>
		<title>Italian TLD and malicious web sites</title>
		<link>http://experimenthree.wordpress.com/2008/06/12/italian-tld-and-malicious-web-sites/</link>
		<comments>http://experimenthree.wordpress.com/2008/06/12/italian-tld-and-malicious-web-sites/#comments</comments>
		<pubDate>Thu, 12 Jun 2008 04:43:16 +0000</pubDate>
		<dc:creator>alezzandro</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[dns]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[phishing]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[spam]]></category>
		<category><![CDATA[typosquatting]]></category>

		<guid isPermaLink="false">http://experimenthree.wordpress.com/?p=40</guid>
		<description><![CDATA[Mapping the Mal Web, Revisited (McAfee, June 4).
A new security report from McAfee has just been released on the spread of malicious web sites among different TLDs. Very informative and detailed, the report integrates last year report. Some of the key findings:

.ro (Romania) and .ru (Russia) are the most risky European TLDs, i.e., the probability [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=experimenthree.wordpress.com&blog=4535657&post=40&subd=experimenthree&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><span style="font-family:trebuchet ms;"><a href="http://us.mcafee.com/en-us/local/docs/Mapping_Mal_Web.pdf?cid=45044">Mapping the Mal Web, Revisited</a> (McAfee, June 4)</span>.</p>
<p><span style="font-family:trebuchet ms;">A new security report from McAfee has just been released on the spread of malicious web sites among different TLDs. Very informative and detailed, the report integrates last year report.</span> <span style="font-family:trebuchet ms;">Some of the key findings:</span></p>
<ul>
<li><span style="font-family:trebuchet ms;">.ro (Romania) and .ru (Russia) are the most risky European TLDs, i.e., the probability of finding a malicious web site is higher if surfing one of those TLDs.</span></li>
<li><span style="font-family:trebuchet ms;">Risk related to .biz (business) and .cn (China) is also increasing (if compared to last year)</span></li>
<li><span style="font-family:trebuchet ms;">.it (Italy) has worsened, but is still &#8220;a safe place&#8221;</span></li>
<li><span style="font-family:trebuchet ms;color:#ff0000;">.hk (Hong Kong) is the riskiest TLDs</span></li>
</ul>
<p><span style="font-family:trebuchet ms;">The “Hong Kong” case, in particular, is worth a closer attention:</span><span style="font-family:trebuchet ms;"> </span></p>
<blockquote><p><span style="font-style:italic;font-family:trebuchet ms;">Bonnie Chun, an official [from the .hk] TLD, acknowledged that they had made some decisions that inadvertently encouraged the scammers:</span><br />
<span style="font-style:italic;font-family:trebuchet ms;">1 . “We enhanced our domain registration online process thus making it more user-friendly. Instances include the capability for registering several domains at one time, auto-copying of administrative contact to technical contact and billing contact, etc. Phishers usually registered eight or more domains at one time. </span><br />
<span style="font-style:italic;font-family:trebuchet ms;">2 . We offered great domain registration discounts, such as buy-one, get-two domains.</span><br />
<span style="font-style:italic;font-family:trebuchet ms;">3 . Our overseas service partners promoted .hk domains in overseas markets.”</span></p></blockquote>
<p><span style="font-family:trebuchet ms;">In a <a href="http://esperimentotre.blogspot.com/2008/05/casi-di-phishing-in-aumento-in-uk.html">previous post</a> I talked about the recent <a href="http://blog.nominet.org.uk/tech/2008/05/23/recent-uk-phishing-activity/">increased phishing activity</a> in the .uk registry, which, in that particular case, has taken advantage from Nominet&#8217;s automatic <a href="http://www.nominet.org.uk/registrars/systems/auto/">registration process</a>.<br />
</span><br />
<span style="font-family:trebuchet ms;">Other country, other problem: the <a href="http://www.nic.it/">.it registry</a> will implement automatic registration procedures by the end of the year; and I read, a couple of weeks ago on <a href="http://www.blorigo.net/ricerca-e-business-italians-do-it-better">Swartzy’s blog</a>, that the IIT/CNR is also launching an advertisement campaign for .it domains.</span></p>
<p><span style="font-family:trebuchet ms;">I am curious to see if, in analogy to what happened in Hong Kong, we will see an increase of the malicious activity in the .it TLD.</span></p>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/experimenthree.wordpress.com/40/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/experimenthree.wordpress.com/40/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/experimenthree.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/experimenthree.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/experimenthree.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/experimenthree.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/experimenthree.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/experimenthree.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/experimenthree.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/experimenthree.wordpress.com/40/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/experimenthree.wordpress.com/40/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/experimenthree.wordpress.com/40/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=experimenthree.wordpress.com&blog=4535657&post=40&subd=experimenthree&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://experimenthree.wordpress.com/2008/06/12/italian-tld-and-malicious-web-sites/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/8204b303bd6f96dd46394785d131ad2e?s=96&#38;d=http%3A%2F%2F0.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">alezzandro</media:title>
		</media:content>
	</item>
	</channel>
</rss>