Letter  Frequency (%)
a       6.2193%
á       2.2355%
b       1.5582%
c       1.6067%
č       0.9490%
d       3.6019%
ď       0.0222%
e       7.6952%
é       1.3346%
ě       1.6453%
f       0.2732%
g       0.2729%
h       1.2712%
ch      1.1709%
i       4.3528%
í       3.2699%
j       2.1194%
k       3.7367%
l       3.8424%
m       3.2267%
n       6.5353%
ň       0.0814%
o       8.6664%
ó       0.0313%
p       3.4127%
q       0.0013%
r       3.6970%
ř       1.2166%
s       4.5160%
š       0.8052%
t       5.7268%
ť       0.0426%
u       3.1443%
ú       0.1031%
ů       0.6948%
v       4.6616%
w       0.0088%
x       0.0755%
y       1.9093%
ý       1.0721%
z       2.1987%
ž       0.9952%
Figure: Relative letter frequencies (%), plotted per letter (a to ž) on a frequency axis from 0% to 8%.

Bigraphs

ST, PR, SK, CH, DN, TR

Trigraphs

PRO, UNI, OST, STA, ANI, OVA, YCH, STI, PRI, PRE, OJE, REN, IST, STR, EHO, TER, RED, ICH
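The source does not say how the bigraph and trigraph lists were obtained. As a rough sketch only, runs of n consecutive letters could be counted as shown here. The class name NGramCounter, its methods, and the sample text are hypothetical, and the choice to count only inside words (never across spaces or punctuation) is an assumption.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of the original article.
final class NGramCounter {

    /** Counts every run of n consecutive letters inside each word of the text. */
    static Map<String, Integer> countNGrams(String text, int n) {
        Map<String, Integer> counts = new TreeMap<>();
        // Upper-case first so that "Pro" and "pro" are counted together.
        String upper = text.toUpperCase(Locale.ROOT);
        StringBuilder word = new StringBuilder();
        for (int i = 0; i <= upper.length(); i++) {
            // A trailing space acts as a sentinel that flushes the last word.
            char c = i < upper.length() ? upper.charAt(i) : ' ';
            if (Character.isLetter(c)) {
                word.append(c);
                continue;
            }
            // A non-letter ends the current word: slide a window of size n over it.
            for (int j = 0; j + n <= word.length(); j++) {
                counts.merge(word.substring(j, j + n), 1, Integer::sum);
            }
            word.setLength(0);
        }
        return counts;
    }

    /** Returns the k most frequent n-grams, most frequent first. */
    static List<String> top(Map<String, Integer> counts, int k) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort(Map.Entry.<String, Integer>comparingByValue().reversed());
        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(k, entries.size()); i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }

    public static void main(String[] args) {
        // Sample text chosen for illustration only.
        Map<String, Integer> bigraphs = countNGrams("prostor pro strom", 2);
        System.out.println(top(bigraphs, 3));
    }
}
```

Calling countNGrams with n = 3 gives trigraph counts in the same way. Results depend heavily on the corpus, so a small sample will not reproduce the lists above.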


Code

import java.io.BufferedReader;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.util.Map;
import java.util.TreeMap;

/**
 * Prints out frequencies of input characters (in percent).
 * Every character is counted, including spaces and punctuation.
 * @param source input file
 * @param encoding encoding of the file
 */
public static void count(File source, String encoding) throws IOException {
    // Sorted map from character to its number of occurrences
    Map<Character, Integer> occurrences = new TreeMap<>();
    int counter = 0; // total number of characters read

    // try-with-resources closes the reader even if reading fails
    try (BufferedReader reader = new BufferedReader(
            new InputStreamReader(new FileInputStream(source), encoding))) {
        String s;
        while ((s = reader.readLine()) != null) {
            for (int i = 0; i < s.length(); i++) {
                counter++;
                // Insert 1 for a new character, otherwise add 1 to its count
                occurrences.merge(s.charAt(i), 1, Integer::sum);
            }
        }
    }

    for (Map.Entry<Character, Integer> e : occurrences.entrySet()) {
        System.out.println(e.getKey() + ": " + (e.getValue() / (double) counter * 100));
    }
}
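
The method above counts single characters, so it cannot produce the "ch" row of the table, and it treats upper and lower case separately. A possible variant that counts "ch" as one letter is sketched below. The class name CzechLetterFrequency and the sample text are hypothetical, and the sketch assumes that every "c" directly followed by "h" forms the digraph and that non-letters are ignored. This may differ from how the table was actually computed.

```java
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of the original article.
final class CzechLetterFrequency {

    /** Returns the share of each letter in percent, counting "ch" as one letter. */
    static Map<String, Double> frequencies(String text) {
        Map<String, Integer> counts = new TreeMap<>();
        String lower = text.toLowerCase(Locale.ROOT);
        int total = 0; // number of letters, with each "ch" counted once
        for (int i = 0; i < lower.length(); i++) {
            char c = lower.charAt(i);
            if (!Character.isLetter(c)) {
                continue; // skip spaces, digits and punctuation
            }
            String letter = String.valueOf(c);
            // Assumption: every "c" directly followed by "h" forms the digraph "ch".
            if (c == 'c' && i + 1 < lower.length() && lower.charAt(i + 1) == 'h') {
                letter = "ch";
                i++; // consume the 'h' as part of the digraph
            }
            counts.merge(letter, 1, Integer::sum);
            total++;
        }
        // Convert raw counts to percentages of all letters seen.
        Map<String, Double> result = new TreeMap<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            result.put(e.getKey(), 100.0 * e.getValue() / total);
        }
        return result;
    }

    public static void main(String[] args) {
        // Sample text chosen for illustration only.
        frequencies("Chci chleba").forEach(
                (letter, percent) -> System.out.printf("%s: %.4f%%%n", letter, percent));
    }
}
```

The TreeMap orders keys by Unicode code point, not by Czech alphabetical order, so accented letters and "ch" appear in a different position than in the table. A java.text.Collator for the Czech locale could be passed to the TreeMap if alphabetical output matters.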

Sources

  • KRÁLÍK, Jan. Czech Alphabet. The Czech Language [online]. 2001 [cit. 2012-09-18]. Available at WWW: http://www.czech-language.cz/alphabet/alph-prehled.html







       
 
