要求:

1.读取文件;

2.记录出现的词汇及出现频率;

3.按照频率降序排列;

4.输出结果。

概要:

1.读取的文件路径是默认的,为了方便调试,将要统计的文章、段落复制到文本中即可;
2.只支持英文;
3.会按照词汇出现的频率降序排列。

实现:

1.使用FileReader、BufferedReader读取文件;

2.采用StringTokenizer进行字符分割;

3.用hashmap保存统计数据;

4.自定义一个类用来实现按value排序;

5.输出结果。

默认路径文件:

         String filename = "E:/Test.txt";

         FileReader fk = new FileReader(filename);
BufferedReader br = new BufferedReader(fk);

统计词频:

         String s;
while((s = br.readLine()) != null) {
file += s; //读出整篇文章,存入String类的file中。
} StringTokenizer st = new StringTokenizer(file," ,.!?\"'"); //用于切分字符串 while(st.hasMoreTokens()) {
String word = st.nextToken();
if(hm.get(word) != null) {
int value = ((Integer)hm.get(word)).intValue();
value++;
hm.put(word, new Integer(value));
}
else {
hm.put(word, new Integer(1));
}
}

排序类:

 import java.util.Comparator;
import java.util.TreeMap; public class ByValueComparator implements Comparator<String> {
TreeMap<String, Integer> treemap;
public ByValueComparator(TreeMap<String, Integer> tm) {
this.treemap = tm;
} @Override
public int compare(String o1, String o2) {
// TODO Auto-generated method stub
if(!treemap.containsKey(o1) || !treemap.containsKey(o2)) {
return 0;
}
if(treemap.get(o1) < treemap.get(o2)) {
return 1;
} else if(treemap.get(o1) == treemap.get(o2)) {
return 0;
} else {
return -1;
}
}
}

输出结果:

        TreeMap tm = new TreeMap(hm);

        ByValueComparator bvc = new ByValueComparator(tm);
List<String> ll = new ArrayList<String>(tm.keySet());
Collections.sort(ll, bvc);
for(String str:ll){
System.out.println(str+"——"+tm.get(str));
}

实例验证:

There are moments in life when you miss someone so much that you just want to pick them from your dreams and hug them for real! Dream what you want to dream;go where you want to go;be what you want to be,because you have only one life and one chance to do all the things you want to do.
May you have enough happiness to make you sweet,enough trials to make you strong,enough sorrow to keep you human,enough hope to make you happy? Always put yourself in others’shoes.If you feel that it hurts you,it probably hurts the other person, too.
The happiest of people don’t necessarily have the best of everything;they just make the most of everything that comes along their way.Happiness lies for those who cry,those who hurt, those who have searched,and those who have tried,for only they can appreciate the importance of people
who have touched their lives.Love begins with a smile,grows with a kiss and ends with a tear.The brightest future will always be based on a forgotten past, you can’t go on well in lifeuntil you let go of your past failures and heartaches.
When you were born,you were crying and everyone around you was smiling.Live your life so that when you die,you're the one who is smiling and everyone around you is crying.
Please send this message to those people who mean something to you,to those who have touched your life in one way or another,to those who make you smile when you really need it,to those that make you see the brighter side of things when you are really down,to those who you want to let them know that you appreciate their friendship.And if you don’t, don’t worry,nothing bad will happen to you,you will just miss out on the opportunity to brighten someone’s day with this message.

  结果:

you——32
to——19
who——9
those——9
the——8
have——7
and——7
of——6
make——6
that——6
want——6
your——4
with——4
when——4
one——4
life——4
a——4
in——4
enough——4
for——3
don’t——3
just——3
it——3
on——3
them——3
their——3
will——3
what——2
were——2
way——2
touched——2
this——2
things——2
so——2
smiling——2
smile——2
really——2
people——2
past——2
only——2
miss——2
message——2
let——2
is——2
hurts——2
go——2
everyone——2
do——2
crying——2
be——2
around——2
are——2
appreciate——2
The——2
another——1
always——1
along——1
all——1
When——1
There——1
Please——1
May——1
Love——1
Live——1
If——1
Happiness——1
Dream——1
And——1
Always——1
die——1
day——1
cry——1
comes——1
chance——1
can’t——1
can——1
brightest——1
brighter——1
brighten——1
born——1
best——1
begins——1
because——1
based——1
bad——1
happen——1
grows——1
go;be——1
future——1
from——1
friendship——1
forgotten——1
feel——1
failures——1
everything;they——1
everything——1
ends——1
dreams——1
dream;go——1
down——1
know——1
kiss——1
keep——1
importance——1
if——1
hurt——1
human——1
hug——1
hope——1
heartaches——1
happy——1
happiness——1
happiest——1
or——1
opportunity——1
nothing——1
need——1
necessarily——1
much——1
most——1
moments——1
mean——1
lives——1
lifeuntil——1
lies——1
side——1
send——1
see——1
searched——1
real——1
re——1
put——1
probably——1
pick——1
person——1
peoplewho——1
out——1
others’shoes——1
other——1
tried——1
trials——1
too——1
they——1
tear——1
sweet——1
strong——1
sorrow——1
something——1
someone’s——1
someone——1
yourself——1
worry——1
where——1
well——1
was——1

  

代码地址:https://coding.net/u/regretless/p/WordFrequencyCount/git

最新文章

  1. remove 清除binlog
  2. js替换字符串问题
  3. C# Winform 文件编码批量转换工具
  4. 常用的一些webshell木马官方后门
  5. init()和deinit()
  6. 【转载】linux中互斥尽量用mutex,不用semaphore
  7. 关于android listview去掉分割线
  8. Unable to automatically debug &quot;XXXXX“
  9. Codeforces2B - The least round way(DP)
  10. Python 全栈开发 -- 监控篇
  11. 开发环境----&gt;服务器(数据库迁移Migration)
  12. SharePoint Framework解决方案管理参考(二)
  13. 【MySql】启动/停止
  14. Shell命令解析
  15. DDD领域模型系统的工作流(十四)
  16. 微软BI 之SSIS 系列 - Lookup 中的字符串比较大小写处理 Case Sensitive or Insensitive
  17. 如何在servlet中获取spring创建的bean
  18. linux shell 学习笔记01
  19. IE8 frameset SESSION丢失
  20. 08JavaScript对象

热门文章

  1. 网站漏洞修复案例之Discuz!3.4最新版本
  2. 20155204 实验3《敏捷开发与XP实践》实验报告
  3. 201552-53 《Java程序设计》第三周问题汇总
  4. Java第五次实验报告
  5. mycp 补交作业
  6. day3 RHCE
  7. ADO.NET操作MySQL数据库
  8. python简介、第一个python程序、变量、字符编码、用户交互程序、if...else、while、for
  9. Linux 优化详解
  10. AsciiPic Java视频转成字符画