51、tf-idf值提取关键词
2024-09-03 00:03:57
import testWord2vec2 as tw
import tensorflow_util as tu
import numpy as np
model = tw.load_model()
namelist = tw.loadNameList()
import jieba
namelist1 = []
for name in namelist:
seg_list = jieba.cut(name)
temp_name = " ".join(seg_list)
namelist1.append(temp_name) from sklearn import feature_extraction
from sklearn.feature_extraction.text import TfidfTransformer
from sklearn.feature_extraction.text import CountVectorizer
vectorizer=CountVectorizer()
transformer=TfidfTransformer()
tfidf=transformer.fit_transform(vectorizer.fit_transform(namelist1))
word=vectorizer.get_feature_names()
weight=tfidf.toarray()
keyword = [];
for i in range(len(word)):
wei = weight[i,:]
re = np.where(wei == np.max(wei))
print(word[re[0][0]],":",wei[re[0][0]])
最新文章
- BZOJ3772: 精神污染
- 疑难杂症rendering(对角线上的线)
- [有错误]堆排序的实现 java
- Play!framework 项目部署到Tomcat
- Ubuntu 16.04 安装 VMware Tools(解决windows和Ubuntu之间不能互相复制粘贴文件的问题)
- GNU Wget 1.19.1 static built on mingw32
- Python的ctypes 和pyinstaller
- Linux下系统时间函数、DST等相关问题总结(转)
- docker swarm英文文档学习-9-使用Docker Configs存储配置数据
- xss 学习记录
- ORA-01034: ORACLE not available ORA-27101
- A Spectral Technique for Correspondence Problems Using Pairwise Constraints
- 添加vscode自定义代码块
- 从MySQL开发规范处看创业
- python 随机整数
- python进阶之py文件内置属性
- day1 作业二:多级菜单操作
- JQuery对象与javascript对象的转换
- 彻底的卸载SQL Server2005
- 第一步:Java开发环境的配置