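An intuitive view of policy documents
- Need: how can policy documents such as the government work report be grasped at a glance?
- Seeing the value of visualization: generate a word cloud, then refine it
Government work reports and similar documents –> word clouds that present them effectively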
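《决胜全面建成小康社会 夺取新时代中国特色社会主义伟大胜利》(report at the 19th National Congress, 在中国共…第十九次全国代表大会上的报告, October 18, 2017). Speaker: Xi Jinping. For the text of 《新时代中国特色社会主义》, contact the author on WeChat: nickchen121
《中共中央 国务院关于实施乡村振兴战略的意见》(the 2018 No. 1 Central Document, January 2, 2018). Issued by 中共中央 and 国务院 (the CPC Central Committee and the State Council). For the text of 《关于实施乡村振兴战略的意见》, contact the author on WeChat: nickchen121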
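Basic approach
- Step 1: read the file, then segment and tidy the text
- Step 2: configure and output the word cloud
- Step 3: inspect the result, then refine and iterate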
| |
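| import jieba |
| import wordcloud |
| |
| # Step 1: read the report text |
| with open("新时代中国特色社会主义.txt", "r", encoding="utf-8") as f: |
|     t = f.read() |
| |
| # Segment the Chinese text and join the tokens with spaces so WordCloud can split them |
| ls = jieba.lcut(t) |
| txt = " ".join(ls) |
| |
| # Step 2: font_path must point to a font with Chinese glyphs; this path is macOS-specific |
| w = wordcloud.WordCloud(font_path="/Library/Fonts/Heiti.ttc", width=1000, height=700, background_color="white") |
| w.generate(txt) |
| w.to_file("grwordcloud.png") |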
| |
| |
| import jieba |
| import wordcloud |
| |
| # Same pipeline, applied to the rural revitalization document |
| with open("关于实施乡村振兴战略的意见.txt", "r", encoding="utf-8") as f: |
|     t = f.read() |
| |
| ls = jieba.lcut(t) |
| txt = " ".join(ls) |
| |
| w = wordcloud.WordCloud(font_path="/Library/Fonts/Heiti.ttc", width=1000, height=700, background_color="white") |
| w.generate(txt) |
| # Note: this writes to the same file name as the previous script and overwrites it |
| w.to_file("grwordcloud.png") |
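Step 3 (inspect the result, then iterate) usually starts with the token list: single-character tokens and punctuation tend to crowd the first image. The sketch below is one possible way to filter the output of `jieba.lcut` and count what remains. The helper name `filter_tokens`, the sample tokens, and the sample stopword set are illustrative assumptions, not part of the original scripts.

```python
from collections import Counter


def filter_tokens(tokens, stopwords=frozenset(), min_length=2):
    """Drop tokens shorter than min_length (after stripping) and tokens in stopwords."""
    kept = []
    for tok in tokens:
        tok = tok.strip()
        if len(tok) < min_length or tok in stopwords:
            continue
        kept.append(tok)
    return kept


# Hand-written stand-in for the list returned by jieba.lcut(t)
sample_tokens = ["发展", "的", "中国", "和", "发展", "人民", ",", "我们", "中国", "发展"]
# Hypothetical stopword set; adjust it after looking at each generated image
sample_stopwords = {"我们"}

kept = filter_tokens(sample_tokens, sample_stopwords)
freq = Counter(kept)

# The filtered tokens are joined the same way as in the scripts above
txt = " ".join(kept)
```

In a real run, `sample_tokens` would be replaced by `jieba.lcut(t)`, and `txt` would be passed to `w.generate(txt)` as before. Printing `freq.most_common(20)` is a quick way to decide which words to add to the stopword set on the next iteration.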
Shaped word clouds
| |
| import jieba |
| import wordcloud |
| |
| from imageio import imread |
| # Load the shape image; words are drawn only on its non-white area |
| mask = imread("fivestart.png") |
| |
| with open("新时代中国特色社会主义.txt", "r", encoding="utf-8") as f: |
|     t = f.read() |
| ls = jieba.lcut(t) |
| txt = " ".join(ls) |
| |
| # When a mask is given, the output takes the mask's size and width/height are ignored |
| w = wordcloud.WordCloud(font_path="/Library/Fonts/Heiti.ttc", width=1000, height=700, background_color="white", mask=mask) |
| w.generate(txt) |
| w.to_file("grwordcloud.png") |
| |
| import jieba |
| import wordcloud |
| |
| from imageio import imread |
| # Use a map outline as the shape; pass the local file name only, without any URL query string |
| mask = imread("chinamap.jpg") |
| |
| with open("新时代中国特色社会主义.txt", "r", encoding="utf-8") as f: |
|     t = f.read() |
| ls = jieba.lcut(t) |
| txt = " ".join(ls) |
| |
| w = wordcloud.WordCloud(font_path="/Library/Fonts/Heiti.ttc", width=1000, height=700, background_color="white", mask=mask) |
| w.generate(txt) |
| w.to_file("grwordcloud.png") |
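A mask does not have to come from an image file. As far as the wordcloud documentation describes it, pure white entries (255) in the mask are treated as masked out and every other value is free to draw on. The sketch below builds a small circular mask by hand to make that convention concrete. The helper names `circle_mask_rows` and `to_mask_array` are hypothetical, and the 9 x 9 size is only for illustration; a real mask would be several hundred pixels across.

```python
def circle_mask_rows(size):
    """Return size x size rows: 0 inside a centered circle (drawable), 255 outside (masked out)."""
    center = (size - 1) / 2
    radius = size / 2
    rows = []
    for y in range(size):
        row = []
        for x in range(size):
            inside = (x - center) ** 2 + (y - center) ** 2 <= radius ** 2
            row.append(0 if inside else 255)
        rows.append(row)
    return rows


def to_mask_array(rows):
    """Convert the nested lists to a uint8 array suitable for WordCloud's mask argument."""
    import numpy as np  # imported lazily; only needed when the real mask is built

    return np.array(rows, dtype=np.uint8)


# Tiny example so the pattern of 0 and 255 values is easy to inspect
rows = circle_mask_rows(9)
```

With numpy installed, `mask = to_mask_array(circle_mask_rows(600))` could be passed as `mask=mask` in place of the `imread(...)` result in the scripts above.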
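- Learn more wordcloud parameters to extend what the word cloud can do
- Signature word cloud: design a word cloud style of your own
- More files: practice generating word clouds from more documents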
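As a starting point for the first exercise, the sketch below collects a few `WordCloud` constructor arguments beyond the ones used in the scripts above. The specific values and the helpers `merge_options` and `build_wordcloud` are illustrative assumptions; check the wordcloud documentation for the defaults in the installed version.

```python
# Options already used in the scripts above
BASE_OPTIONS = {
    "font_path": "/Library/Fonts/Heiti.ttc",  # macOS-specific path to a Chinese font
    "width": 1000,
    "height": 700,
    "background_color": "white",
}

# Additional WordCloud keyword arguments; the values are illustrative only
EXTRA_OPTIONS = {
    "max_words": 100,       # upper bound on the number of words drawn
    "min_font_size": 8,     # smallest font size used
    "max_font_size": 120,   # largest font size used
    "collocations": False,  # do not add two-word phrases (bigrams)
    "colormap": "viridis",  # matplotlib colormap name used for word colors
    "random_state": 42,     # fixed seed so repeated runs give the same layout
}


def merge_options(stopwords=None, **overrides):
    """Combine base options, extra options, and per-call overrides into one dict."""
    options = {**BASE_OPTIONS, **EXTRA_OPTIONS, **overrides}
    if stopwords is not None:
        options["stopwords"] = set(stopwords)
    return options


def build_wordcloud(stopwords=None, **overrides):
    """Create a WordCloud object from the merged options."""
    import wordcloud  # imported lazily so the option dicts can be inspected without it

    return wordcloud.WordCloud(**merge_options(stopwords, **overrides))


options = merge_options(stopwords=["我们"], max_words=50)
```

A call such as `w = build_wordcloud(stopwords=["我们"], max_words=50)` followed by `w.generate(txt)` and `w.to_file("grwordcloud.png")` would slot into the existing scripts. Changing one option at a time makes it easier to see what each parameter does to the image.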