{"id":5348,"date":"2022-04-02T20:29:46","date_gmt":"2022-04-02T12:29:46","guid":{"rendered":"https:\/\/egonlin.com\/?p=5348"},"modified":"2022-04-02T20:29:46","modified_gmt":"2022-04-02T12:29:46","slug":"07-05-%e6%a8%a1%e5%9d%976-wordcloud%e5%ba%93%e7%9a%84%e4%bd%bf%e7%94%a8","status":"publish","type":"post","link":"https:\/\/egonlin.com\/?p=5348","title":{"rendered":"07-05 Module 6: Using the wordcloud Library"},"content":{"rendered":"<h1>1. Introduction to the wordcloud Library<\/h1>\n<h2>1.1 Overview of the wordcloud Library<\/h2>\n<p>wordcloud is an excellent third-party library for displaying word clouds.<\/p>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75281.png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75281.png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" \/><\/div><\/p>\n<ul>\n<li>A word cloud uses words as its basic unit to present text in a more intuitive and artistic way<\/li>\n<\/ul>\n<h2>1.2 Installing the wordcloud Library<\/h2>\n<p><code>pip install wordcloud<\/code> (run from the cmd command line)<\/p>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75282.png'><img class=\"lazyload lazyload-style-2\" 
src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75282.png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" \/><\/div><\/p>\n<h1>2. Using the wordcloud Library<\/h1>\n<h2>2.1 Basic Usage<\/h2>\n<p>The wordcloud library represents a word cloud as a WordCloud object.<\/p>\n<ul>\n<li>wordcloud.WordCloud() represents the word cloud for one text<\/li>\n<li>The word cloud is drawn from parameters such as how often each word appears in the text<\/li>\n<li>The shape, size, and color of the word cloud can all be configured<\/li>\n<\/ul>\n<h2>2.2 Common Methods<\/h2>\n<pre><code class=\"language-python\">w = wordcloud.WordCloud()<\/code><\/pre>\n<ul>\n<li>Everything is built on the WordCloud object<\/li>\n<li>Configure parameters, load the text, write the output file<\/li>\n<\/ul>\n<table>\n<thead>\n<tr>\n<th style=\"text-align: center;\">Method<\/th>\n<th style=\"text-align: center;\">Description<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"text-align: center;\">w.generate(txt)<\/td>\n<td style=\"text-align: center;\">Loads the text txt into the WordCloud object w, e.g. <code>w.generate(&quot;Python and WordCloud&quot;)<\/code><\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">w.to_file(filename)<\/td>\n<td style=\"text-align: 
center;\">Writes the word cloud to an image file in .png or .jpg format, e.g. <code>w.to_file(&quot;outfile.png&quot;)<\/code><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<ul>\n<li>Step 1: configure the object's parameters<\/li>\n<li>Step 2: load the text for the word cloud<\/li>\n<li>Step 3: write the word cloud to a file<\/li>\n<\/ul>\n<pre><code class=\"language-python\">import wordcloud\n\nw = wordcloud.WordCloud()  # step 1: create the object with default parameters\nw.generate(&quot;Python and WordCloud&quot;)  # step 2: load the text\nw.to_file(&quot;pywordcloud.png&quot;)  # step 3: write the image file<\/code><\/pre>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75283.png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75283.png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" \/><\/div><\/p>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75284.png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75284.png\" 
src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" \/><\/div><\/p>\n<h2>2.3 Configuring Object Parameters<\/h2>\n<pre><code class=\"language-python\">w = wordcloud.WordCloud(&lt;parameters&gt;)<\/code><\/pre>\n<table>\n<thead>\n<tr>\n<th style=\"text-align: center;\">Parameter<\/th>\n<th style=\"text-align: center;\">Description<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td style=\"text-align: center;\">width<\/td>\n<td style=\"text-align: center;\">Width of the generated image; default 400 pixels<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">height<\/td>\n<td style=\"text-align: center;\">Height of the generated image; default 200 pixels<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">min_font_size<\/td>\n<td style=\"text-align: center;\">Smallest font size used in the word cloud; default 4<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">max_font_size<\/td>\n<td style=\"text-align: center;\">Largest font size used in the word cloud; adjusted automatically to the image height<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">font_step<\/td>\n<td style=\"text-align: center;\">Step between font sizes in the word cloud; default 1<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">font_path<\/td>\n<td style=\"text-align: center;\">Path to the font file; default None<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">max_words<\/td>\n<td style=\"text-align: 
center;\">Maximum number of words shown in the word cloud; default 200<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">stopwords<\/td>\n<td style=\"text-align: center;\">Words to exclude from the word cloud, i.e. words that are not displayed<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">mask<\/td>\n<td style=\"text-align: center;\">Shape of the word cloud; rectangular by default. Setting it requires loading an image with the imread() function<\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">background_color<\/td>\n<td style=\"text-align: center;\">Background color of the word cloud image; default black<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<pre><code class=\"language-python\"># mask: load an image and use it as the shape of the word cloud\nimport wordcloud\nfrom imageio import imread\n\nmk = imread(&quot;pic.png&quot;)\nw = wordcloud.WordCloud(mask=mk)<\/code><\/pre>\n<h2>2.4 wordcloud Examples<\/h2>\n<pre><code class=\"language-python\">import wordcloud\n\ntxt = &quot;life is short, you need python&quot;\nw = wordcloud.WordCloud(background_color=&quot;white&quot;)\nw.generate(txt)\nw.to_file(&quot;pywcloud.png&quot;)<\/code><\/pre>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75285.png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/egonlin.com\/wp-content\/uploads\/2022\/04\/wordcloud\u5e93\u7684\u4f7f\u75285.png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"\" 
\/><\/div><\/p>\n<p>Chinese text must first be segmented into words and joined into a space-separated string.<\/p>\n<pre><code class=\"language-python\">import jieba\nimport wordcloud\n\ntxt = &quot;Egon \u662f\u4e0a\u6d77\u6821\u533a\u6700\u5e05\u7684\u7537\u4eba\uff0c\u6ca1\u6709\u4e4b\u4e00\uff0c\u56e0\u4e3a\u4ed6\u5c31\u662f\u6700\u5e05\u7684&quot;\n\n# font_path must point to a font file that contains Chinese glyphs\nw = wordcloud.WordCloud(\n    width=1000,\n    height=700,\n    font_path=&quot;\/Library\/Fonts\/Heiti.ttc&quot;,\n)\n# jieba.lcut() segments the text into a list of words; join them with spaces\nw.generate(&quot; &quot;.join(jieba.lcut(txt)))\nw.to_file(&quot;pywcloud.png&quot;)<\/code><\/pre>\n<p><div class='fancybox-wrapper lazyload-container-unload' data-fancybox='post-images' href='https:\/\/img2018.cnblogs.com\/blog\/1825659\/201910\/1825659-20191021133756412-1500992717..png'><img class=\"lazyload lazyload-style-2\" src=\"data:image\/svg+xml;base64,PCEtLUFyZ29uTG9hZGluZy0tPgo8c3ZnIHdpZHRoPSIxIiBoZWlnaHQ9IjEiIHhtbG5zPSJodHRwOi8vd3d3LnczLm9yZy8yMDAwL3N2ZyIgc3Ryb2tlPSIjZmZmZmZmMDAiPjxnPjwvZz4KPC9zdmc+\"  data-original=\"https:\/\/img2018.cnblogs.com\/blog\/1825659\/201910\/1825659-20191021133756412-1500992717..png\" src=\"data:image\/png;base64,iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAADsQAAA7EAZUrDhsAAAANSURBVBhXYzh8+PB\/AAffA0nNPuCLAAAAAElFTkSuQmCC\" alt=\"pywcloud\" \/><\/div><\/p>\n","protected":false},"excerpt":{"rendered":"<p>1. Introduction to the wordcloud Library 1.1 Overview of the wordcloud Library 
wordcloud is an excellent third-party library for displaying word clouds [&hellip;]<\/p>\n","protected":false},"author":3,"featured_media":5352,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[371,384],"tags":[],"_links":{"self":[{"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/posts\/5348"}],"collection":[{"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/egonlin.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=5348"}],"version-history":[{"count":0,"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/posts\/5348\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/egonlin.com\/index.php?rest_route=\/wp\/v2\/media\/5352"}],"wp:attachment":[{"href":"https:\/\/egonlin.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=5348"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/egonlin.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=5348"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/egonlin.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=5348"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}