交大門文化衫设计

anonymous_warrior · May 22, 2024, 2:54pm

word cloud 我之前试着做过：

标题：

posts:

with mask:

标题：

posts:

代码很简单，主要是没有导出的数据库文件

xm_wordcloud.py

# src: https://github.com/leisurelicht/WordCloud-CN/
# required:
# wget https://github.com/leisurelicht/WordCloud-CN/blob/master/stopwords.txt

from pathlib import Path
import jieba
from wordcloud import WordCloud
import sqlite3
import pandas as pd

font_path = '/usr/share/fonts/wenquanyi/wqy-microhei/wqy-microhei.ttc'
stopword_path = 'stopwords.txt'
db = Path('/f/discourse_public_import/xjtu.app.public.dump.2024.04.05.db')


def txt2wc(txt, save_png_name='test.png'):
    with open(stopword_path) as f_stop:
        f_stop_text = f_stop.read()
        f_stop_seg_list = f_stop_text.splitlines()

    seg_list = jieba.cut(txt, cut_all=False)

    my_word_list = []

    for my_word in seg_list:
        if len(my_word.strip()) > 1 and not (my_word.strip() in f_stop_seg_list):
            my_word_list.append(my_word)

    my_word_str = ' '.join(my_word_list)
    from pathlib import Path
    from PIL import Image
    import numpy as np
    mask = np.array(Image.open(Path('~/Documents/jdm-mask-large.png').expanduser()))
    # use mask = None to generate wordcloud without mask
    wc = WordCloud(
        font_path=font_path,
        background_color="white",
        mask=mask,
        random_state=42,
        width=mask.shape[1]-200,
        height=mask.shape[0]-100,
    )
    wc.generate(my_word_str)

    wc.to_file(save_png_name)


con = sqlite3.connect(db)
cur = con.cursor()
cur.execute("SELECT name FROM sqlite_master WHERE type='table';")
print(cur.fetchall())  # [('topics',), ('users',), ('posts',), ('likes',)]
df_topics = pd.read_sql_query("SELECT * FROM topics", con)
df_posts = pd.read_sql_query("SELECT * FROM posts", con)

titles = df_topics['title']
posts = df_posts['raw']

titles = [t for t in titles if not ('关于' in t and '类别' in t)]
posts = [p.replace('<br>', '\n') for p in posts]

txt2wc('\n'.join(list(posts)), 'xmen-posts-wordcloud.png')
txt2wc('\n'.join(list(titles)), 'xmen-titles-wordcloud.png')

Quit · May 22, 2024, 2:56pm

居然没有成功

但我不抽烟 · May 22, 2024, 3:52pm

支持

sola · May 23, 2024, 5:27pm

参考知乎，拟了一个初稿

sola · May 23, 2024, 5:27pm

1.psd (1.9 MB)

↑psd 文件

AI · May 23, 2024, 6:00pm

有没有设计大佬来设计个

argmin · May 24, 2024, 3:14am

有种这个的感觉

Time_Limit_Exceed · May 24, 2024, 6:14am

供参考

anonymous_warrior · May 24, 2024, 6:28am

不够幽默

anonymous_warrior · May 24, 2024, 7:41am

can can need

anonymous_warrior · May 24, 2024, 7:45am

开启左右互搏模式

anonymous_warrior · May 24, 2024, 9:37am

早些年 VIXEN 打开之后显示：
安全检测 | 百度云加速

Topic		Replies	Views
随缘转发宣传海报网站	14	920	September 19, 2023
第二届交大门粉丝聚会 The 2nd Conference of JDM Fans (COJF2024) 网站 party	162	919	July 6, 2024
本站名称实时记录闲聊吹水	37	629	September 2, 2024
土源趣图楼 1.0 闲聊吹水趣图楼	41	279	May 4, 2025
交大門 logo 合集（史官记录帖）网站 logo	10	208	March 4, 2024
感觉水你站的学生好少闲聊吹水	27	567	October 14, 2024
交大門生日快乐闲聊吹水	10	149	June 2, 2024
关于交大门 logo（求助&征集帖）闲聊吹水	13	132	November 21, 2023
求🚪U 帮我 review 一个讲义（关于计网？闲聊吹水	25	180	December 19, 2024
轻松一刻，来分享大家看到的趣图趣事吧（精勤求学版）闲聊吹水	26	526	April 24, 2025

交大門文化衫设计

with mask:

Related topics