Python詞頻統計的方法有哪些

發布時間：2021-12-06 16:08:58 來源：億速云閱讀：229 作者：小新欄目：開發技術

這篇文章將為大家詳細講解有關Python詞頻統計的方法有哪些，小編覺得挺實用的，因此分享給大家做個參考，希望大家閱讀完這篇文章后可以有所收獲。

統計文件里每個單詞的個數

思路：

分別統計文檔中的單詞，與出現的次數

用兩個列表將其保存起來，最后再用zip()函數連接輸出**

想法成立開始實踐

方法一：

# 導入文件
with open("passage.txt", 'r') as file:
    dates = file.readlines()
# 處理
words = []
for i in dates:
    words += i.replace("\n", "").split(" ")  # 用空字符來代替換行 words +是為了不被覆蓋無+將只有最后一條數據
    # print(i.replace("\n","").split(" "))
setWords = list(set(words))  # 集合自動去重
num = []  # 統計一個單詞出現的次數
for k in setWords:
    count = 0
    for j in words:
        if k == j:
            count = count + 1
    num.append(count)
print(num)
print(setWords)
# 輸出
for x, y in zip(setWords, num):  # 將兩個列表用zip結合
    print(x + ":" + str(y))、

效果圖：

Python詞頻統計的方法有哪些

方法二：

此方法用來字典，較前一個相對簡潔一點

# 導入
with open("passage.txt", 'r') as file:
    dates = file.readlines()
# 處理
words = []
for i in dates:
    words += i.replace("\n", "").split(" ")
    # print(i.replace("\n","").split(" "))
# setWords=list(set(words))  #可以不用這個
print(words)
print("-" * 40)
# print(setWords)
diccount = dict()
for i in words:
    if (i not in diccount):
        diccount[i] = 1  # 第一遍字典為空 賦值相當于 i=1，i為words里的單詞
        # print(diccount)
    else:
        diccount[i] = diccount[i] + 1  # 等不在里面的全部遍歷一遍賦值就都在里面了，我們再來記數
print(diccount)

效果圖：

Python詞頻統計的方法有哪些

統計的文檔

Python詞頻統計的方法有哪些

關于“Python詞頻統計的方法有哪些”這篇文章就分享到這里了，希望以上內容可以對大家有一定的幫助，使各位可以學到更多知識，如果覺得文章不錯，請把它分享出去讓更多的人看到。

向AI問一下細節

中文字幕av专区_日韩电影在线播放_精品国产精品久久一区免费式_av在线免费观看网站

Python詞頻統計的方法有哪些

統計文件里每個單詞的個數

思路：

想法成立開始實踐

方法一：

方法二：

猜你喜歡

中文字幕av专区_日韩电影在线播放_精品国产精品久久一区免费式_av在线免费观看网站

Python詞頻統計的方法有哪些

統計文件里每個單詞的個數

思路：

想法成立開始實踐

方法一：

方法二：

猜你喜歡

最新資訊

相關推薦

相關標簽