python统计文本文件内单词数量的方法

本文实例讲述了python统计文本文件内单词数量的方法。分享给大家供大家参考。具体实现方法如下:

# count lines, sentences, and words of a text file

# set all the counters to zero

lines, blanklines, sentences, words = 0, 0, 0, 0

print '-' * 50

try:

# use a text file you have, or google for this one ...

filename = 'GettysburgAddress.txt'

textf = open(filename, 'r')

except IOError:

print 'Cannot open file %s for reading' % filename

import sys

sys.exit(0)

# reads one line at a time

for line in textf:

print line, # test

lines += 1

if line.startswith('\n'):

blanklines += 1

else:

# assume that each sentence ends with . or ! or ?

# so simply count these characters

sentences += line.count('.') + line.count('!') + line.count('?')

# create a list of words

# use None to split at any whitespace regardless of length

# so for instance double space counts as one space

tempwords = line.split(None)

print tempwords # test

# word total count

words += len(tempwords)

textf.close()

print '-' * 50

print "Lines : ", lines

print "Blank lines: ", blanklines

print "Sentences : ", sentences

print "Words : ", words

# optional console wait for keypress

from msvcrt import getch

getch()

希望本文所述对大家的python程序设计有所帮助。

以上是 python统计文本文件内单词数量的方法 的全部内容, 来源链接: utcz.com/z/345099.html

回到顶部