I am very new to python and also didn't work with text before…I have 100 text files, each has around 100 to 150 lines of unstructured text describing patient's condition. I read one file in python using:
with open("C:\\...\\...\\...\\record-13.txt") as f:
content = f.readlines()
print (content)
Now I can split each line of this file to its words using for example:
a = content[0].split()
print (a)
but I don't know how to split whole file to words?
do loops (while or for) help with that?
Thank you for your help guys. Your answers help me to write this (in my file, words are split by space so that's delimiter I think!):
with open ("C:\\...\\...\\...\\record-13.txt") as f:
lines = f.readlines()
for line in lines:
words = line.split()
for word in words:
print (word)
that simply splits words by line (one word in one line).
Best Answer
It depends on how you define
words
, or what you regard as thedelimiters
.Notice
string.split
in Python receives an optional parameterdelimiter
, so you could pass it as this:Unfortunately,
string.split
receives a single delimiter only, so you may need multi-level splitting like this:Looks ugly, right? Luckily we can use iteration instead:
EDITED: Or simply we could use the regular expression package: