Text Cleaning Issues
I'm learning text cleaning using python online.
I have get rid of some stop words and lower the letter.
but when i execute this code, it doesn't show anything.
I don't know why.
# we add some words to the stop word list
texts, article = ,
for w in doc:
# if it's not a stop word or punctuation mark, add it to our article!
if w.text != 'n' and not w.is_stop and not w.is_punct and not w.like_num and w.text != 'I':
# we add the lematized version of the word
article.append(w.lemma_)
# if it's a new line, it means we're onto our next document
if w.text == 'n':
texts.append(article)
article =
when i try to output texts, it's just blank.
python text nlp topic-modeling
|
show 1 more comment
I'm learning text cleaning using python online.
I have get rid of some stop words and lower the letter.
but when i execute this code, it doesn't show anything.
I don't know why.
# we add some words to the stop word list
texts, article = ,
for w in doc:
# if it's not a stop word or punctuation mark, add it to our article!
if w.text != 'n' and not w.is_stop and not w.is_punct and not w.like_num and w.text != 'I':
# we add the lematized version of the word
article.append(w.lemma_)
# if it's a new line, it means we're onto our next document
if w.text == 'n':
texts.append(article)
article =
when i try to output texts, it's just blank.
python text nlp topic-modeling
2
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put aprint(w)
right at the beginning of the for loop to see if w has any value
– Osuman AAA
Nov 21 at 8:32
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35
|
show 1 more comment
I'm learning text cleaning using python online.
I have get rid of some stop words and lower the letter.
but when i execute this code, it doesn't show anything.
I don't know why.
# we add some words to the stop word list
texts, article = ,
for w in doc:
# if it's not a stop word or punctuation mark, add it to our article!
if w.text != 'n' and not w.is_stop and not w.is_punct and not w.like_num and w.text != 'I':
# we add the lematized version of the word
article.append(w.lemma_)
# if it's a new line, it means we're onto our next document
if w.text == 'n':
texts.append(article)
article =
when i try to output texts, it's just blank.
python text nlp topic-modeling
I'm learning text cleaning using python online.
I have get rid of some stop words and lower the letter.
but when i execute this code, it doesn't show anything.
I don't know why.
# we add some words to the stop word list
texts, article = ,
for w in doc:
# if it's not a stop word or punctuation mark, add it to our article!
if w.text != 'n' and not w.is_stop and not w.is_punct and not w.like_num and w.text != 'I':
# we add the lematized version of the word
article.append(w.lemma_)
# if it's a new line, it means we're onto our next document
if w.text == 'n':
texts.append(article)
article =
when i try to output texts, it's just blank.
python text nlp topic-modeling
python text nlp topic-modeling
edited Nov 21 at 8:25
Osuman AAA
413212
413212
asked Nov 21 at 8:20
WillZhang
41
41
2
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put aprint(w)
right at the beginning of the for loop to see if w has any value
– Osuman AAA
Nov 21 at 8:32
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35
|
show 1 more comment
2
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put aprint(w)
right at the beginning of the for loop to see if w has any value
– Osuman AAA
Nov 21 at 8:32
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35
2
2
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put a
print(w)
right at the beginning of the for loop to see if w has any value– Osuman AAA
Nov 21 at 8:32
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put a
print(w)
right at the beginning of the for loop to see if w has any value– Osuman AAA
Nov 21 at 8:32
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35
|
show 1 more comment
1 Answer
1
active
oldest
votes
I believe the 'texts' list and 'article' list refer to same content and hence, clearing one list's content also clears the other list.
Here is a link to a similar question: Python: Append a list to another list and Clear the first list
Please see if the above are useful.
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
StackExchange.using("externalEditor", function () {
StackExchange.using("snippets", function () {
StackExchange.snippets.init();
});
});
}, "code-snippets");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "1"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53407797%2ftext-cleaning-issues%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
1 Answer
1
active
oldest
votes
1 Answer
1
active
oldest
votes
active
oldest
votes
active
oldest
votes
I believe the 'texts' list and 'article' list refer to same content and hence, clearing one list's content also clears the other list.
Here is a link to a similar question: Python: Append a list to another list and Clear the first list
Please see if the above are useful.
add a comment |
I believe the 'texts' list and 'article' list refer to same content and hence, clearing one list's content also clears the other list.
Here is a link to a similar question: Python: Append a list to another list and Clear the first list
Please see if the above are useful.
add a comment |
I believe the 'texts' list and 'article' list refer to same content and hence, clearing one list's content also clears the other list.
Here is a link to a similar question: Python: Append a list to another list and Clear the first list
Please see if the above are useful.
I believe the 'texts' list and 'article' list refer to same content and hence, clearing one list's content also clears the other list.
Here is a link to a similar question: Python: Append a list to another list and Clear the first list
Please see if the above are useful.
answered Dec 3 at 7:31
Karthik
11
11
add a comment |
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53407797%2ftext-cleaning-issues%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
2
what is doc ? how is it defined and initialized ?
– Osuman AAA
Nov 21 at 8:25
Doc is a plain txt file
– WillZhang
Nov 21 at 8:29
How do you open the file and read from it ? Can you show that part? Also, It would be good idea to put a
print(w)
right at the beginning of the for loop to see if w has any value– Osuman AAA
Nov 21 at 8:32
text = open("BioTest.txt").read()
– WillZhang
Nov 21 at 8:34
And i tried to print w, it has value
– WillZhang
Nov 21 at 8:35