AssertionError: Index is not unique on dataframe (Entity cust) using Featuretools
I have a dataframe like the one below:
customerid term age
08a858899538ddb8e015390510b321f0830199897 30 24
18a858959537a097401537a4e316e25f730196361 60 72
a8589c253ace09b0153af6ba58f1f313019822366 45 38
I am creating an entity from it with featuretools as follows:
es = es.entity_from_dataframe(entity_id='cust', dataframe=df, index='customerid')
but I get the error:
AssertionError: Index is not unique on dataframe (Entity cust)
even though customerid is the identifier.
python python-3.x feature-extraction featuretools
Is it possible that customerid is not unique? You can check if that is the case with len(es) == es.customerid.nunique() (if it's False, then you have repeated customerids).
– Julian Peller
Nov 20 at 14:15
@JulianPeller is right. You should run that check on the dataframe, not the entityset, though, so the code would be len(df) == df.customerid.nunique()
– Max Kanter
Nov 20 at 14:45
Oh, you are right. My mistake.
– Julian Peller
Nov 20 at 14:48
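For reference, here is a minimal sketch of the check suggested in the comments, using plain pandas. The sample data is hypothetical: the first two IDs are copied from the question and the third row deliberately repeats the first ID, since the real dataframe is not shown. It runs the len(df) == nunique() comparison and then lists the offending rows with duplicated(keep=False), which is one possible way to see which customerids repeat.

```python
import pandas as pd

# Hypothetical sample data: the first two IDs come from the question,
# the third row repeats the first ID on purpose to reproduce the problem.
df = pd.DataFrame({
    "customerid": [
        "08a858899538ddb8e015390510b321f0830199897",
        "18a858959537a097401537a4e316e25f730196361",
        "08a858899538ddb8e015390510b321f0830199897",
    ],
    "term": [30, 60, 45],
    "age": [24, 72, 38],
})

# The check from the comments: True only if every customerid occurs exactly once.
# nunique() skips missing values, so a NaN customerid also makes this False.
ids_are_unique = len(df) == df["customerid"].nunique()

# keep=False marks every occurrence of a repeated ID, not just the later ones.
duplicate_rows = df[df["customerid"].duplicated(keep=False)]

print(ids_are_unique)
print(duplicate_rows)
```

If duplicate_rows is not empty, customerid cannot serve as the entity index as-is. Whether to drop the repeats or index on something else depends on what the repeated rows mean in your data.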
edited Nov 20 at 17:34 by Max Kanter
asked Nov 20 at 12:27 by Ian Okeyo