Plotting from large Dataset in Python

Multi tool use
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
add a comment |
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
up vote
1
down vote
favorite
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
python python-3.x pandas matplotlib
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
edited Nov 19 at 12:39
user3471881
8791619
8791619
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
asked Nov 19 at 11:20
Math_Man1
63
63
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
New contributor
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
Math_Man1 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53373534%2fplotting-from-large-dataset-in-python%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
4TUnpxCn8vQMhT6rCRWsAdsCfjUw7,fZJh6hxu8DFxsvV i293PG,qF7uHWFs0vZ6sq,r4j1 SpE0XhXelzi2L01 gi X
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago