Plotting from large Dataset in Python
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
add a comment |
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
up vote
1
down vote
favorite
up vote
1
down vote
favorite
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
New contributor
I am facing a collection of 50000
data points (incurred losses in insurance) and want to find a fitting distribution. Since I would like to make a plot in python it is not working - but it should be very simple e.g. making a plot
of the first 500
incurred losses :
df = pd.read_excel('data.xlsx')
plt.hist(df[1:500], bins='auto')
but the plot
is empty.
What do you think?
Thank you in advance and kind regards :)
python python-3.x pandas matplotlib
python python-3.x pandas matplotlib
New contributor
New contributor
edited Nov 19 at 12:39
user3471881
8791619
8791619
New contributor
asked Nov 19 at 11:20
Math_Man1
63
63
New contributor
New contributor
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Math_Man1 is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53373534%2fplotting-from-large-dataset-in-python%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
please provide a sample of the data
– Vivek Kalyanarangan
Nov 19 at 11:29
due to data protection I am not quite sure, if I can provide you with the exact data, but : it's a list containing 52000 entries in the range of 20$ up to 3.000.000$. The aggregation of the data should be done automatically with bins='auto' ....
– Math_Man1
Nov 19 at 11:43
1
just add some random data that will reproduce your problem, if you can't add the actual data that you are using.
– user3471881
Nov 19 at 12:18
Hi, maybe its usefull to group the losses before? like: group 1 : 0-500$ group 2: 501-1000$ and so on is this possible via the groupby function?
– Math_Man1
2 days ago