Align data gamma distribution
Clash Royale CLAN TAG#URR8PPP
up vote
-1
down vote
favorite
For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.
I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.
It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.
I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.
I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.
Thanks in Advance!
data-analysis
add a comment |Â
up vote
-1
down vote
favorite
For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.
I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.
It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.
I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.
I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.
Thanks in Advance!
data-analysis
add a comment |Â
up vote
-1
down vote
favorite
up vote
-1
down vote
favorite
For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.
I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.
It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.
I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.
I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.
Thanks in Advance!
data-analysis
For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.
I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.
It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.
I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.
I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.
Thanks in Advance!
data-analysis
edited Jul 25 at 12:01
TZakrevskiy
19.8k12253
19.8k12253
asked Jul 23 at 21:46
DPashley
1
1
add a comment |Â
add a comment |Â
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2860808%2falign-data-gamma-distribution%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password