Align data gamma distribution

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
-1
down vote

favorite












For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.



I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.



It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.



I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.



I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.



Thanks in Advance!







share|cite|improve this question

























    up vote
    -1
    down vote

    favorite












    For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.



    I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.



    It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.



    I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.



    I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.



    Thanks in Advance!







    share|cite|improve this question























      up vote
      -1
      down vote

      favorite









      up vote
      -1
      down vote

      favorite











      For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.



      I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.



      It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.



      I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.



      I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.



      Thanks in Advance!







      share|cite|improve this question













      For a program I am writing I have a large dataset of "messages" that a analyser program I have made reads in based on a given timestamp in whole seconds. For example, the analyser will read in all messages with a timestamp between 5 and 9. These messages are to replicate social media posts about a topic with a sudden "explosion" of talk in the beginning before tailing off after a period of time.



      I am trying to create some formulas in Excel to allow me to distribute these messages over a set range in time. For example between 240 and 400 seconds. This will form the range of time the messages are read by the analyser. The distribution shape that kind of matches what I'm trying to do is a gamma distribution as in the picture. Green line fits best.



      It is best to think of the gamma graph with the x axis being the timestamp with my set range (240 - 400 for example) and the y being the number of messages that have that timestamp. Based on this distribution shape, this would give the "explosion" of posts at the beginning and less per second towards the end.



      I suppose what I am asking for help with is taking a gamma distribution shape and overlaying my dataset so that towards the beginning of my time range, there are more with the same timestamp and less later on.



      I can explain specifics better if needed...but this type of problem is a tad out of my comfort zone...I can convert things into Excel speak easy enough. It is just coming up with the base maths behind it that I am struggling with.



      Thanks in Advance!









      share|cite|improve this question












      share|cite|improve this question




      share|cite|improve this question








      edited Jul 25 at 12:01









      TZakrevskiy

      19.8k12253




      19.8k12253









      asked Jul 23 at 21:46









      DPashley

      1




      1

























          active

          oldest

          votes











          Your Answer




          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "69"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          convertImagesToLinks: true,
          noModals: false,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );








           

          draft saved


          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2860808%2falign-data-gamma-distribution%23new-answer', 'question_page');

          );

          Post as a guest



































          active

          oldest

          votes













          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes










           

          draft saved


          draft discarded


























           


          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2860808%2falign-data-gamma-distribution%23new-answer', 'question_page');

          );

          Post as a guest













































































          Comments

          Popular posts from this blog

          What is the equation of a 3D cone with generalised tilt?

          Color the edges and diagonals of a regular polygon

          Relationship between determinant of matrix and determinant of adjoint?