How to find diversified vectors in a given set?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
1
down vote

favorite












I will describe my problem in details. I have a set of $N$ vectors, each of them defines a logistic regresion model. From this $N$ vectors (models) I want to take $n$ which are the most unique.



I'm not interested in similar vectors (models) because they share same information, and they will produce similar predictions. All of my $N$ models have high predictive power and I want to create single clasifier from $n$ most independent in a given set.



It is useles to make single calsifier $c=(A+B)/2$ from model A (predictive power=75%), and model $B$ (predictive power=75%) when they are equal ($A=B$).



But if they differ in predictor variables (defined in my vectors) and still have predictive power at high level (75%) in case when $Aneq B$ then combined clasifier $c=(A+B)/2$ will perform much better.



Is there any algoritmic (R), or mathematical framework for finding most independent (diversified?) subset of vectors in a given set?







share|cite|improve this question

























    up vote
    1
    down vote

    favorite












    I will describe my problem in details. I have a set of $N$ vectors, each of them defines a logistic regresion model. From this $N$ vectors (models) I want to take $n$ which are the most unique.



    I'm not interested in similar vectors (models) because they share same information, and they will produce similar predictions. All of my $N$ models have high predictive power and I want to create single clasifier from $n$ most independent in a given set.



    It is useles to make single calsifier $c=(A+B)/2$ from model A (predictive power=75%), and model $B$ (predictive power=75%) when they are equal ($A=B$).



    But if they differ in predictor variables (defined in my vectors) and still have predictive power at high level (75%) in case when $Aneq B$ then combined clasifier $c=(A+B)/2$ will perform much better.



    Is there any algoritmic (R), or mathematical framework for finding most independent (diversified?) subset of vectors in a given set?







    share|cite|improve this question























      up vote
      1
      down vote

      favorite









      up vote
      1
      down vote

      favorite











      I will describe my problem in details. I have a set of $N$ vectors, each of them defines a logistic regresion model. From this $N$ vectors (models) I want to take $n$ which are the most unique.



      I'm not interested in similar vectors (models) because they share same information, and they will produce similar predictions. All of my $N$ models have high predictive power and I want to create single clasifier from $n$ most independent in a given set.



      It is useles to make single calsifier $c=(A+B)/2$ from model A (predictive power=75%), and model $B$ (predictive power=75%) when they are equal ($A=B$).



      But if they differ in predictor variables (defined in my vectors) and still have predictive power at high level (75%) in case when $Aneq B$ then combined clasifier $c=(A+B)/2$ will perform much better.



      Is there any algoritmic (R), or mathematical framework for finding most independent (diversified?) subset of vectors in a given set?







      share|cite|improve this question













      I will describe my problem in details. I have a set of $N$ vectors, each of them defines a logistic regresion model. From this $N$ vectors (models) I want to take $n$ which are the most unique.



      I'm not interested in similar vectors (models) because they share same information, and they will produce similar predictions. All of my $N$ models have high predictive power and I want to create single clasifier from $n$ most independent in a given set.



      It is useles to make single calsifier $c=(A+B)/2$ from model A (predictive power=75%), and model $B$ (predictive power=75%) when they are equal ($A=B$).



      But if they differ in predictor variables (defined in my vectors) and still have predictive power at high level (75%) in case when $Aneq B$ then combined clasifier $c=(A+B)/2$ will perform much better.



      Is there any algoritmic (R), or mathematical framework for finding most independent (diversified?) subset of vectors in a given set?









      share|cite|improve this question












      share|cite|improve this question




      share|cite|improve this question








      edited Jul 24 at 11:18









      Javi

      2,1481725




      2,1481725









      asked Jul 24 at 11:10









      Łukasz Czop

      61




      61

























          active

          oldest

          votes











          Your Answer




          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "69"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          convertImagesToLinks: true,
          noModals: false,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          noCode: true, onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );








           

          draft saved


          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2861226%2fhow-to-find-diversified-vectors-in-a-given-set%23new-answer', 'question_page');

          );

          Post as a guest



































          active

          oldest

          votes













          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes










           

          draft saved


          draft discarded


























           


          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2861226%2fhow-to-find-diversified-vectors-in-a-given-set%23new-answer', 'question_page');

          );

          Post as a guest













































































          Comments

          Popular posts from this blog

          What is the equation of a 3D cone with generalised tilt?

          Color the edges and diagonals of a regular polygon

          Relationship between determinant of matrix and determinant of adjoint?