improving accuracy of classificationImproving Naive Bayes accuracy for text classificationOver-fitting issue in a classification problem (unbalanced data)Aggregating Decision TreesDecision Tree generating leaves for only one caseNeed Advice, Classification Problem in Python: Should I use Decision tree, Random Forests, or Logistic Regression?Fetching rules from rpart using caret packageEstimating Propensity Score via Regression Trees (in R Using rpart)Desicision tree classification with a “false” attributeProblems successfuly implementing stacked autoencoder in binary classification problemWhy is recall so high?

Recursively updating the MLE as new observations stream in

Help with identifying unique aircraft over NE Pennsylvania

What (if any) is the reason to buy in small local stores?

Exit shell with shortcut (not typing exit) that closes session properly

Justification failure in beamer enumerate list

Hackerrank All Women's Codesprint 2019: Name the Product

"Marked down as someone wanting to sell shares." What does that mean?

Writing in a Christian voice

UK Tourist Visa- Enquiry

Output visual diagram of picture

Friend wants my recommendation but I don't want to

Print a physical multiplication table

pipe commands inside find -exec?

What is the tangent at a sharp point on a curve?

How to test the sharpness of a knife?

Did Nintendo change its mind about 68000 SNES?

Why doesn't the fusion process of the sun speed up?

What are the rules for concealing thieves' tools (or items in general)?

Homology of the fiber

Knife as defense against stray dogs

Fair way to split coins

Is xar preinstalled on macOS?

Nested Dynamic SOQL Query

Symbolism of 18 Journeyers



improving accuracy of classification


Improving Naive Bayes accuracy for text classificationOver-fitting issue in a classification problem (unbalanced data)Aggregating Decision TreesDecision Tree generating leaves for only one caseNeed Advice, Classification Problem in Python: Should I use Decision tree, Random Forests, or Logistic Regression?Fetching rules from rpart using caret packageEstimating Propensity Score via Regression Trees (in R Using rpart)Desicision tree classification with a “false” attributeProblems successfuly implementing stacked autoencoder in binary classification problemWhy is recall so high?













0












$begingroup$


I have data with 95 numeric variables and 5 categorical variables. My Y has 2 values. I built a decision tree and my accuracy was 81.8%. Then I created 3 new variables as follows. It improved accuracy to 84.3%



  1. Normalize numeric variables and for training data, find mean vector for Y=1 and Y=0

  2. for each data point, find euclidean distance from each mean vector - distance0 and distance1

  3. third variable will be 0 if distance0 is <= distance1

I was wondering if there is any other new variables that i could create to improve the accuracy



I used a decision tree as it is fast to build and gives me indication whether a newly created variable is useful or not.



Please let me know if you have any thoughts










share|improve this question









$endgroup$
















    0












    $begingroup$


    I have data with 95 numeric variables and 5 categorical variables. My Y has 2 values. I built a decision tree and my accuracy was 81.8%. Then I created 3 new variables as follows. It improved accuracy to 84.3%



    1. Normalize numeric variables and for training data, find mean vector for Y=1 and Y=0

    2. for each data point, find euclidean distance from each mean vector - distance0 and distance1

    3. third variable will be 0 if distance0 is <= distance1

    I was wondering if there is any other new variables that i could create to improve the accuracy



    I used a decision tree as it is fast to build and gives me indication whether a newly created variable is useful or not.



    Please let me know if you have any thoughts










    share|improve this question









    $endgroup$














      0












      0








      0





      $begingroup$


      I have data with 95 numeric variables and 5 categorical variables. My Y has 2 values. I built a decision tree and my accuracy was 81.8%. Then I created 3 new variables as follows. It improved accuracy to 84.3%



      1. Normalize numeric variables and for training data, find mean vector for Y=1 and Y=0

      2. for each data point, find euclidean distance from each mean vector - distance0 and distance1

      3. third variable will be 0 if distance0 is <= distance1

      I was wondering if there is any other new variables that i could create to improve the accuracy



      I used a decision tree as it is fast to build and gives me indication whether a newly created variable is useful or not.



      Please let me know if you have any thoughts










      share|improve this question









      $endgroup$




      I have data with 95 numeric variables and 5 categorical variables. My Y has 2 values. I built a decision tree and my accuracy was 81.8%. Then I created 3 new variables as follows. It improved accuracy to 84.3%



      1. Normalize numeric variables and for training data, find mean vector for Y=1 and Y=0

      2. for each data point, find euclidean distance from each mean vector - distance0 and distance1

      3. third variable will be 0 if distance0 is <= distance1

      I was wondering if there is any other new variables that i could create to improve the accuracy



      I used a decision tree as it is fast to build and gives me indication whether a newly created variable is useful or not.



      Please let me know if you have any thoughts







      classification predictive-modeling






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked 32 mins ago









      user2543622user2543622

      1236




      1236




















          0






          active

          oldest

          votes











          Your Answer





          StackExchange.ifUsing("editor", function ()
          return StackExchange.using("mathjaxEditing", function ()
          StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
          StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
          );
          );
          , "mathjax-editing");

          StackExchange.ready(function()
          var channelOptions =
          tags: "".split(" "),
          id: "557"
          ;
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function()
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled)
          StackExchange.using("snippets", function()
          createEditor();
          );

          else
          createEditor();

          );

          function createEditor()
          StackExchange.prepareEditor(
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: false,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: null,
          bindNavPrevention: true,
          postfix: "",
          imageUploader:
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          ,
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          );



          );













          draft saved

          draft discarded


















          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47573%2fimproving-accuracy-of-classification%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes















          draft saved

          draft discarded
















































          Thanks for contributing an answer to Data Science Stack Exchange!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid


          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.

          Use MathJax to format equations. MathJax reference.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function ()
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47573%2fimproving-accuracy-of-classification%23new-answer', 'question_page');

          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          ValueError: Error when checking input: expected conv2d_13_input to have shape (3, 150, 150) but got array with shape (150, 150, 3)2019 Community Moderator ElectionError when checking : expected dense_1_input to have shape (None, 5) but got array with shape (200, 1)Error 'Expected 2D array, got 1D array instead:'ValueError: Error when checking input: expected lstm_41_input to have 3 dimensions, but got array with shape (40000,100)ValueError: Error when checking target: expected dense_1 to have shape (7,) but got array with shape (1,)ValueError: Error when checking target: expected dense_2 to have shape (1,) but got array with shape (0,)Keras exception: ValueError: Error when checking input: expected conv2d_1_input to have shape (150, 150, 3) but got array with shape (256, 256, 3)Steps taking too long to completewhen checking input: expected dense_1_input to have shape (13328,) but got array with shape (317,)ValueError: Error when checking target: expected dense_3 to have shape (None, 1) but got array with shape (7715, 40000)Keras exception: Error when checking input: expected dense_input to have shape (2,) but got array with shape (1,)

          Ружовы пелікан Змест Знешні выгляд | Пашырэнне | Асаблівасці біялогіі | Літаратура | НавігацыяДагледжаная версіяправерана1 зменаДагледжаная версіяправерана1 змена/ 22697590 Сістэматыкана ВіківідахВыявына Вікісховішчы174693363011049382

          Illegal assignment from SObject to ContactFetching String, Id from Map - Illegal Assignment Id to Field / ObjectError: Compile Error: Illegal assignment from String to BooleanError: List has no rows for assignment to SObjectError on Test Class - System.QueryException: List has no rows for assignment to SObjectRemote action problemDML requires SObject or SObject list type error“Illegal assignment from List to List”Test Class Fail: Batch Class: System.QueryException: List has no rows for assignment to SObjectMapping to a user'List has no rows for assignment to SObject' Mystery