Use text classifier on unseen data Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern) 2019 Moderator Election Q&A - Questionnaire 2019 Community Moderator Election ResultsBinary classification model for sparse / biased dataText categorization: combining different kind of featuresCoalitional effect in logistic regression and assessing explanarory variable contributionPreprocessing text before use RNNImbalanced Data how to use random forest to select important variables?Which machine (or deep) learning methods could suit my text classification problem?sklearn: SGDClassifier yields lower accuracy than LogisticRegressionclassification of small groupWhat is the difference between SVM and logistic regression?A few questions to understand a random forest blog

Irreducible of finite Krull dimension implies quasi-compact?

How to react to hostile behavior from a senior developer?

Where are Serre’s lectures at Collège de France to be found?

Is it ethical to give a final exam after the professor has quit before teaching the remaining chapters of the course?

Using et al. for a last / senior author rather than for a first author

Crossing US/Canada Border for less than 24 hours

What is the meaning of the simile “quick as silk”?

An adverb for when you're not exaggerating

Compare a given version number in the form major.minor.build.patch and see if one is less than the other

Significance of Cersei's obsession with elephants?

Amount of permutations on an NxNxN Rubik's Cube

Do jazz musicians improvise on the parent scale in addition to the chord-scales?

Withdrew £2800, but only £2000 shows as withdrawn on online banking; what are my obligations?

Is "Reachable Object" really an NP-complete problem?

How come Sam didn't become Lord of Horn Hill?

Is it cost-effective to upgrade an old-ish Giant Escape R3 commuter bike with entry-level branded parts (wheels, drivetrain)?

Using audio cues to encourage good posture

What do you call a floor made of glass so you can see through the floor?

Is safe to use va_start macro with this as parameter?

Why are the trig functions versine, haversine, exsecant, etc, rarely used in modern mathematics?

Is CEO the profession with the most psychopaths?

Why are there no cargo aircraft with "flying wing" design?

Uniqueness of spanning tree on a grid.

Can anything be seen from the center of the Boötes void? How dark would it be?



Use text classifier on unseen data



Announcing the arrival of Valued Associate #679: Cesar Manara
Planned maintenance scheduled April 17/18, 2019 at 00:00UTC (8:00pm US/Eastern)
2019 Moderator Election Q&A - Questionnaire
2019 Community Moderator Election ResultsBinary classification model for sparse / biased dataText categorization: combining different kind of featuresCoalitional effect in logistic regression and assessing explanarory variable contributionPreprocessing text before use RNNImbalanced Data how to use random forest to select important variables?Which machine (or deep) learning methods could suit my text classification problem?sklearn: SGDClassifier yields lower accuracy than LogisticRegressionclassification of small groupWhat is the difference between SVM and logistic regression?A few questions to understand a random forest blog










0












$begingroup$


I've trained a few models to classify between two categories of text. Logistic regression was the best. Now how can i test it on unseen data?
I tried this:



def train_model():
classifier.fit(feature_vector_train, label)
predictions = classifier.predict(feature_vector_valid)
joblib.dump(classifier, url+name)
...

load_model =joblib.load('my_model.pkl)
result = load_model.score('testx')


It tells me i need a y input. However, if it's new i don't have the label. WHat am i missing?










share|improve this question







New contributor




Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.







$endgroup$
















    0












    $begingroup$


    I've trained a few models to classify between two categories of text. Logistic regression was the best. Now how can i test it on unseen data?
    I tried this:



    def train_model():
    classifier.fit(feature_vector_train, label)
    predictions = classifier.predict(feature_vector_valid)
    joblib.dump(classifier, url+name)
    ...

    load_model =joblib.load('my_model.pkl)
    result = load_model.score('testx')


    It tells me i need a y input. However, if it's new i don't have the label. WHat am i missing?










    share|improve this question







    New contributor




    Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.







    $endgroup$














      0












      0








      0





      $begingroup$


      I've trained a few models to classify between two categories of text. Logistic regression was the best. Now how can i test it on unseen data?
      I tried this:



      def train_model():
      classifier.fit(feature_vector_train, label)
      predictions = classifier.predict(feature_vector_valid)
      joblib.dump(classifier, url+name)
      ...

      load_model =joblib.load('my_model.pkl)
      result = load_model.score('testx')


      It tells me i need a y input. However, if it's new i don't have the label. WHat am i missing?










      share|improve this question







      New contributor




      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.







      $endgroup$




      I've trained a few models to classify between two categories of text. Logistic regression was the best. Now how can i test it on unseen data?
      I tried this:



      def train_model():
      classifier.fit(feature_vector_train, label)
      predictions = classifier.predict(feature_vector_valid)
      joblib.dump(classifier, url+name)
      ...

      load_model =joblib.load('my_model.pkl)
      result = load_model.score('testx')


      It tells me i need a y input. However, if it's new i don't have the label. WHat am i missing?







      logistic-regression






      share|improve this question







      New contributor




      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.











      share|improve this question







      New contributor




      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      share|improve this question




      share|improve this question






      New contributor




      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.









      asked 34 mins ago









      Bia Calico CatBia Calico Cat

      1




      1




      New contributor




      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.





      New contributor





      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      Bia Calico Cat is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.




















          1 Answer
          1






          active

          oldest

          votes


















          0












          $begingroup$

          Welcome to the forums.



          My understanding is that you're wanting to use the previously trained model to label new data points? If so, you'll be wanting to use .predict(X). From sklearn's documentation they say.




          All supervised estimators in scikit-learn implement a fit(X, y) method to fit the model > and a predict(X) method that, given unlabeled observations X, returns the predicted
          labels y. (Source)




          Another note, is that you can't pass direct strings to a model - you'll need to preprocess your data like you did for your training set. Here is a good example of building a classifier and using it to predict new points.



          Let me know if you have any questions of I've misunderstood.






          share|improve this answer









          $endgroup$













            Your Answer








            StackExchange.ready(function()
            var channelOptions =
            tags: "".split(" "),
            id: "557"
            ;
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function()
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled)
            StackExchange.using("snippets", function()
            createEditor();
            );

            else
            createEditor();

            );

            function createEditor()
            StackExchange.prepareEditor(
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader:
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            ,
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            );



            );






            Bia Calico Cat is a new contributor. Be nice, and check out our Code of Conduct.









            draft saved

            draft discarded


















            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f49503%2fuse-text-classifier-on-unseen-data%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            0












            $begingroup$

            Welcome to the forums.



            My understanding is that you're wanting to use the previously trained model to label new data points? If so, you'll be wanting to use .predict(X). From sklearn's documentation they say.




            All supervised estimators in scikit-learn implement a fit(X, y) method to fit the model > and a predict(X) method that, given unlabeled observations X, returns the predicted
            labels y. (Source)




            Another note, is that you can't pass direct strings to a model - you'll need to preprocess your data like you did for your training set. Here is a good example of building a classifier and using it to predict new points.



            Let me know if you have any questions of I've misunderstood.






            share|improve this answer









            $endgroup$

















              0












              $begingroup$

              Welcome to the forums.



              My understanding is that you're wanting to use the previously trained model to label new data points? If so, you'll be wanting to use .predict(X). From sklearn's documentation they say.




              All supervised estimators in scikit-learn implement a fit(X, y) method to fit the model > and a predict(X) method that, given unlabeled observations X, returns the predicted
              labels y. (Source)




              Another note, is that you can't pass direct strings to a model - you'll need to preprocess your data like you did for your training set. Here is a good example of building a classifier and using it to predict new points.



              Let me know if you have any questions of I've misunderstood.






              share|improve this answer









              $endgroup$















                0












                0








                0





                $begingroup$

                Welcome to the forums.



                My understanding is that you're wanting to use the previously trained model to label new data points? If so, you'll be wanting to use .predict(X). From sklearn's documentation they say.




                All supervised estimators in scikit-learn implement a fit(X, y) method to fit the model > and a predict(X) method that, given unlabeled observations X, returns the predicted
                labels y. (Source)




                Another note, is that you can't pass direct strings to a model - you'll need to preprocess your data like you did for your training set. Here is a good example of building a classifier and using it to predict new points.



                Let me know if you have any questions of I've misunderstood.






                share|improve this answer









                $endgroup$



                Welcome to the forums.



                My understanding is that you're wanting to use the previously trained model to label new data points? If so, you'll be wanting to use .predict(X). From sklearn's documentation they say.




                All supervised estimators in scikit-learn implement a fit(X, y) method to fit the model > and a predict(X) method that, given unlabeled observations X, returns the predicted
                labels y. (Source)




                Another note, is that you can't pass direct strings to a model - you'll need to preprocess your data like you did for your training set. Here is a good example of building a classifier and using it to predict new points.



                Let me know if you have any questions of I've misunderstood.







                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered 11 mins ago









                James CJames C

                362




                362




















                    Bia Calico Cat is a new contributor. Be nice, and check out our Code of Conduct.









                    draft saved

                    draft discarded


















                    Bia Calico Cat is a new contributor. Be nice, and check out our Code of Conduct.












                    Bia Calico Cat is a new contributor. Be nice, and check out our Code of Conduct.











                    Bia Calico Cat is a new contributor. Be nice, and check out our Code of Conduct.














                    Thanks for contributing an answer to Data Science Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid


                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.

                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function ()
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f49503%2fuse-text-classifier-on-unseen-data%23new-answer', 'question_page');

                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    ValueError: Error when checking input: expected conv2d_13_input to have shape (3, 150, 150) but got array with shape (150, 150, 3)2019 Community Moderator ElectionError when checking : expected dense_1_input to have shape (None, 5) but got array with shape (200, 1)Error 'Expected 2D array, got 1D array instead:'ValueError: Error when checking input: expected lstm_41_input to have 3 dimensions, but got array with shape (40000,100)ValueError: Error when checking target: expected dense_1 to have shape (7,) but got array with shape (1,)ValueError: Error when checking target: expected dense_2 to have shape (1,) but got array with shape (0,)Keras exception: ValueError: Error when checking input: expected conv2d_1_input to have shape (150, 150, 3) but got array with shape (256, 256, 3)Steps taking too long to completewhen checking input: expected dense_1_input to have shape (13328,) but got array with shape (317,)ValueError: Error when checking target: expected dense_3 to have shape (None, 1) but got array with shape (7715, 40000)Keras exception: Error when checking input: expected dense_input to have shape (2,) but got array with shape (1,)

                    Ружовы пелікан Змест Знешні выгляд | Пашырэнне | Асаблівасці біялогіі | Літаратура | НавігацыяДагледжаная версіяправерана1 зменаДагледжаная версіяправерана1 змена/ 22697590 Сістэматыкана ВіківідахВыявына Вікісховішчы174693363011049382

                    Illegal assignment from SObject to ContactFetching String, Id from Map - Illegal Assignment Id to Field / ObjectError: Compile Error: Illegal assignment from String to BooleanError: List has no rows for assignment to SObjectError on Test Class - System.QueryException: List has no rows for assignment to SObjectRemote action problemDML requires SObject or SObject list type error“Illegal assignment from List to List”Test Class Fail: Batch Class: System.QueryException: List has no rows for assignment to SObjectMapping to a user'List has no rows for assignment to SObject' Mystery