How to resume training of a model?2019 Community Moderator ElectionDeep Learning: Feed Forward for Unbalanced Classes Using Tensor FlowHow to know the model has started overfitting?Neural Network: how to interpret this loss graph?Why is training take so long on my GPU?Problem with keras model loadingTraining Error decreasing with each epochTensorflow CNN sometimes converges, sometimes notWhy does my LSTM perform better when randomizing training subset vs. standard batch training?How to speed up passing of images to a GPUStop CNN model at high accuracy and low loss rate?

Why is the design of haulage companies so “special”?

Infinite past with a beginning?

How can the DM most effectively choose 1 out of an odd number of players to be targeted by an attack or effect?

How to type dʒ symbol (IPA) on Mac?

Is it possible to make sharp wind that can cut stuff from afar?

Pronouncing Dictionary.com's W.O.D "vade mecum" in English

How do we improve the relationship with a client software team that performs poorly and is becoming less collaborative?

How to make payment on the internet without leaving a money trail?

A newer friend of my brother's gave him a load of baseball cards that are supposedly extremely valuable. Is this a scam?

What would the Romans have called "sorcery"?

How can I hide my bitcoin transactions to protect anonymity from others?

I see my dog run

Copycat chess is back

How to add power-LED to my small amplifier?

Can a German sentence have two subjects?

Why is this code 6.5x slower with optimizations enabled?

Why don't electron-positron collisions release infinite energy?

What would happen to a modern skyscraper if it rains micro blackholes?

How is it possible to have an ability score that is less than 3?

Motorized valve interfering with button?

What is the command to reset a PC without deleting any files

Copenhagen passport control - US citizen

What Brexit solution does the DUP want?

What defenses are there against being summoned by the Gate spell?



How to resume training of a model?



2019 Community Moderator ElectionDeep Learning: Feed Forward for Unbalanced Classes Using Tensor FlowHow to know the model has started overfitting?Neural Network: how to interpret this loss graph?Why is training take so long on my GPU?Problem with keras model loadingTraining Error decreasing with each epochTensorflow CNN sometimes converges, sometimes notWhy does my LSTM perform better when randomizing training subset vs. standard batch training?How to speed up passing of images to a GPUStop CNN model at high accuracy and low loss rate?










1












$begingroup$


I have not GPU support so it often happens that my model takes hours to train. Can I train my model in batches , for example if I want to have 100 epochs for my model,but due to power cut my training stops(at 50th epoch) but when I retrain my model I want to train it from where it was left (from 50th epoch).



It would be much appreciated if anyone can explain it by some example.










share|improve this question











$endgroup$











  • $begingroup$
    This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
    $endgroup$
    – ncasas
    Oct 16 '17 at 17:56










  • $begingroup$
    I am using tensorflow
    $endgroup$
    – Berry
    Oct 16 '17 at 18:27















1












$begingroup$


I have not GPU support so it often happens that my model takes hours to train. Can I train my model in batches , for example if I want to have 100 epochs for my model,but due to power cut my training stops(at 50th epoch) but when I retrain my model I want to train it from where it was left (from 50th epoch).



It would be much appreciated if anyone can explain it by some example.










share|improve this question











$endgroup$











  • $begingroup$
    This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
    $endgroup$
    – ncasas
    Oct 16 '17 at 17:56










  • $begingroup$
    I am using tensorflow
    $endgroup$
    – Berry
    Oct 16 '17 at 18:27













1












1








1





$begingroup$


I have not GPU support so it often happens that my model takes hours to train. Can I train my model in batches , for example if I want to have 100 epochs for my model,but due to power cut my training stops(at 50th epoch) but when I retrain my model I want to train it from where it was left (from 50th epoch).



It would be much appreciated if anyone can explain it by some example.










share|improve this question











$endgroup$




I have not GPU support so it often happens that my model takes hours to train. Can I train my model in batches , for example if I want to have 100 epochs for my model,but due to power cut my training stops(at 50th epoch) but when I retrain my model I want to train it from where it was left (from 50th epoch).



It would be much appreciated if anyone can explain it by some example.







machine-learning python neural-network deep-learning tensorflow






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Oct 16 '17 at 20:58









ncasas

3,7481131




3,7481131










asked Oct 16 '17 at 17:23









BerryBerry

374




374











  • $begingroup$
    This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
    $endgroup$
    – ncasas
    Oct 16 '17 at 17:56










  • $begingroup$
    I am using tensorflow
    $endgroup$
    – Berry
    Oct 16 '17 at 18:27
















  • $begingroup$
    This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
    $endgroup$
    – ncasas
    Oct 16 '17 at 17:56










  • $begingroup$
    I am using tensorflow
    $endgroup$
    – Berry
    Oct 16 '17 at 18:27















$begingroup$
This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
$endgroup$
– ncasas
Oct 16 '17 at 17:56




$begingroup$
This is possible with most (all?) mainstream deep learning frameworks by simply storing the model every N training iterations and checking for the last stored model before starting the training. Which framework are you using?
$endgroup$
– ncasas
Oct 16 '17 at 17:56












$begingroup$
I am using tensorflow
$endgroup$
– Berry
Oct 16 '17 at 18:27




$begingroup$
I am using tensorflow
$endgroup$
– Berry
Oct 16 '17 at 18:27










2 Answers
2






active

oldest

votes


















3












$begingroup$

With tensorflow, currently the most straightforward and easy way to get persistence for your model is to use a tf.train.MonitoredTrainingSession. You just need to use it instead the normal tf.Session() that is frequently used. This an illustrative Python snippet:





with tf.train.MonitoredTrainingSession(checkpoint_dir='/tmp/mymodel',
save_summaries_secs=600) as sess:
_ = sess.run(train_op, feed_dict=...)


With this, your model is automagically saved every 600 secs in /tmp/mymodel and restored the next time you restart the program.






share|improve this answer











$endgroup$




















    0












    $begingroup$

    @ncasas could you please give how to do the same using keras ?






    share|improve this answer








    New contributor




    user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
    Check out our Code of Conduct.






    $endgroup$








    • 1




      $begingroup$
      This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
      $endgroup$
      – oW_
      3 hours ago











    Your Answer





    StackExchange.ifUsing("editor", function ()
    return StackExchange.using("mathjaxEditing", function ()
    StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
    StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
    );
    );
    , "mathjax-editing");

    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "557"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    autoActivateHeartbeat: false,
    convertImagesToLinks: false,
    noModals: true,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: null,
    bindNavPrevention: true,
    postfix: "",
    imageUploader:
    brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
    contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
    allowUrls: true
    ,
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













    draft saved

    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f23848%2fhow-to-resume-training-of-a-model%23new-answer', 'question_page');

    );

    Post as a guest















    Required, but never shown

























    2 Answers
    2






    active

    oldest

    votes








    2 Answers
    2






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes









    3












    $begingroup$

    With tensorflow, currently the most straightforward and easy way to get persistence for your model is to use a tf.train.MonitoredTrainingSession. You just need to use it instead the normal tf.Session() that is frequently used. This an illustrative Python snippet:





    with tf.train.MonitoredTrainingSession(checkpoint_dir='/tmp/mymodel',
    save_summaries_secs=600) as sess:
    _ = sess.run(train_op, feed_dict=...)


    With this, your model is automagically saved every 600 secs in /tmp/mymodel and restored the next time you restart the program.






    share|improve this answer











    $endgroup$

















      3












      $begingroup$

      With tensorflow, currently the most straightforward and easy way to get persistence for your model is to use a tf.train.MonitoredTrainingSession. You just need to use it instead the normal tf.Session() that is frequently used. This an illustrative Python snippet:





      with tf.train.MonitoredTrainingSession(checkpoint_dir='/tmp/mymodel',
      save_summaries_secs=600) as sess:
      _ = sess.run(train_op, feed_dict=...)


      With this, your model is automagically saved every 600 secs in /tmp/mymodel and restored the next time you restart the program.






      share|improve this answer











      $endgroup$















        3












        3








        3





        $begingroup$

        With tensorflow, currently the most straightforward and easy way to get persistence for your model is to use a tf.train.MonitoredTrainingSession. You just need to use it instead the normal tf.Session() that is frequently used. This an illustrative Python snippet:





        with tf.train.MonitoredTrainingSession(checkpoint_dir='/tmp/mymodel',
        save_summaries_secs=600) as sess:
        _ = sess.run(train_op, feed_dict=...)


        With this, your model is automagically saved every 600 secs in /tmp/mymodel and restored the next time you restart the program.






        share|improve this answer











        $endgroup$



        With tensorflow, currently the most straightforward and easy way to get persistence for your model is to use a tf.train.MonitoredTrainingSession. You just need to use it instead the normal tf.Session() that is frequently used. This an illustrative Python snippet:





        with tf.train.MonitoredTrainingSession(checkpoint_dir='/tmp/mymodel',
        save_summaries_secs=600) as sess:
        _ = sess.run(train_op, feed_dict=...)


        With this, your model is automagically saved every 600 secs in /tmp/mymodel and restored the next time you restart the program.







        share|improve this answer














        share|improve this answer



        share|improve this answer








        edited Oct 16 '17 at 21:04

























        answered Oct 16 '17 at 20:58









        ncasasncasas

        3,7481131




        3,7481131





















            0












            $begingroup$

            @ncasas could you please give how to do the same using keras ?






            share|improve this answer








            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.






            $endgroup$








            • 1




              $begingroup$
              This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
              $endgroup$
              – oW_
              3 hours ago















            0












            $begingroup$

            @ncasas could you please give how to do the same using keras ?






            share|improve this answer








            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.






            $endgroup$








            • 1




              $begingroup$
              This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
              $endgroup$
              – oW_
              3 hours ago













            0












            0








            0





            $begingroup$

            @ncasas could you please give how to do the same using keras ?






            share|improve this answer








            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.






            $endgroup$



            @ncasas could you please give how to do the same using keras ?







            share|improve this answer








            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.









            share|improve this answer



            share|improve this answer






            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.









            answered 7 hours ago









            user2351509user2351509

            1




            1




            New contributor




            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.





            New contributor





            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.






            user2351509 is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.







            • 1




              $begingroup$
              This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
              $endgroup$
              – oW_
              3 hours ago












            • 1




              $begingroup$
              This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
              $endgroup$
              – oW_
              3 hours ago







            1




            1




            $begingroup$
            This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
            $endgroup$
            – oW_
            3 hours ago




            $begingroup$
            This does not provide an answer to the question. Once you have sufficient reputation you will be able to comment on any post; instead, provide answers that don't require clarification from the asker. - From Review
            $endgroup$
            – oW_
            3 hours ago

















            draft saved

            draft discarded
















































            Thanks for contributing an answer to Data Science Stack Exchange!


            • Please be sure to answer the question. Provide details and share your research!

            But avoid


            • Asking for help, clarification, or responding to other answers.

            • Making statements based on opinion; back them up with references or personal experience.

            Use MathJax to format equations. MathJax reference.


            To learn more, see our tips on writing great answers.




            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f23848%2fhow-to-resume-training-of-a-model%23new-answer', 'question_page');

            );

            Post as a guest















            Required, but never shown





















































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown

































            Required, but never shown














            Required, but never shown












            Required, but never shown







            Required, but never shown







            Popular posts from this blog

            Францішак Багушэвіч Змест Сям'я | Біяграфія | Творчасць | Мова Багушэвіча | Ацэнкі дзейнасці | Цікавыя факты | Спадчына | Выбраная бібліяграфія | Ушанаванне памяці | У філатэліі | Зноскі | Літаратура | Спасылкі | НавігацыяЛяхоўскі У. Рупіўся дзеля Бога і людзей: Жыццёвы шлях Лявона Вітан-Дубейкаўскага // Вольскі і Памідораў з песняй пра немца Адвакат, паэт, народны заступнік Ашмянскі веснікВ Минске появится площадь Богушевича и улица Сырокомли, Белорусская деловая газета, 19 июля 2001 г.Айцец беларускай нацыянальнай ідэі паўстаў у бронзе Сяргей Аляксандравіч Адашкевіч (1918, Мінск). 80-я гады. Бюст «Францішак Багушэвіч».Яўген Мікалаевіч Ціхановіч. «Партрэт Францішка Багушэвіча»Мікола Мікалаевіч Купава. «Партрэт зачынальніка новай беларускай літаратуры Францішка Багушэвіча»Уладзімір Іванавіч Мелехаў. На помніку «Змагарам за родную мову» Барэльеф «Францішак Багушэвіч»Памяць пра Багушэвіча на Віленшчыне Страчаная сталіца. Беларускія шыльды на вуліцах Вільні«Krynica». Ideologia i przywódcy białoruskiego katolicyzmuФранцішак БагушэвічТворы на knihi.comТворы Францішка Багушэвіча на bellib.byСодаль Уладзімір. Францішак Багушэвіч на Лідчыне;Луцкевіч Антон. Жыцьцё і творчасьць Фр. Багушэвіча ў успамінах ягоных сучасьнікаў // Запісы Беларускага Навуковага таварыства. Вільня, 1938. Сшытак 1. С. 16-34.Большая российская1188761710000 0000 5537 633Xn9209310021619551927869394п

            Partai Komunis Tiongkok Daftar isi Kepemimpinan | Pranala luar | Referensi | Menu navigasidiperiksa1 perubahan tertundacpc.people.com.cnSitus resmiSurat kabar resmi"Why the Communist Party is alive, well and flourishing in China"0307-1235"Full text of Constitution of Communist Party of China"smengembangkannyas

            ValueError: Expected n_neighbors <= n_samples, but n_samples = 1, n_neighbors = 6 (SMOTE) The 2019 Stack Overflow Developer Survey Results Are InCan SMOTE be applied over sequence of words (sentences)?ValueError when doing validation with random forestsSMOTE and multi class oversamplingLogic behind SMOTE-NC?ValueError: Error when checking target: expected dense_1 to have shape (7,) but got array with shape (1,)SmoteBoost: Should SMOTE be ran individually for each iteration/tree in the boosting?solving multi-class imbalance classification using smote and OSSUsing SMOTE for Synthetic Data generation to improve performance on unbalanced dataproblem of entry format for a simple model in KerasSVM SMOTE fit_resample() function runs forever with no result