Looping over pandas DataFrame












0















I have a weird issue that the result doesn't change for each iteration. The code is the following:



import pandas as pd
import numpy as np

X = np.arange(10,100)
Y = X[::-1]
Z = np.array([X,Y]).T

df = pd.DataFrame(Z ,columns = ['col1','col2'])
dif = df['col1'] - df['col2']

for gap in range(100):
Up = dif > gap
Down = dif < -gap

df.loc[Up,'predict'] = 'Up'
df.loc[Down,'predict'] = 'Down'

df_result = df.dropna()
Total = df.shape[0]
count = df_result.shape[0]
ratio = count/Total
print(f'Total: {Total}; count: {count}; ratio: {ratio}')


The result is always



Total: 90; count: 90; ratio: 1.0


when it shouldn't be. Thank you in advance










share|improve this question



























    0















    I have a weird issue that the result doesn't change for each iteration. The code is the following:



    import pandas as pd
    import numpy as np

    X = np.arange(10,100)
    Y = X[::-1]
    Z = np.array([X,Y]).T

    df = pd.DataFrame(Z ,columns = ['col1','col2'])
    dif = df['col1'] - df['col2']

    for gap in range(100):
    Up = dif > gap
    Down = dif < -gap

    df.loc[Up,'predict'] = 'Up'
    df.loc[Down,'predict'] = 'Down'

    df_result = df.dropna()
    Total = df.shape[0]
    count = df_result.shape[0]
    ratio = count/Total
    print(f'Total: {Total}; count: {count}; ratio: {ratio}')


    The result is always



    Total: 90; count: 90; ratio: 1.0


    when it shouldn't be. Thank you in advance










    share|improve this question

























      0












      0








      0








      I have a weird issue that the result doesn't change for each iteration. The code is the following:



      import pandas as pd
      import numpy as np

      X = np.arange(10,100)
      Y = X[::-1]
      Z = np.array([X,Y]).T

      df = pd.DataFrame(Z ,columns = ['col1','col2'])
      dif = df['col1'] - df['col2']

      for gap in range(100):
      Up = dif > gap
      Down = dif < -gap

      df.loc[Up,'predict'] = 'Up'
      df.loc[Down,'predict'] = 'Down'

      df_result = df.dropna()
      Total = df.shape[0]
      count = df_result.shape[0]
      ratio = count/Total
      print(f'Total: {Total}; count: {count}; ratio: {ratio}')


      The result is always



      Total: 90; count: 90; ratio: 1.0


      when it shouldn't be. Thank you in advance










      share|improve this question














      I have a weird issue that the result doesn't change for each iteration. The code is the following:



      import pandas as pd
      import numpy as np

      X = np.arange(10,100)
      Y = X[::-1]
      Z = np.array([X,Y]).T

      df = pd.DataFrame(Z ,columns = ['col1','col2'])
      dif = df['col1'] - df['col2']

      for gap in range(100):
      Up = dif > gap
      Down = dif < -gap

      df.loc[Up,'predict'] = 'Up'
      df.loc[Down,'predict'] = 'Down'

      df_result = df.dropna()
      Total = df.shape[0]
      count = df_result.shape[0]
      ratio = count/Total
      print(f'Total: {Total}; count: {count}; ratio: {ratio}')


      The result is always



      Total: 90; count: 90; ratio: 1.0


      when it shouldn't be. Thank you in advance







      python-3.x pandas loops numpy na






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 25 '18 at 9:11









      mathguymathguy

      1147




      1147
























          1 Answer
          1






          active

          oldest

          votes


















          1














          Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



          import pandas as pd
          import numpy as np

          X = np.arange(10,100)
          Y = X[::-1]
          Z = np.array([X,Y]).T

          df = pd.DataFrame(Z ,columns = ['col1','col2'])
          df2 = df.copy()#added this line to preserve the original df
          dif = df['col1'] - df['col2']

          for gap in range(100):
          df = df2.copy()#reset the altered df back to the original
          Up = dif > gap
          Down = dif < -gap

          df.loc[Up,'predict'] = 'Up'
          df.loc[Down,'predict'] = 'Down'

          df_result = df.dropna()
          Total = df.shape[0]
          count = df_result.shape[0]
          ratio = count/Total
          print(f'Total: {Total}; count: {count}; ratio: {ratio}')





          share|improve this answer























            Your Answer






            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "1"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: true,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: 10,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53466086%2flooping-over-pandas-dataframe%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            1














            Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



            import pandas as pd
            import numpy as np

            X = np.arange(10,100)
            Y = X[::-1]
            Z = np.array([X,Y]).T

            df = pd.DataFrame(Z ,columns = ['col1','col2'])
            df2 = df.copy()#added this line to preserve the original df
            dif = df['col1'] - df['col2']

            for gap in range(100):
            df = df2.copy()#reset the altered df back to the original
            Up = dif > gap
            Down = dif < -gap

            df.loc[Up,'predict'] = 'Up'
            df.loc[Down,'predict'] = 'Down'

            df_result = df.dropna()
            Total = df.shape[0]
            count = df_result.shape[0]
            ratio = count/Total
            print(f'Total: {Total}; count: {count}; ratio: {ratio}')





            share|improve this answer




























              1














              Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



              import pandas as pd
              import numpy as np

              X = np.arange(10,100)
              Y = X[::-1]
              Z = np.array([X,Y]).T

              df = pd.DataFrame(Z ,columns = ['col1','col2'])
              df2 = df.copy()#added this line to preserve the original df
              dif = df['col1'] - df['col2']

              for gap in range(100):
              df = df2.copy()#reset the altered df back to the original
              Up = dif > gap
              Down = dif < -gap

              df.loc[Up,'predict'] = 'Up'
              df.loc[Down,'predict'] = 'Down'

              df_result = df.dropna()
              Total = df.shape[0]
              count = df_result.shape[0]
              ratio = count/Total
              print(f'Total: {Total}; count: {count}; ratio: {ratio}')





              share|improve this answer


























                1












                1








                1







                Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



                import pandas as pd
                import numpy as np

                X = np.arange(10,100)
                Y = X[::-1]
                Z = np.array([X,Y]).T

                df = pd.DataFrame(Z ,columns = ['col1','col2'])
                df2 = df.copy()#added this line to preserve the original df
                dif = df['col1'] - df['col2']

                for gap in range(100):
                df = df2.copy()#reset the altered df back to the original
                Up = dif > gap
                Down = dif < -gap

                df.loc[Up,'predict'] = 'Up'
                df.loc[Down,'predict'] = 'Down'

                df_result = df.dropna()
                Total = df.shape[0]
                count = df_result.shape[0]
                ratio = count/Total
                print(f'Total: {Total}; count: {count}; ratio: {ratio}')





                share|improve this answer













                Found the root of the problem 5 mins after posting this question. I just needed to reset the dataFrame to the original to fix the problem.



                import pandas as pd
                import numpy as np

                X = np.arange(10,100)
                Y = X[::-1]
                Z = np.array([X,Y]).T

                df = pd.DataFrame(Z ,columns = ['col1','col2'])
                df2 = df.copy()#added this line to preserve the original df
                dif = df['col1'] - df['col2']

                for gap in range(100):
                df = df2.copy()#reset the altered df back to the original
                Up = dif > gap
                Down = dif < -gap

                df.loc[Up,'predict'] = 'Up'
                df.loc[Down,'predict'] = 'Down'

                df_result = df.dropna()
                Total = df.shape[0]
                count = df_result.shape[0]
                ratio = count/Total
                print(f'Total: {Total}; count: {count}; ratio: {ratio}')






                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered Nov 25 '18 at 9:20









                mathguymathguy

                1147




                1147
































                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Stack Overflow!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53466086%2flooping-over-pandas-dataframe%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    404 Error Contact Form 7 ajax form submitting

                    How to know if a Active Directory user can login interactively

                    Refactoring coordinates for Minecraft Pi buildings written in Python