Python Count Leading and Trailing Whitespace












0















I have the following dataframe note the leading and trailing whitespace in the stings:



import pandas as pd
data = ['foo ', ' bar', ' baz ', 'beetle juice']
df = pd.DataFrame(data)


I need to count all strings that have leading andor trailing whitespace but ignore whitespace in the middle of the sting.



So, in the example above, the whitespace count should equal 3.



What's the best way to do this?










share|improve this question



























    0















    I have the following dataframe note the leading and trailing whitespace in the stings:



    import pandas as pd
    data = ['foo ', ' bar', ' baz ', 'beetle juice']
    df = pd.DataFrame(data)


    I need to count all strings that have leading andor trailing whitespace but ignore whitespace in the middle of the sting.



    So, in the example above, the whitespace count should equal 3.



    What's the best way to do this?










    share|improve this question

























      0












      0








      0








      I have the following dataframe note the leading and trailing whitespace in the stings:



      import pandas as pd
      data = ['foo ', ' bar', ' baz ', 'beetle juice']
      df = pd.DataFrame(data)


      I need to count all strings that have leading andor trailing whitespace but ignore whitespace in the middle of the sting.



      So, in the example above, the whitespace count should equal 3.



      What's the best way to do this?










      share|improve this question














      I have the following dataframe note the leading and trailing whitespace in the stings:



      import pandas as pd
      data = ['foo ', ' bar', ' baz ', 'beetle juice']
      df = pd.DataFrame(data)


      I need to count all strings that have leading andor trailing whitespace but ignore whitespace in the middle of the sting.



      So, in the example above, the whitespace count should equal 3.



      What's the best way to do this?







      python-3.x pandas dataframe






      share|improve this question













      share|improve this question











      share|improve this question




      share|improve this question










      asked Nov 22 '18 at 20:26









      FunnyChefFunnyChef

      6402615




      6402615
























          3 Answers
          3






          active

          oldest

          votes


















          1














          This code does what you want.



          import pandas as pd

          data = ['foo ', ' bar', ' baz ', 'beetle juice']

          df = pd.DataFrame(data)
          count = 0

          for i,row in df.iterrows():
          if row[0][0] == " " or row[0][-1] == " ":
          count += 1

          print(count)





          share|improve this answer

































            1














            With .str accessor you can achieve it in one line:



            (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()





            share|improve this answer































              0














              Here is a solution using defaultdict from collection module:



              from collections import defaultdict as df

              data = ['foo ', ' bar', ' baz ', 'beetle juice']
              result = df(int)

              for elm in data:
              if elm.startswith(' '):
              result['leading'] += 1
              elif elm.endswith(' '):
              result['trailing'] += 1

              print(result)
              print(dict(result))
              count = sum(k for k in result.values())
              print(count)


              Output:



              defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
              {'trailing': 1, 'leading': 2}
              3





              share|improve this answer























                Your Answer






                StackExchange.ifUsing("editor", function () {
                StackExchange.using("externalEditor", function () {
                StackExchange.using("snippets", function () {
                StackExchange.snippets.init();
                });
                });
                }, "code-snippets");

                StackExchange.ready(function() {
                var channelOptions = {
                tags: "".split(" "),
                id: "1"
                };
                initTagRenderer("".split(" "), "".split(" "), channelOptions);

                StackExchange.using("externalEditor", function() {
                // Have to fire editor after snippets, if snippets enabled
                if (StackExchange.settings.snippets.snippetsEnabled) {
                StackExchange.using("snippets", function() {
                createEditor();
                });
                }
                else {
                createEditor();
                }
                });

                function createEditor() {
                StackExchange.prepareEditor({
                heartbeatType: 'answer',
                autoActivateHeartbeat: false,
                convertImagesToLinks: true,
                noModals: true,
                showLowRepImageUploadWarning: true,
                reputationToPostImages: 10,
                bindNavPrevention: true,
                postfix: "",
                imageUploader: {
                brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
                contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
                allowUrls: true
                },
                onDemand: true,
                discardSelector: ".discard-answer"
                ,immediatelyShowMarkdownHelp:true
                });


                }
                });














                draft saved

                draft discarded


















                StackExchange.ready(
                function () {
                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53437644%2fpython-count-leading-and-trailing-whitespace%23new-answer', 'question_page');
                }
                );

                Post as a guest















                Required, but never shown

























                3 Answers
                3






                active

                oldest

                votes








                3 Answers
                3






                active

                oldest

                votes









                active

                oldest

                votes






                active

                oldest

                votes









                1














                This code does what you want.



                import pandas as pd

                data = ['foo ', ' bar', ' baz ', 'beetle juice']

                df = pd.DataFrame(data)
                count = 0

                for i,row in df.iterrows():
                if row[0][0] == " " or row[0][-1] == " ":
                count += 1

                print(count)





                share|improve this answer






























                  1














                  This code does what you want.



                  import pandas as pd

                  data = ['foo ', ' bar', ' baz ', 'beetle juice']

                  df = pd.DataFrame(data)
                  count = 0

                  for i,row in df.iterrows():
                  if row[0][0] == " " or row[0][-1] == " ":
                  count += 1

                  print(count)





                  share|improve this answer




























                    1












                    1








                    1







                    This code does what you want.



                    import pandas as pd

                    data = ['foo ', ' bar', ' baz ', 'beetle juice']

                    df = pd.DataFrame(data)
                    count = 0

                    for i,row in df.iterrows():
                    if row[0][0] == " " or row[0][-1] == " ":
                    count += 1

                    print(count)





                    share|improve this answer















                    This code does what you want.



                    import pandas as pd

                    data = ['foo ', ' bar', ' baz ', 'beetle juice']

                    df = pd.DataFrame(data)
                    count = 0

                    for i,row in df.iterrows():
                    if row[0][0] == " " or row[0][-1] == " ":
                    count += 1

                    print(count)






                    share|improve this answer














                    share|improve this answer



                    share|improve this answer








                    edited Nov 22 '18 at 20:42

























                    answered Nov 22 '18 at 20:37









                    Esteban QuirosEsteban Quiros

                    1015




                    1015

























                        1














                        With .str accessor you can achieve it in one line:



                        (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()





                        share|improve this answer




























                          1














                          With .str accessor you can achieve it in one line:



                          (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()





                          share|improve this answer


























                            1












                            1








                            1







                            With .str accessor you can achieve it in one line:



                            (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()





                            share|improve this answer













                            With .str accessor you can achieve it in one line:



                            (df[0].str.startswith(" ") | df[0].str.endswith(" ")).sum()






                            share|improve this answer












                            share|improve this answer



                            share|improve this answer










                            answered Nov 22 '18 at 21:03









                            Julian PellerJulian Peller

                            8941511




                            8941511























                                0














                                Here is a solution using defaultdict from collection module:



                                from collections import defaultdict as df

                                data = ['foo ', ' bar', ' baz ', 'beetle juice']
                                result = df(int)

                                for elm in data:
                                if elm.startswith(' '):
                                result['leading'] += 1
                                elif elm.endswith(' '):
                                result['trailing'] += 1

                                print(result)
                                print(dict(result))
                                count = sum(k for k in result.values())
                                print(count)


                                Output:



                                defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
                                {'trailing': 1, 'leading': 2}
                                3





                                share|improve this answer




























                                  0














                                  Here is a solution using defaultdict from collection module:



                                  from collections import defaultdict as df

                                  data = ['foo ', ' bar', ' baz ', 'beetle juice']
                                  result = df(int)

                                  for elm in data:
                                  if elm.startswith(' '):
                                  result['leading'] += 1
                                  elif elm.endswith(' '):
                                  result['trailing'] += 1

                                  print(result)
                                  print(dict(result))
                                  count = sum(k for k in result.values())
                                  print(count)


                                  Output:



                                  defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
                                  {'trailing': 1, 'leading': 2}
                                  3





                                  share|improve this answer


























                                    0












                                    0








                                    0







                                    Here is a solution using defaultdict from collection module:



                                    from collections import defaultdict as df

                                    data = ['foo ', ' bar', ' baz ', 'beetle juice']
                                    result = df(int)

                                    for elm in data:
                                    if elm.startswith(' '):
                                    result['leading'] += 1
                                    elif elm.endswith(' '):
                                    result['trailing'] += 1

                                    print(result)
                                    print(dict(result))
                                    count = sum(k for k in result.values())
                                    print(count)


                                    Output:



                                    defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
                                    {'trailing': 1, 'leading': 2}
                                    3





                                    share|improve this answer













                                    Here is a solution using defaultdict from collection module:



                                    from collections import defaultdict as df

                                    data = ['foo ', ' bar', ' baz ', 'beetle juice']
                                    result = df(int)

                                    for elm in data:
                                    if elm.startswith(' '):
                                    result['leading'] += 1
                                    elif elm.endswith(' '):
                                    result['trailing'] += 1

                                    print(result)
                                    print(dict(result))
                                    count = sum(k for k in result.values())
                                    print(count)


                                    Output:



                                    defaultdict(<class 'int'>, {'trailing': 1, 'leading': 2})
                                    {'trailing': 1, 'leading': 2}
                                    3






                                    share|improve this answer












                                    share|improve this answer



                                    share|improve this answer










                                    answered Nov 22 '18 at 20:45









                                    Chiheb NexusChiheb Nexus

                                    5,01031627




                                    5,01031627






























                                        draft saved

                                        draft discarded




















































                                        Thanks for contributing an answer to Stack Overflow!


                                        • Please be sure to answer the question. Provide details and share your research!

                                        But avoid



                                        • Asking for help, clarification, or responding to other answers.

                                        • Making statements based on opinion; back them up with references or personal experience.


                                        To learn more, see our tips on writing great answers.




                                        draft saved


                                        draft discarded














                                        StackExchange.ready(
                                        function () {
                                        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53437644%2fpython-count-leading-and-trailing-whitespace%23new-answer', 'question_page');
                                        }
                                        );

                                        Post as a guest















                                        Required, but never shown





















































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown

































                                        Required, but never shown














                                        Required, but never shown












                                        Required, but never shown







                                        Required, but never shown







                                        Popular posts from this blog

                                        404 Error Contact Form 7 ajax form submitting

                                        How to know if a Active Directory user can login interactively

                                        TypeError: fit_transform() missing 1 required positional argument: 'X'