Probability suite for an array of elements












0














I wrote a function that can be used to generate a probability distribution for an array of elements (probGen()), and another that can be used to select an element from an array with a specified probability (probSelect()). I want to optimise my code.



probGen.php



<?
require_once("randX.php"); #"randX()" generates a random floating point number in a specified range.
error_reporting(E_ERROR | E_WARNING | E_PARSE);

function probGen(array $arr, float $control = 0.01)
/*
* Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
* Input:
$arr: An array of elements.
$control: A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
If an invalid value is provided, the default is used.
* Output: An associative array where the keys are the elements in the original array, and the values are their probabilities.
*/
{
$control = ($control <= 1 && $control >= 0)?($control):(0.00001); #Use the default value if an invalid number is supplied.
static $result = ; #Initialises $result with an empty array on first function call.
static $max = 1; #Initialises $max with 1 on first function call.
foreach ($arr as $value)
{
$x = randX(0, $max); #Random probability value.
$result[$value] = ($result[$value] + $x)??0; #Initialise the array with 0 on first call, and on subsequent calls increment by $x to assign probability mass.
$max -= $x; #Ensures that the probability never sums to more than one.
}
/*
* After the execution of the above code, there would be some leftover probability mass.
* The code below adds it to a random element.
*/
$var = array_values($arr);
if($max <= $control) #To limit concentration of most of the probability mass in one variable.
{
$result[$var[rand(0,(count($var)-1))]] += $max; #Selects a random key and adds $max to it.
return $result;
}
else
return probGen($arr, $control);
}
?>


probSelect.php



<?
require_once("confirm.php");

function probSelect(array $arr)
/*
* A function to select an element from an array with indicated probabilites.
* Input: An associative array whose keys are the elements to be selected from, and whose values are the associated probabilities.
* Output: The selected element, or "NULL" if an invalid probability distribution was supplied.
*/
{
if(confirm($arr))
{
$var = lcg_value(); #The random float that would be used to select the element.
$sum = 0;
foreach ($arr as $key => $value)
{
$sum += $value;
if($var <= $sum)
return $key;
}
}
else
{
print("ERROR!!! The supplied probability distribution must sum to 1. <br>");
return null;
}
}
?>




Dependencies



The required functions in case they are necessary for evaluating performance:



randX.php



<?
function randX(float $a, float $b): float
/*
* Generates a random number between between two real numbers (both inclusive).
* Input: Two floating point numbers.
* Output: A random floating point number.
*/
{
$max = max($a, $b);
$min = min($a, $b);
$x = $max - $min;
$y = lcg_value()*$x;
return ($min+$y);
}
?>


confirm.php



<?
function confirm(array $arr) #Confirms that the supplied array has a valid probability distribution
{
return (array_sum($arr) == 1)?true:false;
}
?>









share|improve this question





























    0














    I wrote a function that can be used to generate a probability distribution for an array of elements (probGen()), and another that can be used to select an element from an array with a specified probability (probSelect()). I want to optimise my code.



    probGen.php



    <?
    require_once("randX.php"); #"randX()" generates a random floating point number in a specified range.
    error_reporting(E_ERROR | E_WARNING | E_PARSE);

    function probGen(array $arr, float $control = 0.01)
    /*
    * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
    * Input:
    $arr: An array of elements.
    $control: A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
    If an invalid value is provided, the default is used.
    * Output: An associative array where the keys are the elements in the original array, and the values are their probabilities.
    */
    {
    $control = ($control <= 1 && $control >= 0)?($control):(0.00001); #Use the default value if an invalid number is supplied.
    static $result = ; #Initialises $result with an empty array on first function call.
    static $max = 1; #Initialises $max with 1 on first function call.
    foreach ($arr as $value)
    {
    $x = randX(0, $max); #Random probability value.
    $result[$value] = ($result[$value] + $x)??0; #Initialise the array with 0 on first call, and on subsequent calls increment by $x to assign probability mass.
    $max -= $x; #Ensures that the probability never sums to more than one.
    }
    /*
    * After the execution of the above code, there would be some leftover probability mass.
    * The code below adds it to a random element.
    */
    $var = array_values($arr);
    if($max <= $control) #To limit concentration of most of the probability mass in one variable.
    {
    $result[$var[rand(0,(count($var)-1))]] += $max; #Selects a random key and adds $max to it.
    return $result;
    }
    else
    return probGen($arr, $control);
    }
    ?>


    probSelect.php



    <?
    require_once("confirm.php");

    function probSelect(array $arr)
    /*
    * A function to select an element from an array with indicated probabilites.
    * Input: An associative array whose keys are the elements to be selected from, and whose values are the associated probabilities.
    * Output: The selected element, or "NULL" if an invalid probability distribution was supplied.
    */
    {
    if(confirm($arr))
    {
    $var = lcg_value(); #The random float that would be used to select the element.
    $sum = 0;
    foreach ($arr as $key => $value)
    {
    $sum += $value;
    if($var <= $sum)
    return $key;
    }
    }
    else
    {
    print("ERROR!!! The supplied probability distribution must sum to 1. <br>");
    return null;
    }
    }
    ?>




    Dependencies



    The required functions in case they are necessary for evaluating performance:



    randX.php



    <?
    function randX(float $a, float $b): float
    /*
    * Generates a random number between between two real numbers (both inclusive).
    * Input: Two floating point numbers.
    * Output: A random floating point number.
    */
    {
    $max = max($a, $b);
    $min = min($a, $b);
    $x = $max - $min;
    $y = lcg_value()*$x;
    return ($min+$y);
    }
    ?>


    confirm.php



    <?
    function confirm(array $arr) #Confirms that the supplied array has a valid probability distribution
    {
    return (array_sum($arr) == 1)?true:false;
    }
    ?>









    share|improve this question



























      0












      0








      0







      I wrote a function that can be used to generate a probability distribution for an array of elements (probGen()), and another that can be used to select an element from an array with a specified probability (probSelect()). I want to optimise my code.



      probGen.php



      <?
      require_once("randX.php"); #"randX()" generates a random floating point number in a specified range.
      error_reporting(E_ERROR | E_WARNING | E_PARSE);

      function probGen(array $arr, float $control = 0.01)
      /*
      * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
      * Input:
      $arr: An array of elements.
      $control: A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
      If an invalid value is provided, the default is used.
      * Output: An associative array where the keys are the elements in the original array, and the values are their probabilities.
      */
      {
      $control = ($control <= 1 && $control >= 0)?($control):(0.00001); #Use the default value if an invalid number is supplied.
      static $result = ; #Initialises $result with an empty array on first function call.
      static $max = 1; #Initialises $max with 1 on first function call.
      foreach ($arr as $value)
      {
      $x = randX(0, $max); #Random probability value.
      $result[$value] = ($result[$value] + $x)??0; #Initialise the array with 0 on first call, and on subsequent calls increment by $x to assign probability mass.
      $max -= $x; #Ensures that the probability never sums to more than one.
      }
      /*
      * After the execution of the above code, there would be some leftover probability mass.
      * The code below adds it to a random element.
      */
      $var = array_values($arr);
      if($max <= $control) #To limit concentration of most of the probability mass in one variable.
      {
      $result[$var[rand(0,(count($var)-1))]] += $max; #Selects a random key and adds $max to it.
      return $result;
      }
      else
      return probGen($arr, $control);
      }
      ?>


      probSelect.php



      <?
      require_once("confirm.php");

      function probSelect(array $arr)
      /*
      * A function to select an element from an array with indicated probabilites.
      * Input: An associative array whose keys are the elements to be selected from, and whose values are the associated probabilities.
      * Output: The selected element, or "NULL" if an invalid probability distribution was supplied.
      */
      {
      if(confirm($arr))
      {
      $var = lcg_value(); #The random float that would be used to select the element.
      $sum = 0;
      foreach ($arr as $key => $value)
      {
      $sum += $value;
      if($var <= $sum)
      return $key;
      }
      }
      else
      {
      print("ERROR!!! The supplied probability distribution must sum to 1. <br>");
      return null;
      }
      }
      ?>




      Dependencies



      The required functions in case they are necessary for evaluating performance:



      randX.php



      <?
      function randX(float $a, float $b): float
      /*
      * Generates a random number between between two real numbers (both inclusive).
      * Input: Two floating point numbers.
      * Output: A random floating point number.
      */
      {
      $max = max($a, $b);
      $min = min($a, $b);
      $x = $max - $min;
      $y = lcg_value()*$x;
      return ($min+$y);
      }
      ?>


      confirm.php



      <?
      function confirm(array $arr) #Confirms that the supplied array has a valid probability distribution
      {
      return (array_sum($arr) == 1)?true:false;
      }
      ?>









      share|improve this question















      I wrote a function that can be used to generate a probability distribution for an array of elements (probGen()), and another that can be used to select an element from an array with a specified probability (probSelect()). I want to optimise my code.



      probGen.php



      <?
      require_once("randX.php"); #"randX()" generates a random floating point number in a specified range.
      error_reporting(E_ERROR | E_WARNING | E_PARSE);

      function probGen(array $arr, float $control = 0.01)
      /*
      * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
      * Input:
      $arr: An array of elements.
      $control: A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
      If an invalid value is provided, the default is used.
      * Output: An associative array where the keys are the elements in the original array, and the values are their probabilities.
      */
      {
      $control = ($control <= 1 && $control >= 0)?($control):(0.00001); #Use the default value if an invalid number is supplied.
      static $result = ; #Initialises $result with an empty array on first function call.
      static $max = 1; #Initialises $max with 1 on first function call.
      foreach ($arr as $value)
      {
      $x = randX(0, $max); #Random probability value.
      $result[$value] = ($result[$value] + $x)??0; #Initialise the array with 0 on first call, and on subsequent calls increment by $x to assign probability mass.
      $max -= $x; #Ensures that the probability never sums to more than one.
      }
      /*
      * After the execution of the above code, there would be some leftover probability mass.
      * The code below adds it to a random element.
      */
      $var = array_values($arr);
      if($max <= $control) #To limit concentration of most of the probability mass in one variable.
      {
      $result[$var[rand(0,(count($var)-1))]] += $max; #Selects a random key and adds $max to it.
      return $result;
      }
      else
      return probGen($arr, $control);
      }
      ?>


      probSelect.php



      <?
      require_once("confirm.php");

      function probSelect(array $arr)
      /*
      * A function to select an element from an array with indicated probabilites.
      * Input: An associative array whose keys are the elements to be selected from, and whose values are the associated probabilities.
      * Output: The selected element, or "NULL" if an invalid probability distribution was supplied.
      */
      {
      if(confirm($arr))
      {
      $var = lcg_value(); #The random float that would be used to select the element.
      $sum = 0;
      foreach ($arr as $key => $value)
      {
      $sum += $value;
      if($var <= $sum)
      return $key;
      }
      }
      else
      {
      print("ERROR!!! The supplied probability distribution must sum to 1. <br>");
      return null;
      }
      }
      ?>




      Dependencies



      The required functions in case they are necessary for evaluating performance:



      randX.php



      <?
      function randX(float $a, float $b): float
      /*
      * Generates a random number between between two real numbers (both inclusive).
      * Input: Two floating point numbers.
      * Output: A random floating point number.
      */
      {
      $max = max($a, $b);
      $min = min($a, $b);
      $x = $max - $min;
      $y = lcg_value()*$x;
      return ($min+$y);
      }
      ?>


      confirm.php



      <?
      function confirm(array $arr) #Confirms that the supplied array has a valid probability distribution
      {
      return (array_sum($arr) == 1)?true:false;
      }
      ?>






      performance beginner php algorithm random






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited 6 mins ago









      Jamal

      30.3k11116226




      30.3k11116226










      asked 12 hours ago









      Tobi Alafin

      1977




      1977






















          1 Answer
          1






          active

          oldest

          votes


















          1














          A have a few little things:



          Don't put comments in between function declarations and curly braces (or anything similar to that like classes or methods)



          That's hard to read and an eye sore. Put your function comments above the function itself. Ideally using standard docblock format.



          /**
          * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
          *
          * @param array $arr An array of elements
          * @param float $control A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
          * @return array An associative array where the keys are the elements in the original array, and the values are their probabilities.
          */
          function probGen(array $arr, float $control = 0.01)
          {


          Use better variable names



          What does $arr mean? I can guess that it is probably an array but my IDE can already tell me that. What's actually in that array? $elements would be a better name based on the function comments. A better name would be $elementsOfSomething where something accurately describe the characteristic of those elements.



          Always use curly braces for control structures



          Although it is perfectly valid syntax to omit curly braces when a control structure only contains one line of code. But it is a best practice to always use them as they make the code more rreadable and prevent future errors. Future you or another developer may want to add a line to a control block and introduce a hard to find bug because they didn't realize the curly braces weren't there.



          if($var <= $sum)
          return $key;


          should be:



          if($var <= $sum) {
          return $key;
          }


          Use echo over print()



          print() is an alias of echo but there are minor differences between the two. Although they don't come into play here, it is the PHP convention to use echo for outputting content.



          Don't output content from your functions



          Your functions that do work and output content but only when there is an error are inconstant and doing too much. If you have an error, let the function report that back through a special return value (like false or null) or by throwing an exception. Let the calling code worry about reporting back the error and let the function focus on doing one thing and one thing only (just like a good OOP class should be doing).



          You can simply statements that check for a Boolean to return a Boolean



          Your statement return (sum($arr) == 1)?true:false; is checking if a statement is true and returning true if it is. false if it is false. So you can return the result of your conditional without having to explicitly return true or false because you already have that value:



          return (sum($arr) == 1); // will return true or false


          An alternative randx() function



          The internet seems like it is already full of example functions that will generate a random float. They tend to be similar to the one below. Is there any reason you did not choose to go this route? It has fewer function calls so it should be more performant.



          function frand($min, $max, $decimals = 0) {
          $scale = pow(10, $decimals);
          return mt_rand($min * $scale, $max * $scale) / $scale;
          }

          echo "frand(0, 10, 2) = " . frand(0, 10, 2) . "n";


          The PHP community prefers // to # for comments



          Although # is a valid syntax for a one line comment in PHP, it is common practice to use //.






          share|improve this answer










          New contributor




          John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
          Check out our Code of Conduct.


















            Your Answer





            StackExchange.ifUsing("editor", function () {
            return StackExchange.using("mathjaxEditing", function () {
            StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
            StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["\$", "\$"]]);
            });
            });
            }, "mathjax-editing");

            StackExchange.ifUsing("editor", function () {
            StackExchange.using("externalEditor", function () {
            StackExchange.using("snippets", function () {
            StackExchange.snippets.init();
            });
            });
            }, "code-snippets");

            StackExchange.ready(function() {
            var channelOptions = {
            tags: "".split(" "),
            id: "196"
            };
            initTagRenderer("".split(" "), "".split(" "), channelOptions);

            StackExchange.using("externalEditor", function() {
            // Have to fire editor after snippets, if snippets enabled
            if (StackExchange.settings.snippets.snippetsEnabled) {
            StackExchange.using("snippets", function() {
            createEditor();
            });
            }
            else {
            createEditor();
            }
            });

            function createEditor() {
            StackExchange.prepareEditor({
            heartbeatType: 'answer',
            autoActivateHeartbeat: false,
            convertImagesToLinks: false,
            noModals: true,
            showLowRepImageUploadWarning: true,
            reputationToPostImages: null,
            bindNavPrevention: true,
            postfix: "",
            imageUploader: {
            brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
            contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
            allowUrls: true
            },
            onDemand: true,
            discardSelector: ".discard-answer"
            ,immediatelyShowMarkdownHelp:true
            });


            }
            });














            draft saved

            draft discarded


















            StackExchange.ready(
            function () {
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f210833%2fprobability-suite-for-an-array-of-elements%23new-answer', 'question_page');
            }
            );

            Post as a guest















            Required, but never shown

























            1 Answer
            1






            active

            oldest

            votes








            1 Answer
            1






            active

            oldest

            votes









            active

            oldest

            votes






            active

            oldest

            votes









            1














            A have a few little things:



            Don't put comments in between function declarations and curly braces (or anything similar to that like classes or methods)



            That's hard to read and an eye sore. Put your function comments above the function itself. Ideally using standard docblock format.



            /**
            * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
            *
            * @param array $arr An array of elements
            * @param float $control A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
            * @return array An associative array where the keys are the elements in the original array, and the values are their probabilities.
            */
            function probGen(array $arr, float $control = 0.01)
            {


            Use better variable names



            What does $arr mean? I can guess that it is probably an array but my IDE can already tell me that. What's actually in that array? $elements would be a better name based on the function comments. A better name would be $elementsOfSomething where something accurately describe the characteristic of those elements.



            Always use curly braces for control structures



            Although it is perfectly valid syntax to omit curly braces when a control structure only contains one line of code. But it is a best practice to always use them as they make the code more rreadable and prevent future errors. Future you or another developer may want to add a line to a control block and introduce a hard to find bug because they didn't realize the curly braces weren't there.



            if($var <= $sum)
            return $key;


            should be:



            if($var <= $sum) {
            return $key;
            }


            Use echo over print()



            print() is an alias of echo but there are minor differences between the two. Although they don't come into play here, it is the PHP convention to use echo for outputting content.



            Don't output content from your functions



            Your functions that do work and output content but only when there is an error are inconstant and doing too much. If you have an error, let the function report that back through a special return value (like false or null) or by throwing an exception. Let the calling code worry about reporting back the error and let the function focus on doing one thing and one thing only (just like a good OOP class should be doing).



            You can simply statements that check for a Boolean to return a Boolean



            Your statement return (sum($arr) == 1)?true:false; is checking if a statement is true and returning true if it is. false if it is false. So you can return the result of your conditional without having to explicitly return true or false because you already have that value:



            return (sum($arr) == 1); // will return true or false


            An alternative randx() function



            The internet seems like it is already full of example functions that will generate a random float. They tend to be similar to the one below. Is there any reason you did not choose to go this route? It has fewer function calls so it should be more performant.



            function frand($min, $max, $decimals = 0) {
            $scale = pow(10, $decimals);
            return mt_rand($min * $scale, $max * $scale) / $scale;
            }

            echo "frand(0, 10, 2) = " . frand(0, 10, 2) . "n";


            The PHP community prefers // to # for comments



            Although # is a valid syntax for a one line comment in PHP, it is common practice to use //.






            share|improve this answer










            New contributor




            John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
            Check out our Code of Conduct.























              1














              A have a few little things:



              Don't put comments in between function declarations and curly braces (or anything similar to that like classes or methods)



              That's hard to read and an eye sore. Put your function comments above the function itself. Ideally using standard docblock format.



              /**
              * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
              *
              * @param array $arr An array of elements
              * @param float $control A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
              * @return array An associative array where the keys are the elements in the original array, and the values are their probabilities.
              */
              function probGen(array $arr, float $control = 0.01)
              {


              Use better variable names



              What does $arr mean? I can guess that it is probably an array but my IDE can already tell me that. What's actually in that array? $elements would be a better name based on the function comments. A better name would be $elementsOfSomething where something accurately describe the characteristic of those elements.



              Always use curly braces for control structures



              Although it is perfectly valid syntax to omit curly braces when a control structure only contains one line of code. But it is a best practice to always use them as they make the code more rreadable and prevent future errors. Future you or another developer may want to add a line to a control block and introduce a hard to find bug because they didn't realize the curly braces weren't there.



              if($var <= $sum)
              return $key;


              should be:



              if($var <= $sum) {
              return $key;
              }


              Use echo over print()



              print() is an alias of echo but there are minor differences between the two. Although they don't come into play here, it is the PHP convention to use echo for outputting content.



              Don't output content from your functions



              Your functions that do work and output content but only when there is an error are inconstant and doing too much. If you have an error, let the function report that back through a special return value (like false or null) or by throwing an exception. Let the calling code worry about reporting back the error and let the function focus on doing one thing and one thing only (just like a good OOP class should be doing).



              You can simply statements that check for a Boolean to return a Boolean



              Your statement return (sum($arr) == 1)?true:false; is checking if a statement is true and returning true if it is. false if it is false. So you can return the result of your conditional without having to explicitly return true or false because you already have that value:



              return (sum($arr) == 1); // will return true or false


              An alternative randx() function



              The internet seems like it is already full of example functions that will generate a random float. They tend to be similar to the one below. Is there any reason you did not choose to go this route? It has fewer function calls so it should be more performant.



              function frand($min, $max, $decimals = 0) {
              $scale = pow(10, $decimals);
              return mt_rand($min * $scale, $max * $scale) / $scale;
              }

              echo "frand(0, 10, 2) = " . frand(0, 10, 2) . "n";


              The PHP community prefers // to # for comments



              Although # is a valid syntax for a one line comment in PHP, it is common practice to use //.






              share|improve this answer










              New contributor




              John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
              Check out our Code of Conduct.





















                1












                1








                1






                A have a few little things:



                Don't put comments in between function declarations and curly braces (or anything similar to that like classes or methods)



                That's hard to read and an eye sore. Put your function comments above the function itself. Ideally using standard docblock format.



                /**
                * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
                *
                * @param array $arr An array of elements
                * @param float $control A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
                * @return array An associative array where the keys are the elements in the original array, and the values are their probabilities.
                */
                function probGen(array $arr, float $control = 0.01)
                {


                Use better variable names



                What does $arr mean? I can guess that it is probably an array but my IDE can already tell me that. What's actually in that array? $elements would be a better name based on the function comments. A better name would be $elementsOfSomething where something accurately describe the characteristic of those elements.



                Always use curly braces for control structures



                Although it is perfectly valid syntax to omit curly braces when a control structure only contains one line of code. But it is a best practice to always use them as they make the code more rreadable and prevent future errors. Future you or another developer may want to add a line to a control block and introduce a hard to find bug because they didn't realize the curly braces weren't there.



                if($var <= $sum)
                return $key;


                should be:



                if($var <= $sum) {
                return $key;
                }


                Use echo over print()



                print() is an alias of echo but there are minor differences between the two. Although they don't come into play here, it is the PHP convention to use echo for outputting content.



                Don't output content from your functions



                Your functions that do work and output content but only when there is an error are inconstant and doing too much. If you have an error, let the function report that back through a special return value (like false or null) or by throwing an exception. Let the calling code worry about reporting back the error and let the function focus on doing one thing and one thing only (just like a good OOP class should be doing).



                You can simply statements that check for a Boolean to return a Boolean



                Your statement return (sum($arr) == 1)?true:false; is checking if a statement is true and returning true if it is. false if it is false. So you can return the result of your conditional without having to explicitly return true or false because you already have that value:



                return (sum($arr) == 1); // will return true or false


                An alternative randx() function



                The internet seems like it is already full of example functions that will generate a random float. They tend to be similar to the one below. Is there any reason you did not choose to go this route? It has fewer function calls so it should be more performant.



                function frand($min, $max, $decimals = 0) {
                $scale = pow(10, $decimals);
                return mt_rand($min * $scale, $max * $scale) / $scale;
                }

                echo "frand(0, 10, 2) = " . frand(0, 10, 2) . "n";


                The PHP community prefers // to # for comments



                Although # is a valid syntax for a one line comment in PHP, it is common practice to use //.






                share|improve this answer










                New contributor




                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.









                A have a few little things:



                Don't put comments in between function declarations and curly braces (or anything similar to that like classes or methods)



                That's hard to read and an eye sore. Put your function comments above the function itself. Ideally using standard docblock format.



                /**
                * Generates a valid, random probability distribution for a given array of elements, that can be used in conjunction with "probSelect()".
                *
                * @param array $arr An array of elements
                * @param float $control A value that decides how much mass is allowed to be unilaterally dumped onto one element. A high value would permit distributions where most of the mass is concentrated on one element.
                * @return array An associative array where the keys are the elements in the original array, and the values are their probabilities.
                */
                function probGen(array $arr, float $control = 0.01)
                {


                Use better variable names



                What does $arr mean? I can guess that it is probably an array but my IDE can already tell me that. What's actually in that array? $elements would be a better name based on the function comments. A better name would be $elementsOfSomething where something accurately describe the characteristic of those elements.



                Always use curly braces for control structures



                Although it is perfectly valid syntax to omit curly braces when a control structure only contains one line of code. But it is a best practice to always use them as they make the code more rreadable and prevent future errors. Future you or another developer may want to add a line to a control block and introduce a hard to find bug because they didn't realize the curly braces weren't there.



                if($var <= $sum)
                return $key;


                should be:



                if($var <= $sum) {
                return $key;
                }


                Use echo over print()



                print() is an alias of echo but there are minor differences between the two. Although they don't come into play here, it is the PHP convention to use echo for outputting content.



                Don't output content from your functions



                Your functions that do work and output content but only when there is an error are inconstant and doing too much. If you have an error, let the function report that back through a special return value (like false or null) or by throwing an exception. Let the calling code worry about reporting back the error and let the function focus on doing one thing and one thing only (just like a good OOP class should be doing).



                You can simply statements that check for a Boolean to return a Boolean



                Your statement return (sum($arr) == 1)?true:false; is checking if a statement is true and returning true if it is. false if it is false. So you can return the result of your conditional without having to explicitly return true or false because you already have that value:



                return (sum($arr) == 1); // will return true or false


                An alternative randx() function



                The internet seems like it is already full of example functions that will generate a random float. They tend to be similar to the one below. Is there any reason you did not choose to go this route? It has fewer function calls so it should be more performant.



                function frand($min, $max, $decimals = 0) {
                $scale = pow(10, $decimals);
                return mt_rand($min * $scale, $max * $scale) / $scale;
                }

                echo "frand(0, 10, 2) = " . frand(0, 10, 2) . "n";


                The PHP community prefers // to # for comments



                Although # is a valid syntax for a one line comment in PHP, it is common practice to use //.







                share|improve this answer










                New contributor




                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.









                share|improve this answer



                share|improve this answer








                edited 9 hours ago





















                New contributor




                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.









                answered 11 hours ago









                John Conde

                1786




                1786




                New contributor




                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.





                New contributor





                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.






                John Conde is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                Check out our Code of Conduct.






























                    draft saved

                    draft discarded




















































                    Thanks for contributing an answer to Code Review Stack Exchange!


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    Use MathJax to format equations. MathJax reference.


                    To learn more, see our tips on writing great answers.





                    Some of your past answers have not been well-received, and you're in danger of being blocked from answering.


                    Please pay close attention to the following guidance:


                    • Please be sure to answer the question. Provide details and share your research!

                    But avoid



                    • Asking for help, clarification, or responding to other answers.

                    • Making statements based on opinion; back them up with references or personal experience.


                    To learn more, see our tips on writing great answers.




                    draft saved


                    draft discarded














                    StackExchange.ready(
                    function () {
                    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fcodereview.stackexchange.com%2fquestions%2f210833%2fprobability-suite-for-an-array-of-elements%23new-answer', 'question_page');
                    }
                    );

                    Post as a guest















                    Required, but never shown





















































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown

































                    Required, but never shown














                    Required, but never shown












                    Required, but never shown







                    Required, but never shown







                    Popular posts from this blog

                    404 Error Contact Form 7 ajax form submitting

                    How to know if a Active Directory user can login interactively

                    TypeError: fit_transform() missing 1 required positional argument: 'X'