How to Determine the Number of Parameters a Function Should Accept

functionsparameters

I've noticed a few functions I work with have 6 or more parameters, whereas in most libraries I use it is rare to find a function that takes more than 3.

Often a lot of these extra parameters are binary options to alter the function behaviour. I think that some of these umpteen-parametered functions should probably be refactored. Is there a guideline for what number is too many?

Best Answer

I've never seen a guideline, but in my experience a function that takes more than three or four parameters indicates one of two problems:

The function is doing too much. It should be split into several smaller functions, each which have a smaller parameter set.
There is another object hiding in there. You may need to create another object or data structure that includes these parameters. See this article on the Parameter Object pattern for more information.

It's difficult to tell what you're looking at without more information. Chances are the refactoring you need to do is split the function into smaller functions which are called from the parent depending on those flags that are currently being passed to the function.

There are some good gains to be had by doing this:

It makes your code easier to read. I personally find it much easier to read a "rules list" made up of an if structure that calls a lot of methods with descriptive names than a structure that does it all in one method.
It's more unit testable. You've split your problem into several smaller tasks that are individually very simple. The unit test collection would then be made up of a behavioral test suite that checks the paths through the master method and a collection of smaller tests for each individual procedure.

Related Solutions

How to Decide What Code to Put into a Function

there are 2 ways to pass data into a function: parameters or globals, in small scripts globals are acceptable but really try to avoid them
it's easier to simply extract y this is also better when you need to change y later

you can use lazy initialization here:

var subresult=undefined

function sub(){
    if(subresult===undefined){
        subresult=//calculate...
    }
    return subresult;
}

there are 2 competing principles here SRP and YAGNI:

A. Single Responsibility Principle means essentially that each function should do a single thing and do it correctly

B. You Aren't Going to Need It: don't waste time on stuff that may or may not be needed in the future, focus on what you need now

Method extraction vs underlying assumptions

For example, imagine an initialisation method split into a series of small ones: in the context of method itself, you clearly know that object's state is still invalid, but in an ordinary private method you probably go from assumption that object is already initialised and is in a valid state. The only solution I've seen for this is...

Your concern is well-founded. There is another solution.

Take a step back. What fundamentally is the purpose of a method? Methods only do one of two things:

Produce a value
Cause an effect

Or, unfortunately, both. I try to avoid methods that do both, but plenty do. Let's say that the effect produced or the value produced is the "result" of the method.

You note that methods are called in a "context". What is that context?

The values of the arguments
The state of the program outside of the method

Essentially what you are pointing out is: the correctness of the result of the method depends on the context in which it is called.

We call the conditions required before a method body begins for the method to produce a correct result its preconditions, and we call the conditions which will be produced after the method body returns its postconditions.

So essentially what you are pointing out is: when I extract a code block into its own method, I am losing contextual information about the preconditions and postconditions.

The solution to this problem is make the preconditions and postconditions explicit in the program. In C#, for instance, you could use Debug.Assert or Code Contracts to express preconditions and postconditions.

For example: I used to work on a compiler which moved through several "stages" of compilation. First the code would be lexed, then parsed, then types would be resolved, then inheritance hierarchies would be checked for cycles, and so on. Every bit of the code was very sensitive to its context; it would be disastrous, for instance, to ask "is this type convertible to that type?" if the graph of base types was not yet known to be acyclic! So therefore every bit of code clearly documented its preconditions. We would assert in the method that checked for type convertibility that we had already passed the "base types acylic" check, and it then became clear to the reader where the method could be called and where it could not be called.

Of course there are lots of ways in which good method design mitigates the problem you've identified:

make methods that are useful for their effects or their value but not both
make methods that are as "pure" as possible; a "pure" method produces a value that depends only on its arguments, and produces no effect. These are the easiest methods to reason about because the "context" they need is very localized.
minimize the amount of mutation that happens in program state; mutations are points where code gets harder to reason about

Best Answer

Related Solutions

How to Decide What Code to Put into a Function

Method extraction vs underlying assumptions

Related Topic