Coding Style – When to Use Classes vs. POD (PDS) and Functions

clean codecoding-stylemaintainability

Recently, I've read a blog post, that I can't find back, about how we should "free the data". The main point of the post was that we use classes and encapsulation too much since a lot of problems can be solved with less overhead by using plain old (passive) data structure combined with function overload. The post raised my awareness about the cost of creating (and maintaining) classes and classes hierarchies. In order to share this newly earned awareness with my colleagues, I tried to pinpoint conditions that justify the creation of classes. So far, I've found

Presence of an invariant. For instance, a map should always contains the same number of keys and elements. You do not want the user to add a key and forget to add the corresponding element.
Implementation hiding to have the freedom to change it easily. For instance, a Point can be encoded with Cartesian coordinates (x,y) or with a radius and an angle.
Homogeneous manipulation. For instance if you want Dog and Cat to be manipulated the same way because they are both specialization of the more general concept of Animal.

What are the other reasons to create classes or classes hierarchies?

Edit: By cost, I refer to the time, money and Technical Debt required to create and maintain classes and classes hierarchies. This cost should be compare to the cost of other solutions.

Edit 2: I realized that trying to make this question general was a mistake. I definitely have c++ in mind.

Best Answer

This is a very broad question, as it depends on the language used, and the features that language offers:

Can "plain old data structures" be made immutable?
Does the language enforce encapsulation of private functions and data, or is it by convention?
Is the language statically or dynamically typed?
Does it allow functions outside of static classes?
Does it treat functions as first class values?
Does it support interfaces?
Does it support records, structs etc?
Does it even support classes?

Depending on the answers to the above questions, the strategy used will vary. If, for example, the language doesn't support classes, you won't be using them...

Having said all that, there are some general rules that can be followed across languages:

Avoid global state. If you have data, that's globally accessible and is mutable, you're on the path to debugging hell. Just don't do it.
Avoid coupling. Whether it's through having objects spin up instances of other classes, or functions hard-coded to call other public functions, you're making the code harder to test and maintain. Use injection techniques and keep coupling as loose as possible.
Avoid inheritance. Inheritance causes coupling problems, including the Fragile Base Class Problem, weakens encapsulation and causes testing problems. Unless you are using a language that can only achieve polymorphism via inheritance (ie doesn't support truly abstract classes or interfaces), then don't use inheritance.

As a rule of thumb, for a typical modern language that supports static functions and classes:

Keep data as immutable as possible,
Keep data and functionality as separate as possible,
But use objects to encapsulate state and provide methods to handle that state,
Only use functions (static methods) when they can be made pure, ie they produce a result from the parameters in a deterministic fashion without side effects.
Design to interfaces (or the equivalent) and use injection as much as possible.

Related Solutions

C++ Coding Style – When to Use Typedef?

Your last example is very much readable, but it depends on where you define the typedef. Local scope typedefs (like in your second example) are IMVHO almost always a win.

I still like your third example best, but you might want to think about the nameing, and give the iterators names which tell the intend of the container.

Another option would be to make a template out of your function, so that it works with different containers, too. Along the lines of

template <typename Input_iterator> ... sum(Input_iterator first, Input_iterator last)

which is also very much in the spirit of the STL.

Using Syntactic Sugar – When to Use or Avoid Syntactic Sugar

I disagree with

It is hard to read in general

especially to "in general". These language features may be hard to read for beginners when they see them the first time, but they were actually added to the language to make code more concise. So after one gets used to them (which should not last longer than using them half a dozen times) they should make the code more readable, not less.

For someone coming from another language e.g. Java, it is harder to see what is going on in the code.

Yes, but is your goal to program Java in C#, or to program C#?

When you decide to use a language, you will be better off learn the idioms of the language, especially the simple ones. When you work with real-world programs, you will encounter these idioms frequently and will have to deal with them, whether you like them or not.

Let me finally add, the ultimate measure for the readibility of your code is what your peer reviewer tells you. And whenever I am in the role of a reviewer who stumbles about a simple language idiom which is new to me, I usually take it as an occasion to learn something new, not as an occasion to tell the other devs what they should not use because I don't want to learn it.

Best Answer

Related Solutions

C++ Coding Style – When to Use Typedef?

Using Syntactic Sugar – When to Use or Avoid Syntactic Sugar

Related Topic