Circle–ellipse problem: Difference between revisions
→Combine class Circle into class Ellipse: Noted possible performance advantage, which IMO outweighs possible conceptual advantage by a lot. |
|||
Line 130: | Line 130: | ||
Then, wherever you used a circle before, use an ellipse. |
Then, wherever you used a circle before, use an ellipse. |
||
A circle can already be represented by an ellipse. There is no reason to have class Circle unless it needs some circle-specific methods that can't be applied to an ellipse, or unless the programmer wishes to benefit from |
A circle can already be represented by an ellipse. There is no reason to have class Circle unless it needs some circle-specific methods that can't be applied to an ellipse, or unless the programmer wishes to benefit from conceptual and/or performance advantages of the circle's simpler model. |
||
=== Change the language === |
=== Change the language === |
Revision as of 03:53, 2 April 2009
The circle-ellipse problem in software development (sometimes known as the square-rectangle problem) illustrates a number of pitfalls which can arise when using subtype polymorphism in object modelling. The issues are most commonly encountered when using object-oriented programming.
The problem concerns what subtyping/inheritance relationship should exist between classes which represent circles and ellipses. More generally, the problem illustrates the difficulties which can occur when a base class contains methods which mutate an object in a manner which might invalidate a (stronger) invariant found in a derived class, causing the Liskov substitution principle to be violated.
The existence of the circle-ellipse problem is sometimes used to criticize object-oriented programming. It may also imply that hierarchical taxonomies are difficult to make universal, implying that situational classification systems may be more practical.
The problem
It is a central tenet of object-oriented analysis and design that subtype polymorphism, which is implemented in most OO languages via inheritance, should be used to model object types which are subsets of each other; this is commonly referred to as the is-a relationship. In the present example, the set of circles is a subset of the set of ellipses; circles can be defined as ellipses whose major and minor axes are the same length. Thus, code written in an OOPL which models shapes will frequently choose to make class Circle
as a sub-class of class Ellipse
,
i.e. inheriting from it.
If the classes are immutable, there is no problem with this state of affairs. Circles satisfy all invariants of ellipses; and an (immutable) circle can be used in any context where an immutable ellipse is expected. The relationship satisfies the Liskov substitution principle.
Ellipse
could have a method stretchX (integer Factor)
which alters the length of one of its axes, but not the other, and returns the result in a new Ellipse
object.
This would cause no problem for Circle::stretchX
, as it could return a new Ellipse
just as the Ellipse::stretchX
does (whilst remaining unaltered itself).
But some OO designs encourage the use of mutators, methods which modify instances of the class. A sub-class has to provide support for all behaviour supported by the super-class; subclasses must implement any mutators defined in a base class.
In the present case, the method Ellipse::stretchX
alters the length of one of its axes in place.
If Circle
inherits from Ellipse
,
it must also have a method stretchX
,
but the result of this method would be to change a circle into something which is no longer a circle. The Circle class can not simultaneously satisfy its own invariant, and the behavioral requirements of the Ellipse::stretchX
method.
A related problem with this inheritance arises when we consider the implementation. An ellipse requires more state to describe than a circle does, as the former needs attributes to specify the length and rotation of the major and minor axes; whereas a circle needs only a radius. It may be possible to avoid this if the language (such as Eiffel) makes constant values of a class, functions without arguments and data members interchangeable.
Some authors have suggested reversing the relationship between circle and ellipse, on the grounds that an ellipse is a circle with additional capabilities. Unfortunately, ellipses fail to satisfy many of the invariants of circles; if Circle
has a method radius
,
Ellipse
will now have to provide it as well.
The problem is sometimes expressed in statements such as "a Circle
is not a sort of Ellipse
". This looks confusingly like the absurd "a circle is not a sort of ellipse", and sounds identical, so it is unhelpful. What is actually meant is "an OO-model of a circle should not be a sort of OO-model of an ellipse".
Analysis
The problem arises when a super-class's methods are incompatible with a sub-class's. For example, parameters of method stretch(X,Y)
of Ellipse do not need to have the same values, but X!=Y is not acceptable for class Circle.
Indeed, a circle is a special ellipse in which the two foci coincide (i.e., are the same point), thus intuitively the class Circle is usually designed as sub-class of the class Ellipse. However, the usage-incompatible methods will cause unexpected results.
For example, if there is another method flatten(rate)
of class Ellipse:
Ellipse::flatten(rate){ stretch(1,1/rate); }
Since Circle is sub-class of Ellipse, it should be able to "flatten".
But after "flatten", the circle object is no longer circle, thus will fail at the circle-specific methods such as Circle::getRadius()
.
Possible solutions
One may solve the problem by changing one's model, or perhaps using a different language, which could be a (not yet implemented) extension of an existing language or use a different paradigm. Exactly which option is appropriate will depend on who wrote Circle
and who wrote Ellipse
. If the same author is designing them both from scratch, then they will be able to define the interface to handle this situation. If the Ellipse
object was already written, and cannot be changed, then the options are more limited.
Change the model
Return success or failure value
Allow the objects to return a 'success' or 'failure' value for each modifier. This is usually done in the case of file I/O, but can also be helpful here. Now, Ellipse.stretchX
works, and returns 'true', while Circle.stretchX
simply returns 'false'. This is in general good practice, but may require that the original author of ellipse
anticipated such a problem, and defined the mutators as returning a value.
Alternately, Circle.stretchX
could throw an exception (but depending on the language, this may also require that the original author of ellipse
declare that it may throw an exception).
Return the new value of X
This is a similar solution to the above, but is slightly more powerful. Ellipse.stretchX
now returns the new value of its X dimension. This is more powerful because it also allows the Ellipse
class to limit its X dimension to a maximum if it wants. Now, Circle.stretchX
simply returns its current radius.
Maintain the aspect ratio
If the interface definition for Ellipse
states only that "stretchX modifies the X axis", and does not state "and nothing else will change", then Circle
could simply force the X and Y dimensions to be the same. Circle.stretchX
and Circle.stretchY
both modify both the X and Y size.
Circle::stretchX(x) {xSize = ySize = x;}
Circle::stretchY(y) {xSize = ySize = y;}
Convert the Circle into an Ellipse
If Circle.stretchX
is called, then Circle
changes itself into an Ellipse
. This may be dangerous, however, if some other function is expecting it to be a Circle
. Some languages don't allow this type of change at all, and others impose restrictions on the Ellipse
class to be an acceptable replacement for Circle
.
Limit the interfaces
In some cases it may be possible to argue that not all theoretically defined methods are necessary. This is only applicable in a limited number of cases, and makes re-use of the model harder.
Make all instances constant
One can change the model so that instances of the classes represent constant values (i.e. they are immutable). This is exactly the implementation that is used in purely functional programming.
In this case methods such as stretchX
must be changed to yield a
new instance, rather than modifying the instance they act on.
This means that it is no longer a problem to define
Circle.stretchX
,
and the inheritance reflects the mathematical relationship between circles and ellipses.
A disadvantage is that changing the value of an instance
then requires an assignment, which
is inconvenient and prone to programming errors, e.g.
Orbit(planet[i]) := Orbit(planet[i]).stretchX
.
A second disadvantage is that such an assignment conceptually involves a temporary value, which could reduce performance and be difficult to optimise.
Factor out modifiers
One can define a new class MutableEllipse
,
and put the modifiers from Ellipse
in it.
The Circle
only inherits queries from Ellipse
.
This has a disadvantage of introducing an extra class
where all we really want to do is specify that
Circle
does not inherit modifiers from Ellipse
.
Impose pre-conditions on modifiers
One can specify that Ellipse.stretchX
is only allowed on
instances satisfying Ellipse.stretchable
,
and will otherwise throw an exception.
This allows us to implement Circle.stretchX
,
but makes it harder to check at compile-time that calls will be valid at run-time.
It also introduces methods whose sole purpose is to solve this problem,
which makes the model less clear.
Factor out common functionality into an abstract base class
Create an abstract base class called RoundObject
and put methods that work with both Circle
s and Ellipse
s in this class. Functions that can deal with either type of object will expect a RoundObject
, and functions that call Ellipse
or Circle
-specific requirements will use the descendant classes.
Drop all inheritance relationships
This solves the problem at a stroke, and may sometimes be a reasonable approach.
The disadvantage is that one can no longer use circles where ellipses are expected.
However, an acceptable workaround may be to add a method Circle.toEllipse()
which would instantiate and return a mutable ellipse object using the circle's current state. The caveat is that mutations performed on this new ellipse would not affect the original circle, and vice versa.
Combine class Circle into class Ellipse
Then, wherever you used a circle before, use an ellipse.
A circle can already be represented by an ellipse. There is no reason to have class Circle unless it needs some circle-specific methods that can't be applied to an ellipse, or unless the programmer wishes to benefit from conceptual and/or performance advantages of the circle's simpler model.
Change the language
Some of the above approaches suggest proposals to modify OO languages.
Support the value/variable distinction
One could change the language so that all methods must be defined to provide new values, a sort of value-oriented programming.
An essential part of the OO approach is, however, that variables can refer to
different instances, and instances can take on different values.
This suffers from the previously described problem with assignments,
so it is desirable to have a compact way of expressing modification,
e.g. Orbit(planet[i]) :. stretchX
.
Support query-only inheritance
A syntax to specify that a class only inherits the queries of another would
obviate the syntactic overhead of the solution Factor out modifiers,
e.g. class Circle : const Ellipse
.
See also
External links
- http://www.parashift.com/c++-faq-lite/proper-inheritance.html#faq-21.6 A popular C++ FAQ site by Marshall Cline. States and explains the problem.
- Constructive Deconstruction of Subtyping by Alistair Cockburn on his own web-site. Technical/mathematical discussion of typing and subtyping, with applications to this problem.
- http://www.ddj.com/dept/cpp/184403771 Article from Dr. Dobb's April 15, 2003 From Mechanism to Method: Total Ellipse by Kelvin Henney Clear medium-level detailed discussion with many examples bias towards C++. Covers some more general inheritance design issues.
- http://www.dbdebunk.com/page/page/1383837.htm More on Type Inheritance with C.J. Date from Database Debunkings 30 Jun 2004 Question from a reader and answer from C.J. Date. Short, clear, emphasises variable/value distinction (with no particular emphasis on databases).
- http://www.dbdebunk.com/page/page/1409199.htm More on Cure for Madness by C.J. Date from Database Debunkings 24 September 2004 Some implications of the problem for SQL.
- http://orafaq.com/usenet/comp.databases.theory/2001/10/01/0001.htm Beginning of a long thread (follow the Maybe reply: links) on Oracle FAQ discussing the issue. Refers to writings of C.J. Date. Some bias towards Smalltalk.
- http://c2.com/cgi/wiki?LiskovSubstitutionPrinciple Long discussion of the Liskov substitution principle with reference to this problem.
- Subtyping, Subclassing, and Trouble with OOP, an essay discussing a related problem, whether sets should inherit from bags.