Why are composite primary keys still around?
Solution 1:
Composite keys are required when your primary keys are non-surrogate and inherently, um, composite, that is, breakable into several non-related parts.
Some real-world examples:
Many-to-many link tables, in which the primary keys are composed of the keys of the entities related.
Multi-tenant applications when
tenant_id
is a part of primary key of each entity and the entities are only linkable within the same tenant (constrained by a foreign key).Applications processing third-party data (with already provided primary keys)
Note that logically, all this can be achieved using a UNIQUE
constraint (additional to a surrogate PRIMARY KEY
).
However, there are some implementation specific things:
Some systems won't let a
FOREIGN KEY
refer to anything that is not aPRIMARY KEY
.Some systems would only cluster a table on a
PRIMARY KEY
, hence making the composite thePRIMARY KEY
would improve performance of the queries joining on the composite.
Solution 2:
Personally I prefer the use of surrogate keys. However, in joining tables that consist only of the ids from two other tables (to create a many-to-many relationships) composite keys are the way to go and thus taking them out would make things more difficult.
There is a school of thought that surrogate keys are always bad and that if you don't have uniqueness to record through the use of natural keys you have a bad design. I strongly disagree with this (if you aren't storing SSN or some other unique value I defy you to come up with a natural key for a person table for instance.) But many people feel that it is necessary for proper normalization.
Sometimes having a composite key reduces the need to join to another table. Sometimes it doesn't. So there are times when a composite key can boost performance as well as times when it can harm performance. If the key is relatively stable, you may be fine with faster performance on select queries. However, if it is something that is subject to change like a company name, you could be in a world of hurt when company A changes it's name and you have to update a million associated records.
There is no one size fits all in database design. There are time when composite keys are helpful and times when they are horrible. There are times when surrogate keys are helpful and times when they are not.
Solution 3:
Composite primary key provides better performance when it comes to them being used as Foreign keys in other tables and reduces table reads - sometimes they can be life savers. If you use surrogate keys, you have to go to that table to get natural key information.
For example (pure example - so we are not talking DB design here), lets say you have an ORDER
table and ORDER_ITEM
. If you use ProductId
and LineNumber
(UPDATE: and as Pedro mentioned OrderId
or even better OrderNumber
) as composite primary key in ORDER_ITEM
, then in your cross table for SHIPPING
, you would be able to have ProductId in the SHIPPING_ORDERITEM
. This can massively boost your performance if for example you have run out of that product and need to find out all products of that ProductId
that need to be shipped without a need to join.
On the other hand, if you use a surrogate key, you have to join and you end up with a very inefficient SQL execution plan where it has to do bookmark lookup on several indexes.
See more on bookmark lookup which using surrogate keys becomes a major issue.