Determine Keys from Functional Dependencies

I'm taking a database theory course, and it's not clear to me after doing the reading how I can infer keys, given a set of functional dependencies. I have an example problem: Find all keys of the relation R(ABCDEFG) with functional dependencies

AB → C CD → E EF → G FG → E DE → C BC → A

Demonstrate your knowledge by identifying which of the following is a key.

a. BCDEF b. ADFG c. BDFG d. BCDE

Can someone walk me through how I should decompose the functional dependencies to conclude that some combination of attributes is a key? I expect I'll face a number of these types of problems and I need to understand how to approach it.

database
functional-dependencies

31.4k 32 32 gold badges 120 120 silver badges 163 163 bronze badges asked Apr 20, 2011 at 19:26 2,600 7 7 gold badges 27 27 silver badges 29 29 bronze badges

The video in yrazlik's answer actually uses the approach introduced in [1], if you are interested to know why it works. (The proofs are short and straightforward.) [1] H. Saiedian and T. Spencer, “An Efficient Algorithm to Compute the Candidate Keys of a Relational Database Schema,” The Computer Journal, vol. 39, no. 2, Feb. 1996 [Online]. Available: pdfs.semanticscholar.org/ab3f/…. [Accessed: 31-Jul-2019]

Commented Aug 4, 2019 at 5:28

@Leponzo The introduction to the bulleted algorithm you gave in some deleted answers similar to your deleted answer here is incorrect. The algorithm requires that F be a cover for the FDs that hold, not just be a set of FDs that hold. PS Please use text, not images/links, for text--including tables & ERDs. Paraphrase or quote from other text. Use images only for what cannot be expressed as text or to augment text. Images cannot be searched for or cut & pasted. Include a legend/key & explanation with an image.

Commented Aug 4, 2019 at 6:07

6 Answers 6

Take an FD, e.g. AB→C

Augment until all attributes are mentioned, e.g. ABDEFG → CDEFG (note that this is equivalent to ABDEFG → ABCDEFG because it is trivially true that A->A and B->B).

This tells you that ABDEFG is a superkey.

Check the other FDs in which the LHS is a subset of your superkey, and that on its RHS contains some other attribute of your superkey.

There are two. EF→G and FG→E. Remove the attributes of the RHS of these from your superkey. Doing so gives you two keys, that are certainly superkeys, but not necessarily irreducible ones: ABDEF and ABDFG.

However, from AB→C and CD→E we can also derive that ABD→E. Hence we can also remove the E from our ABDEF key. The nasty thing here is that when I said "Check the other FDs", that apparently means that you should also check any FD that appears in the closure of your set of FDs (i.e. any FD that is derivable from your given set of FDs). And that's a bit impractical to do by hand .

A useful site for verifying whether your results are correct:

You should now be able to determine that option c is a key.

320 3 3 silver badges 11 11 bronze badges answered Apr 20, 2011 at 20:23 Erwin Smout Erwin Smout 18.3k 4 4 gold badges 35 35 silver badges 54 54 bronze badges Isn't BDF the candidate key? Since DF->C, EF->G, CD->E we have G from BDF. Commented Jun 4, 2012 at 6:13

@NickRosencrantz: There are four candidate keys. BDF isn't one of them; they all happen to have four attributes. I don't follow your deductive logic at all; none of the FDs you mention contains B.

Commented Feb 18, 2013 at 23:48 He mentioned DF->C but that one wasn't stated. Maybe he misread DE->C. Commented Feb 18, 2013 at 23:50

@ErwinSmout Could you go through step by step how you augment AB→ C to become ABDEFG → CDEFG? I don't really understand it.

Commented Feb 18, 2016 at 15:17

It's interesting that you updated your answer on the same day, four years later :) Besides that, nice explanation.

Commented Mar 13, 2017 at 5:55

Well this should be rather straightforward. All you need to do is to take the closure of every key given and see if it contains all attributes of R. For example, closure of BCDEF = ABCDEFG since BC -> A and BC is a subset of BCDEF and so if EF for FD EF -> G. Since this closure contains all attributes of R, BCDEF is the key. The main aim of taking closures is to see if we can "reach" every attribute from a given set of attributes. The closure is the set of attributes that we can actually reach by navigating the FDs.

answered Dec 5, 2011 at 22:42 Parth Mehta Parth Mehta 31 2 2 bronze badges BCDEF isn't a candidate key, because it's reducible. (BC->E) But BCDF is one of the candidate keys. Commented Feb 18, 2013 at 23:50

Since you're in a db theory course, I'm going to assume you have SQL experience and try and relate the theory to the implementation context.

Basically, a relation is what you would refer to as a table in implementation and a key is ANY set of attributes (read columns) which can be used to identify a UNIQUE row (in db theory this would be a tuple). The simplest analogy here is that a key is your primary key, as well as any other possible set of columns you can use to identify a single tuple in your relation (row in your table). So, here are the basic steps for doing this (I'll walk through example A, and then you can try the rest):

List all of the attributes which are NOT in your proposed key (so BCDEF does not include A and G).
For each attribute you're missing, go through the list of functional dependencies and see if your proposed key is capable of inferring the attribute you're missing.

 a. So for A, you have BC -> A. Since both B and C are in the proposed key BCDEF, you can infer that BCDEF -> A. Your set now becomes BCDEFA. b. For G, again going through your FDs you find EF -> G. Since both E and F are in BCDEFA, you can infer BCDEFA -> G.

Because you were able to infer both A and G from BCDEF, option a is a key of the relation ABCDEFG. I know there is an algorithm for this, and it is probably in your course text somewhere. There is also probably an example. You should step through it manually until the pattern is intuitive.

EDIT: The reason I would go back through the text to look for this algorithm is that chances are your exam is going to be written as opposed to multiple choice since it is a db theory course. If this is true then you would get more partial credit if you can methodically follow notation demonstrated in your course text/notes.

The main goal is to turn the key into the relation, which should prove that the proposed key is in fact a key.