Misplaced Pages

Projection (relational algebra)

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Projection (database)) Operation that restricts a relation to a specified set of attributes This article is about relational algebra. For other uses of the term "projection", see Projection (disambiguation).

In relational algebra, a projection is a unary operation written as Π a 1 , . . . , a n ( R ) {\displaystyle \Pi _{a_{1},...,a_{n}}(R)} , where R {\displaystyle R} is a relation and a 1 , . . . , a n {\displaystyle a_{1},...,a_{n}} are attribute names. Its result is defined as the set obtained when the components of the tuples in R {\displaystyle R} are restricted to the set { a 1 , . . . , a n } {\displaystyle \{a_{1},...,a_{n}\}} – it discards (or excludes) the other attributes.

In practical terms, if a relation is thought of as a table, then projection can be thought of as picking a subset of its columns. For example, if the attributes are (name, age), then projection of the relation {(Alice, 5), (Bob, 8)} onto attribute list (age) yields {5,8} – we have discarded the names, and only know what ages are present.

Projections may also modify attribute values. For example, if R {\displaystyle R} has attributes a {\displaystyle a} , b {\displaystyle b} , c {\displaystyle c} , where the values of b {\displaystyle b} are numbers, then Π a ,   b × 0.5 ,   c ( R ) {\displaystyle \Pi _{a,\ b\times 0.5,\ c}(R)} is like R {\displaystyle R} , but with all b {\displaystyle b} -values halved.

Related concepts

The closely related concept in set theory (see: projection (set theory)) differs from that of relational algebra in that, in set theory, one projects onto ordered components, not onto attributes. For instance, projecting ( 3 , 7 ) {\displaystyle (3,7)} onto the second component yields 7.

Projection is relational algebra's counterpart of existential quantification in predicate logic. The attributes not included correspond to existentially quantified variables in the predicate whose extension the operand relation represents. The example below illustrates this point.

Because of the correspondence with existential quantification, some authorities prefer to define projection in terms of the excluded attributes. In a computer language it is of course possible to provide notations for both, and that was done in ISBL and several languages that have taken their cue from ISBL.

A nearly identical concept occurs in the category of monoids, called a string projection, which consists of removing all of the letters in the string that do not belong to a given alphabet.

When implemented in SQL standard the "default projection" returns a multiset instead of a set, and the π projection is obtained by the addition of the DISTINCT keyword to eliminate duplicate data.

Example

For an example, consider the relations depicted in the following two tables which are the relation Person and its projection on (some say "over") the attributes Age and Weight:

Person {\displaystyle {\text{Person}}} Π Age,Weight ( Person ) {\displaystyle \Pi _{\text{Age,Weight}}({\text{Person}})}
Name Age Weight
Harry 34 180
Sally 28 164
George 28 170
Helena 54 154
Peter 34 180
Age Weight
34 180
28 164
28 170
54 154

Suppose the predicate of Person is "Name is age years old and weighs weight." Then the given projection represents the predicate, "There exists Name such that Name is age years old and weighs weight."

Note that Harry and Peter have the same age and weight, but since the result is a relation, and therefore a set, this combination only appears once in the result.

Formal definition

More formally the semantics of projection are defined as follows:

Π a 1 , . . . , a n ( R ) = {   t [ a 1 , . . . , a n ] :   t R   } , {\displaystyle \Pi _{a_{1},...,a_{n}}(R)=\{\ t:\ t\in R\ \},}

where t [ a 1 , . . . , a n ] {\displaystyle t} is the restriction of the tuple t {\displaystyle t} to the set { a 1 , . . . , a n } {\displaystyle \{a_{1},...,a_{n}\}} so that

t [ a 1 , . . . , a n ] = {   ( a , v )   |   ( a , v ) t ,   a { a 1 , . . . , a n } } , {\displaystyle t=\{\ (a',v)\ |\ (a',v)\in t,\ a'\in \{a_{1},...,a_{n}\}\},}

where ( a , v ) {\displaystyle (a',v)} is an attribute value, a {\displaystyle a'} is an attribute name, and v {\displaystyle v} is an element of that attribute's domain — see Relation (database).

The result of a projection Π a 1 , . . . , a n ( R ) {\displaystyle \Pi _{a_{1},...,a_{n}}(R)} is defined only if { a 1 , . . . , a n } {\displaystyle \{a_{1},...,a_{n}\}} is a subset of the header of R {\displaystyle R} .

Projection over no attributes at all is possible, yielding a relation of degree zero. In this case the cardinality of the result is zero if the operand is empty, otherwise one. The two relations of degree zero are the only ones that cannot be depicted as tables.

See also

References

  1. "Relational Algebra". cs.rochester.edu. Retrieved 2014-07-28.
  2. http://www.csee.umbc.edu/~pmundur/courses/CMSC661-02/rel-alg.pdf See Problem 3.8.B on page 3
Category: