This article needs additional citations for verification. (November 2012) |
Part of a series on |
Formal languages |
---|
In computer science, a production or production rule is a rewrite rule that replaces some symbols with other symbols. A finite set of productions is the main component in the specification of a formal grammar (specifically a generative grammar).
In such grammars, a set of productions is a special case of relation on the set of strings (where is the Kleene star operator) over a finite set of symbols called a vocabulary that defines which non-empty strings can be substituted with others. The set of productions is thus a special kind subset
and productions are then written in the form to mean that (not to be confused with being used as function notation, since there may be multiple rules for the same ). Given two subsets , productions can be restricted to satisfy , in which case productions are said "to be of the form . Different choices and constructions of lead to different types of grammars. In general, any production of the form
where is the empty string (sometimes also denoted ), is called an erasing rule, while productions that would produce strings out of nowhere, namely of the form
are never allowed.
In order to allow the production rules to create meaningful sentences, the vocabulary is partitioned into (disjoint) sets and providing two different roles:
In the most general case of an unrestricted grammar, a production , is allowed to map arbitrary strings and in (terminals and nonterminals), as long as is not empty. So unrestricted grammars have productions of the form
or if we want to disallow changing finished sentences
where indicates concatenation and forces a non-terminal symbol to always be present in of the left-hand side of the productions, denotes set union, and denotes set minus or set difference. If we do not allow the start symbol to occur in (the word on the right side), we have to replace by in the right-hand side.[1]
The other types of formal grammar in the Chomsky hierarchy impose additional restrictions on what constitutes a production. Notably in a context-free grammar, the left-hand side of a production must be a single nonterminal symbol. So productions are of the form: