We saw in the previous entry that a collection of formation-rules (the formal-grammar) indicates which formulas are well-formed, and that these well-formed formulas (wffs) form a formal-language. Now we go a step further by marking some of the wffs of a given formal-language $L$ as theorems. This is done, again, by a collection of rules, known as inference-rules. The collection of inference-rules that we decide to use is known as an inference-system. Unlike a formal-grammar, an inference-system requires a starting point: it marks wffs as theorems using previous theorems (an action known as “inferring a theorem”). To avoid an infinite regress, a collection of wffs (known as the axiomatic-system) is chosen to be marked as theorems arbitrarily, in order to start the process. Each of the wffs of the axiomatic-system is called an axiom. Which wffs can be marked as theorems depends on the formal-language ( $L$ ), the inference-system ( $I$ ), and the axiomatic-system ( $S$ ). The formal-language defines all the wffs that are candidates to be theorems, the inference-system specifies the rules for marking them, and the axiomatic-system provides the initial theorems to begin the process. This triplet is important enough as a whole to receive its own name: a formal-system ( $F = (L, I, S)$ ). Once we choose a formal-system $F$ , the collection of wffs of $L$ that are theorems is automatically defined. This collection is known as the theory.

In the previous entry, we saw that the formal-language $L$ is formed by all the formulas that can be created as combinations of symbols of the formal-alphabet $A$ and are marked as well-formed-formulas (wffs) by the formation-rules of the formal-grammar $G$ . In the image above, the formal-language $L$ (defined by $(A, G)$ ) is formed by all the formulas (in purple) inside the green area, i.e. the 10 formulas marked as well-formed by $G$ : $aaa, aab, aac, abb, abc, acc, bbb, bbc, bcc, ccc$ (check the example in m1-formal-language). If we then choose an axiomatic-system $S$ (i.e. we choose some of these wffs, e.g. $abb$ and $ccc$ , to be marked as theorems) and an inference-system $I$ , we have defined the formal-system $F = (L, I, S)$ . All the wffs that can be marked as theorems using this formal-system (i.e. the axioms $abb$ and $ccc$ of $S$ , plus every wff that the inference-rules of $I$ can mark starting from them) form the theory $T$ . In the image above, we suppose that the only wffs that can be inferred starting from $S = \{abb, ccc\}$ and using $I$ are $bcc$ and $aac$ , so the theory $T$ defined by the formal-system $F = (L, I, S)$ is formed by the four theorems $\{abb, ccc, bcc, aac\}$ (the blue area of the image).
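The way a theory follows from the triplet $(L, I, S)$ can be sketched in a few lines of Python. This is only an illustration: the text does not say what the inference-rules of $I$ are, so the two rules below are hypothetical, written as plain (premises, conclusion) pairs chosen to reproduce the theorems of the example.

```python
# Toy formal-system F = (L, I, S) from the example above.
# ASSUMPTION: the two inference-rules are invented for this sketch; the text
# only says which theorems they end up producing, not how they are defined.

# The formal-language L: the 10 wffs of the example.
L = {"aaa", "aab", "aac", "abb", "abc", "acc", "bbb", "bbc", "bcc", "ccc"}

# The axiomatic-system S: wffs marked as theorems from the start.
S = {"abb", "ccc"}

# The inference-system I, as (premises, conclusion) pairs (hypothetical rules).
I = [
    (frozenset({"abb", "ccc"}), "bcc"),
    (frozenset({"bcc"}), "aac"),
]


def theory(language, rules, axioms):
    """Return every wff that can be marked as a theorem."""
    theorems = set(axioms)
    changed = True
    while changed:  # repeat until no rule adds a new theorem
        changed = False
        for premises, conclusion in rules:
            if premises <= theorems and conclusion not in theorems:
                assert conclusion in language  # only wffs can be theorems
                theorems.add(conclusion)
                changed = True
    return theorems


T = theory(L, I, S)
print(sorted(T))  # ['aac', 'abb', 'bcc', 'ccc']
```

Note that the second rule can only fire after the first one has produced $bcc$ , which is why the loop repeats until nothing changes.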

Notation note: When talking about formal-systems, it is common to refer to the action of marking a wff $t$ as a theorem following $I$ and $S$ as proving the theorem $t$ . So we often hear of a theory as “the collection of all the wffs that can be proven from the formal-system $F$ ”.

zeroth-order-formal-system

Any formal-system $F = (L, I, S)$ that uses as its formal-language $L$ the propositional-formal-language is called a zeroth-order-formal-system. As we saw, in the propositional-formal-language a wff is a proposition. In a zeroth-order-formal-system a theorem is a True proposition, and to prove a theorem is to infer that it is True.

inference-rules

A zeroth-order-formal-system deals with propositions and their truth-values, so the things to be inferred are truth-values of propositions. For example, suppose we are told that the proposition $p \text{ AND } q$ is a theorem (i.e. a True proposition). If we look at the definition of AND, the only combination of truth-values of $p$ and $q$ for which it is True is $p = T$ and $q = T$ , so if $p \text{ AND } q$ is a theorem, then both $p$ and $q$ are theorems too (True propositions). This is an inference. We are told nothing directly about the truth-values of $p$ and $q$ , but that information is implicit in the definition of the logical-connective itself, so we can “infer” it. In general, if we are told that a given collection of propositions $P$ (known as premises) are theorems, and if in all cases in which these propositions are True some proposition $p$ (called the conclusion) is also True, we say that we can infer from $P$ that $p$ is also True. This is nothing more than what we have called an inference-rule, and it is symbolically represented as $P ⊨ p$ . For the case of $p \text{ AND } q$ this rule is called simplification and is represented as $p \text{ AND } q ⊨ p$ . Note that we can also use simplification to infer $q$ , i.e. $p \text{ AND } q ⊨ q$ . Conversely (and as an example of an inference-rule with more than one premise), we can be told that the two propositions $p$ and $q$ are premises (therefore theorems, therefore both True) and from that infer that $p \text{ AND } q$ is True (note again that in the only row of the truth-table where both $p = T$ and $q = T$ , we have $p \text{ AND } q = T$ ). This inference-rule, represented as $p, q ⊨ p \text{ AND } q$ , is called conjunction.
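The idea behind $P ⊨ p$ (the conclusion is True in every case where all the premises are True) can be checked mechanically by going through every row of the truth-table. The following is a minimal sketch; the helper name `entails` and the encoding of propositions as Python functions over a row are choices made for this example only.

```python
from itertools import product


def entails(premises, conclusion, names):
    """True if the conclusion is True in every row where all premises are True."""
    for values in product([True, False], repeat=len(names)):
        row = dict(zip(names, values))
        if all(prem(row) for prem in premises) and not conclusion(row):
            return False  # found a row with True premises and a False conclusion
    return True


# Propositions encoded as functions from a truth-table row to a truth-value.
p = lambda row: row["p"]
q = lambda row: row["q"]
p_and_q = lambda row: row["p"] and row["q"]

print(entails([p_and_q], p, ["p", "q"]))     # simplification: True
print(entails([p_and_q], q, ["p", "q"]))     # simplification (other side): True
print(entails([p, q], p_and_q, ["p", "q"]))  # conjunction: True
```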

Note, however, that we cannot infer something from every theorem (or collection of them). For example, $p \text{ OR } q ⊨ p$ is not an inference-rule: if we are told that $p \text{ OR } q$ is a theorem, there are three possible combinations of values of $p$ and $q$ that make it True, and in two of them $p$ is True, but in the other $p$ is False. If from a rule $P ⊨ p$ we can actually infer the conclusion $p$ , we say that the rule is valid; if not (as in the case of $p \text{ OR } q ⊨ p$ ) we call it a fallacy. A valid rule, then, is one in which it is impossible for all the premises to be True while the conclusion is False. A sound rule is a rule that is valid and all of whose premises are True. However, if we add to $p \text{ OR } q$ the premise $\neg q$ , the only combination where both are True is $q = F$ and $p = T$ , so we can then actually infer $p$ as a theorem. Thus $p \text{ OR } q ⊨ p$ is a fallacy, but $p \text{ OR } q, \neg q ⊨ p$ is a valid inference-rule, called disjunctive-syllogism. By the symmetry of OR, we can also use the disjunctive-syllogism to infer $q$ by adding $\neg p$ as a premise (i.e. $p \text{ OR } q, \neg p ⊨ q$ ).
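One way to see the difference between a valid rule and a fallacy is to list the rows of the truth-table where all premises are True and the conclusion is False. A valid rule has no such row. The sketch below assumes two atomic-propositions and uses a hypothetical helper called `counterexamples`.

```python
from itertools import product


def counterexamples(premises, conclusion):
    """Rows (p, q) where every premise is True and the conclusion is False."""
    return [
        (p, q)
        for p, q in product([True, False], repeat=2)
        if all(prem(p, q) for prem in premises) and not conclusion(p, q)
    ]


p_or_q = lambda p, q: p or q
not_p = lambda p, q: not p
not_q = lambda p, q: not q

# p OR q |= p has a counterexample (p = F, q = T), so it is a fallacy.
print(counterexamples([p_or_q], lambda p, q: p))         # [(False, True)]

# Both forms of the disjunctive-syllogism have no counterexample.
print(counterexamples([p_or_q, not_q], lambda p, q: p))  # []
print(counterexamples([p_or_q, not_p], lambda p, q: q))  # []
```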

Another very commonly used inference-rule in a zeroth-order-formal-system is $p \to q, p ⊨ q$ , known as modus-ponens. We can again check, looking at the definitions, that in all cases in which the premises are True (just one in this case), $q$ is True. Now suppose we are given $p \to q$ and $\neg q$ as premises. Using the contrapositive-law $p \to q ⟺ (\neg q) \to (\neg p)$ (introduced in the previous entry) we have that $\neg q \to \neg p$ is a theorem. From it and $\neg q$ we can infer $\neg p$ using modus-ponens ( $\neg q \to \neg p, \neg q ⊨ \neg p$ ). This inference-rule, represented as $p \to q, \neg q ⊨ \neg p$ , is called modus-tollens and is a variant of modus-ponens obtained through the contrapositive-law.
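The three claims of this paragraph (the contrapositive-law, modus-ponens and modus-tollens) can each be confirmed over the four rows of the truth-table. This is a small sketch in which `implies` encodes $\to$ as material implication; the variable names are illustrative only.

```python
from itertools import product


def implies(a, b):
    """Material implication a -> b."""
    return (not a) or b


rows = list(product([True, False], repeat=2))

# contrapositive-law: p -> q and (not q) -> (not p) agree on every row
contrapositive_ok = all(implies(p, q) == implies(not q, not p) for p, q in rows)

# modus-ponens: whenever p -> q and p are True, q is True
modus_ponens_ok = all(q for p, q in rows if implies(p, q) and p)

# modus-tollens: whenever p -> q and not q are True, not p is True
modus_tollens_ok = all(not p for p, q in rows if implies(p, q) and not q)

print(contrapositive_ok, modus_ponens_ok, modus_tollens_ok)  # True True True
```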

Note: If we are given two premises $p$ , $q$ we can infer (using conjunction) that $p \text{ AND } q$ is a theorem and, conversely, if we are given $p \text{ AND } q$ as a single premise we can infer (via simplification) $p$ and $q$ as two separate premises. For this reason, in the collection of premises, $p, q$ and $p \text{ AND } q$ are used interchangeably to state that both $p$ and $q$ are premises. For example, in some texts you can find the disjunctive-syllogism rule written as $(p \text{ OR } q) \text{ AND } (\neg p) ⊨ q$ , the modus-ponens rule written as $(p \to q) \text{ AND } p ⊨ q$ , the modus-tollens rule as $(p \to q) \text{ AND } \neg q ⊨ \neg p$ , etc.

As a final example, and as an example of an inference-rule with three different atomic-propositions, we present the inference-rule $p \to q, q \to r ⊨ p \to r$ , known as hypothetical-syllogism. Again, if you look at its truth-table, you’ll see that in all cases in which both $p \to q$ and $q \to r$ are True, the proposition $p \to r$ is also True. Therefore, from the propositions $p \to q$ and $q \to r$ we can actually infer $p \to r$ , making the hypothetical-syllogism a valid inference-rule.
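The truth-table for the hypothetical-syllogism has eight rows, one for each combination of $p$ , $q$ and $r$ . A short sketch that builds it and checks validity could look like this (the column layout and the helper names are just one possible choice):

```python
from itertools import product


def implies(a, b):
    """Material implication a -> b."""
    return (not a) or b


def tf(value):
    """Render a truth-value as T or F."""
    return "T" if value else "F"


# One tuple per row: p, q, r, p -> q, q -> r, p -> r
table = []
for p, q, r in product([True, False], repeat=3):
    table.append((p, q, r, implies(p, q), implies(q, r), implies(p, r)))

print("p q r | p->q q->r | p->r")
for p, q, r, pq, qr, pr in table:
    print(f"{tf(p)} {tf(q)} {tf(r)} |  {tf(pq)}    {tf(qr)}   |  {tf(pr)}")

# Valid: p -> r is True in every row where both premises are True.
valid = all(pr for _, _, _, pq, qr, pr in table if pq and qr)
print("valid:", valid)
```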

fallacies

As we saw, a rule from which we cannot actually infer the conclusion is called a fallacy. For example, we saw that, unlike the disjunctive-syllogism $p \text{ OR } q, \neg p ⊨ q$ , which is a valid inference-rule, the rule $p \text{ OR } q ⊨ q$ is a fallacy, since there are cases in which $p \text{ OR } q$ is True but $q$ is False. Many fallacies receive a name; for example, $p \text{ OR } q, p ⊨ \neg q$ is called the affirming-a-disjunct-fallacy (when $p$ and $q$ are both True, the premises are True but the conclusion $\neg q$ is False). Unlike the modus-ponens ( $p \to q, p ⊨ q$ ), the rule $p \to q, q ⊨ p$ is a fallacy (called the affirming-the-consequent-fallacy), since (as you can check in the truth-table) when $p \to q$ and $q$ are both True, $p$ could be False.

Another example of a common fallacy is $p \to q, \neg p ⊨ \neg q$ , known as the denying-the-antecedent-fallacy. Note that $p \to q$ is True when $\neg p$ is True (i.e., $p$ is False) for both values of $q$ (and therefore both values of $\neg q$ ), so nothing can be inferred about $\neg q$ .
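For the two implication fallacies, the offending row of the truth-table can be found by searching for a case where the premises are True and the conclusion is False. The sketch below does this directly; the list names are illustrative.

```python
from itertools import product


def implies(a, b):
    """Material implication a -> b."""
    return (not a) or b


rows = list(product([True, False], repeat=2))

# affirming-the-consequent: premises p -> q and q, conclusion p (False here)
affirming_consequent = [(p, q) for p, q in rows if implies(p, q) and q and not p]

# denying-the-antecedent: premises p -> q and not p, conclusion not q (False here)
denying_antecedent = [(p, q) for p, q in rows if implies(p, q) and (not p) and q]

print(affirming_consequent)  # [(False, True)]
print(denying_antecedent)    # [(False, True)]
```

Both fallacies fail on the same row, $p = F$ and $q = T$ , where $p \to q$ is True even though $p$ is False.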

examples

We saw that a zeroth-order-formal-system is a formal-system $F = (L, I, S)$ that uses the propositional-formal-language as its formal-language $L$ . Thus, the different types of zeroth-order-formal-systems are defined by different combinations of inference-system ( $I$ ) and axiomatic-system ( $S$ ). Now that we’ve explored some common inference-rules of a zeroth-order-formal-system, let’s examine a few concrete examples to illustrate how these systems are constructed and applied.

axiomatic-formal-systems

The first type of zeroth-order-formal-system we’ll talk about is known as axiomatic, since systems of this type all have an inference-system that contains just one inference-rule (the modus-ponens, i.e. $p \to q, p ⊨ q$ ), and therefore their power comes from the intelligent choice of axioms for the axiomatic-system. The first system of this kind was formally described by Gottlob Frege in his 1879 book “Begriffsschrift” (which is why it is called the Frege-axiomatic-system) and had the following six axioms:

Axiom 1: $p \to (q \to p)$
Axiom 2: $(p \to (q \to r)) \to ((p \to q) \to (p \to r))$
Axiom 3: $(p \to (q \to r)) \to (q \to (p \to r))$
Axiom 4: $(p \to q) \to (\neg q \to \neg p)$
Axiom 5: $\neg\neg p \to p$
Axiom 6: $p \to \neg\neg p$

Jan Łukasiewicz showed in 1930 that, in the Frege-axiomatic-system, the third axiom can be derived from the preceding two, and that the last three axioms can be replaced by the single one $(\neg p \to \neg q) \to (q \to p)$ , reducing the number of axioms to just three, in the system known as the Łukasiewicz-axiomatic-system:

Axiom 1: $p \to (q \to p)$
Axiom 2: $(p \to (q \to r)) \to ((p \to q) \to (p \to r))$
Axiom 3: $(\neg p \to \neg q) \to (q \to p)$
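To get a feeling for how a system with modus-ponens as its only inference-rule works, here is a sketch of the standard five-step derivation of $p \to p$ from Axioms 1 and 2. It assumes that each axiom can be used with any wffs substituted for $p$ , $q$ and $r$ (i.e. the axioms are read as schemas, which the list above does not state explicitly). Wffs are encoded as nested tuples, and all function names are hypothetical.

```python
def imp(a, b):
    """Build the wff a -> b as a nested tuple."""
    return ("->", a, b)


def axiom1(p, q):
    """Instance of Axiom 1: p -> (q -> p)."""
    return imp(p, imp(q, p))


def axiom2(p, q, r):
    """Instance of Axiom 2: (p -> (q -> r)) -> ((p -> q) -> (p -> r))."""
    return imp(imp(p, imp(q, r)), imp(imp(p, q), imp(p, r)))


def modus_ponens(implication, antecedent):
    """From a -> b and a, infer b; reject anything else."""
    op, a, b = implication
    if op != "->" or a != antecedent:
        raise ValueError("modus-ponens does not apply")
    return b


p = "p"
step1 = axiom2(p, imp(p, p), p)     # (p -> ((p -> p) -> p)) -> ((p -> (p -> p)) -> (p -> p))
step2 = axiom1(p, imp(p, p))        # p -> ((p -> p) -> p)
step3 = modus_ponens(step1, step2)  # (p -> (p -> p)) -> (p -> p)
step4 = axiom1(p, p)                # p -> (p -> p)
step5 = modus_ponens(step3, step4)  # p -> p
print(step5)  # ('->', 'p', 'p')
```

Every step is either an instance of an axiom or the result of modus-ponens on two earlier steps, which is exactly what "proving a theorem" means in this kind of system.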

Finally, ending a search of many years, Carew Meredith showed in 1953 that the number of axioms can be reduced to just one, known as the Meredith-axiom.

Axiom: $((((p \to q) \to (\neg r \to \neg s)) \to r) \to t) \to ((t \to p) \to (s \to p))$

This axiom allows us to have a zeroth-order-formal-system (called the Meredith-axiomatic-system) that not only has an inference-system with a single inference-rule (the modus-ponens), but also an axiomatic-system with a single axiom (the Meredith-axiom).
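Since a theorem of a zeroth-order-formal-system is meant to be a True proposition, a reasonable sanity check for an axiom is that it is True for every combination of truth-values of its atomic-propositions. The sketch below runs that check on the Meredith-axiom over its 32 rows. It only confirms that the axiom is always True; it does not show that every other theorem can be derived from it.

```python
from itertools import product


def implies(a, b):
    """Material implication a -> b."""
    return (not a) or b


def meredith(p, q, r, s, t):
    """((((p -> q) -> (not r -> not s)) -> r) -> t) -> ((t -> p) -> (s -> p))"""
    left = implies(implies(implies(implies(p, q), implies(not r, not s)), r), t)
    right = implies(implies(t, p), implies(s, p))
    return implies(left, right)


# True on all 2**5 = 32 rows of the truth-table
is_tautology = all(meredith(*row) for row in product([True, False], repeat=5))
print(is_tautology)  # True
```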

Note: todo axiomatic-formal-systems using only NAND (list in this article) and Wolfram’s axiom

inference-formal-system

The other type of zeroth-order-formal-system is known as inference-formal-system (or formal-system based on inference), since it takes a different approach: it eliminates axioms entirely and relies purely on a rich inference-system. The best known consists of 11 inference-rules designed to mimic the natural reasoning processes of mathematicians. They are modus-ponens, disjunctive-syllogism, conjunction, and 8 more listed and explained here. By starting without axioms and building proofs step-by-step using these rules, this system provides a flexible framework for constructing proofs that feels more intuitive. This approach contrasts with the axiomatic-formal-systems, emphasizing process over initial assumptions.

first-order-formal-system

Any formal-system $F = (L, I, S)$ that uses as its formal-language $L$ the predicative-formal-language is called a first-order-formal-system.

In general, a first-order-formal-system extends a zeroth-order-formal-system by adding machinery to handle predicates, quantifiers, and variables. The best-known and most widely used example, called the Hilbert-first-order-formal-system, extends an axiomatic-formal-system (which, as we saw, can be reduced to just one axiom, the Meredith-axiom) by extending its propositional-formal-language to a predicative-formal-language with $\forall$ as the unique quantifier (remember that the remaining quantifiers can be derived from any one of them), and extending the axiomatic-system with the following three axioms:

Q5. $\forall x (ϕ) \to ϕ [x := t]$ where $t$ may be substituted for $x$ in $ϕ$ .
Q6. $\forall x (ϕ \to ψ) \to (\forall x (ϕ) \to \forall x (ψ))$
Q7. $ϕ \to \forall x (ϕ)$ where $x$ is not free in $ϕ$ .

The Wikipedia article also says that we can equivalently obtain the Hilbert-first-order-formal-system by adding just the axiom Q5 to the axiomatic-system, while adding to the inference-system the predicative inference-rule known as universal-generalization, which makes Q6 and Q7 redundant.

todo: investigate and properly explain this last paragraph.

properties-of-formal-systems

As we saw, we say that we prove (inside a formal-system $F = (L, I, S)$ ) a wff $ϕ$ of the formal-language $L$ if we can mark it as a theorem applying inference-rules (of the inference-system $I$ ) over axioms (of the axiomatic-system $S$ ) or other theorems previously proved. We say that we disprove a wff $ϕ$ of $L$ when we prove its negation $\neg ϕ$ .

A formal-system is consistent if for every wff $ϕ$ in $L$ we cannot both prove it and disprove it, i.e. if we cannot prove both $ϕ$ and its negation $\neg ϕ$ as theorems.

Note: In an inconsistent formal-system there exists a wff $ϕ$ such that both $ϕ$ and $\neg ϕ$ are theorems, and then we can prove any wff as a theorem: $ϕ \land \neg ϕ$ is a contradiction, which lets us prove any wff $ψ$ using the modus-ponens ( $(ϕ \land \neg ϕ) \to ψ, (ϕ \land \neg ϕ) ⊨ ψ$ ). So, by contraposition, a formal-system is consistent if it does not contain contradictions or, equivalently, if there is at least one wff that we cannot prove.
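The implication used in this note (a contradiction implies any wff) is True on every row of its truth-table, which is what makes the modus-ponens step available. A minimal check, writing `phi` for the contradictory wff and `psi` for the arbitrary one (names chosen for this sketch):

```python
from itertools import product


def implies(a, b):
    """Material implication a -> b."""
    return (not a) or b


# (phi AND not phi) -> psi is True on every row, whatever psi is,
# because its antecedent is False on every row.
explosion_is_tautology = all(
    implies(phi and (not phi), psi)
    for phi, psi in product([True, False], repeat=2)
)
print(explosion_is_tautology)  # True
```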

A formal-system is complete if for every wff $ϕ$ in $L$ we can prove it or disprove it.

A formal-system is decidable if there exists an algorithm that, for any wff $ϕ$ in $L$ , can determine whether it is a theorem.

todo: first explain what the heck an algorithm formally is.
