On Complexity of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes

Item Type: Preprint
Authors: Ostonov, Azimkhon; Moshkov, Mikhail
Citation: Ostonov, A., & Moshkov, M. (2023). On Complexity of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes. https://doi.org/10.2139/ssrn.4510959
Eprint version: Pre-print
DOI: 10.2139/ssrn.4510959
Publisher: Elsevier BV
Rights: This is a preprint version of a paper and has not been peer reviewed. Archived with thanks to Elsevier BV.
Link to Item: http://hdl.handle.net/10754/693120

On Complexity of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes
Azimkhon Ostonov and Mikhail Moshkov
Computer, Electrical and Mathematical Sciences & Engineering Division and Computational Bioscience Research Center
King Abdullah University of Science and Technology (KAUST) Thuwal 23955-6900, Saudi Arabia
{azimkhon.ostonov,mikhail.moshkov}@kaust.edu.sa
Abstract
In this paper, we consider classes of conventional decision tables closed relative to removal of attributes (columns) and changing decisions assigned to rows. For tables from an arbitrary closed class, we study the dependence of the minimum complexity of deterministic and nondeterministic decision trees on the complexity of the set of attributes attached to columns. Note that a nondeterministic decision tree can be interpreted as a set of true decision rules that cover all rows of the table.
Keywords: closed classes of decision tables, deterministic decision trees, nondeterministic decision trees
1 Introduction
In this paper, we consider closed classes of conventional decision tables and study the dependence of the minimum complexity of deterministic and nondeterministic decision trees on the complexity of the set of attributes attached to columns.
A conventional decision table is a rectangular table in which columns are labeled with attributes, rows are pairwise different, and each row is labeled with a decision from the set ω = {0, 1, . . .}. Rows are interpreted as tuples of values of the attributes. For a given row, it is required to find the decision attached to the row. To this end, we can use the following queries: we can choose an attribute and ask for the value of this attribute in the considered row. We study two types of algorithms based on these queries: deterministic and nondeterministic decision trees. One can interpret nondeterministic decision trees for a decision table as a way to represent an arbitrary system of true decision rules for this table that cover all rows of the table. We consider a fairly wide class of complexity measures that characterize the time complexity of decision trees. Among them, we distinguish so-called bounded complexity measures, for example, the depth of decision trees.
Decision trees [1, 2, 6, 15, 16, 25, 27] and decision rule systems [4, 5, 8, 9, 17, 18, 22, 23]
are widely used as classifiers, as a means for knowledge representation, and as algorithms for solving various problems of combinatorial optimization, fault diagnosis, etc. Decision trees and rules are among the most interpretable models in data analysis [11].
The depth of deterministic and nondeterministic decision trees for computing Boolean functions (variables of a function are considered as attributes) was studied quite intensively [3, 10, 14, 28]. Note that the minimum depth of a nondeterministic decision tree for a Boolean function is equal to its certificate complexity [7].
We study classes of conventional decision tables closed under two operations: removal of columns and changing of decisions. The most natural examples of such classes are closed classes of decision tables generated by information systems [21]. An information system consists of a set of objects (universe) and a set of attributes (functions) defined on the universe and taking values from a finite set. A problem over an information system is specified by a finite number of attributes that divide the universe into nonempty domains in which these attributes have fixed values. A decision from the set ω is attached to each domain. For a given object from the universe, it is required to find the decision attached to the domain containing this object.
A decision table corresponds to this problem in a natural way: columns of this table are labeled with the considered attributes, rows correspond to domains and are labeled with the decisions attached to these domains. The set of decision tables corresponding to problems over an information system forms the closed class generated by this system. Note that the family of all closed classes is essentially wider than the family of closed classes generated by information systems. In particular, the union of two closed classes generated by two information systems is a closed class. However, generally, there is no information system that generates this class.
Various classes of objects that are closed under different operations are intensively studied. Among them, in particular, are classes of Boolean functions closed under the operation of superposition [24] and minor-closed classes of graphs [26]. Decision tables represent an interesting mathematical object deserving mathematical research, in particular, the study of closed classes of decision tables.
This paper continues the study of closed classes of decision tables that began with [13] and continued with [19, 20]. In [13], we studied the dependence of the minimum depth of deterministic decision trees and the depth of deterministic decision trees constructed by a greedy algorithm on the number of attributes (columns) for conventional decision tables from classes closed under the operations of removal of columns and changing of decisions.
In [19], we considered classes of decision tables with many-valued decisions closed under the operations of removal of columns, changing of decisions, permutation of columns, and duplication of columns. We studied relationships among three parameters of these tables: the complexity of a decision table (if we consider the depth of decision trees, then the complexity of a decision table is the number of columns in it), the minimum complexity of a deterministic decision tree, and the minimum complexity of a nondeterministic decision tree.
In [20], we studied classes of decision tables with 0-1-decisions (each row of such a table is labeled with the decision 0 or the decision 1) closed relative to removal of attributes (columns) and changing decisions assigned to rows. For tables from an arbitrary closed class, we studied the dependence of the minimum complexity of deterministic decision trees on various parameters of the tables: the minimum complexity of a test, the complexity of the set of attributes attached to columns, and the minimum complexity of a strongly nondeterministic decision tree. We also studied the dependence of the minimum complexity of strongly nondeterministic decision trees on the complexity of the set of attributes attached to columns. Note that a strongly nondeterministic decision tree can be interpreted as a set of true decision rules that cover all rows labeled with the decision 1.
Let A be a class of conventional decision tables closed under removal of columns and changing of decisions, and ψ be a bounded complexity measure. In this paper, we study two functions: Fψ,A(n) and Gψ,A(n).
The function Fψ,A(n) characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from A with the growth of the complexity of the set of attributes attached to columns of the table. We prove that the function Fψ,A(n) is either bounded from above by a constant, or grows as a logarithm of n, or grows almost linearly depending on n (it is bounded from above by n and is equal to n for infinitely many n). These results are generalizations of results obtained in [20] for closed classes of decision tables with 0-1-decisions.
The function Gψ,A(n) characterizes the growth in the worst case of the minimum complexity of a nondeterministic decision tree for a decision table from A with the growth of the complexity of the set of attributes attached to columns of the table. We prove that the function Gψ,A(n) is either bounded from above by a constant or grows almost linearly depending on n (it is bounded from above by n and is equal to n for infinitely many n).
The paper consists of six sections. In Sect. 2, main definitions and notation are considered.
In Sect. 3, we provide the main results. Section 4 contains auxiliary statements. In Sect. 5, we prove the main results. Section 6 contains short conclusions.
2 Main Definitions and Notation
Denote ω = {0, 1, 2, . . .} and, for any k ∈ ω \ {0, 1}, denote Ek = {0, 1, . . . , k−1}. Let P = {fi : i ∈ ω} be the set of attributes (more precisely, names of attributes). Two attributes fi, fj ∈ P are considered different if i ≠ j.
2.1 Decision Tables
First, we define the notion of a decision table.
Definition 1. Let k ∈ ω \ {0, 1}. Denote by Mk the set of rectangular tables filled with numbers from Ek in each of which rows are pairwise different, each row is labeled with a number from ω (decision), and columns are labeled with pairwise different attributes from the set P.
f2 f4 f3
1 1 1 1
0 1 1 3
1 1 0 1
1 0 1 2
0 0 1 2
1 0 0 2
0 0 0 2
Figure 1: Decision table from M2
Two tables from Mk are equal if one can be obtained from the other by a permutation of rows together with the decisions attached to them.
Example 1. Figure 1 shows a decision table from M2.
Denote by MkC the set of tables from Mk in each of which all rows are labeled with the same decision. Let Λ ∈ MkC denote the empty table.
Let T be a nonempty table from Mk. Denote by P(T) the set of attributes attached to columns of the table T. Let fi1, . . . , fim ∈ P(T) and δ1, . . . , δm ∈ Ek. We denote by T(fi1, δ1)···(fim, δm) the table obtained from T by the removal of all rows that do not satisfy the following condition: in the columns labeled with the attributes fi1, . . . , fim, the row has the numbers δ1, . . . , δm, respectively.
We now define two operations on decision tables: removal of columns and changing of decisions. Let T ∈ Mk.
Definition 2. Removal of columns. Let D ⊆ P(T). We remove from T all columns labeled with the attributes from the set D. In each group of rows equal on the remaining columns, we keep one row with the minimum decision. Denote the obtained table by I(D, T). In particular, I(∅, T) = T and I(P(T), T) = Λ. It is obvious that I(D, T) ∈ Mk.
Definition 3. Changing of decisions. Let ν : Ek|P(T)| → ω (by definition, Ek0 = ∅). For each row ¯δ of the table T, we replace the decision attached to this row with ν(¯δ). We denote the obtained table by J(ν, T). It is obvious that J(ν, T) ∈ Mk.
Definition 4. Denote [T] = {J(ν, I(D, T)) : D ⊆ P(T), ν : Ek|P(T)\D| → ω}. The set [T] is the closure of the table T under the operations of removal of columns and changing of decisions.
Example 2. Figure 2 shows the table J(ν, I(D, T0)), where T0 is the table shown in Fig. 1, D = {f4}, and ν(x1, x2) = x1 + x2.
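The operations I(D, T) and J(ν, T) of Definitions 2 and 3 can be sketched in Python so that Example 2 is reproduced. The encoding of tables (column names plus a dictionary from rows to decisions) and the function names are illustrative.

```python
# Operations on the Fig. 1 table: removal of columns I(D, T) and
# changing of decisions J(nu, T).
T0 = (("f2", "f4", "f3"),
      {(1, 1, 1): 1, (0, 1, 1): 3, (1, 1, 0): 1,
       (1, 0, 1): 2, (0, 0, 1): 2, (1, 0, 0): 2, (0, 0, 0): 2})

def remove_columns(table, D):
    """I(D, T): drop the columns in D; in each group of rows equal on the
    remaining columns keep the minimum decision."""
    attrs, rows = table
    keep = [i for i, f in enumerate(attrs) if f not in D]
    result = {}
    for row, dec in rows.items():
        proj = tuple(row[i] for i in keep)
        result[proj] = min(dec, result.get(proj, dec))
    return tuple(attrs[i] for i in keep), result

def change_decisions(table, nu):
    """J(nu, T): relabel each row delta with nu(delta)."""
    attrs, rows = table
    return attrs, {row: nu(*row) for row in rows}

# Reproduce Fig. 2: remove column f4, then set nu(x1, x2) = x1 + x2.
attrs, rows = change_decisions(remove_columns(T0, {"f4"}), lambda x1, x2: x1 + x2)
assert attrs == ("f2", "f3")
assert rows == {(1, 1): 2, (0, 1): 1, (1, 0): 1, (0, 0): 0}
```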
Definition 5. Let A ⊆ Mk and A ≠ ∅. Denote [A] = ∪T∈A [T]. The set [A] is the closure of the set A under the considered two operations. The class (the set) of decision tables A will be called a closed class if A = [A].
f2 f3
1 1 2
0 1 1
1 0 1
0 0 0
Figure 2: Decision table obtained from the decision table shown in Fig. 1 by removal of a column and changing of decisions
Figure 3: A deterministic decision tree for the decision table shown in Fig. 1
2.2 Deterministic and Nondeterministic Decision Trees
A finite tree with root is a finite directed tree in which exactly one node, called the root, has no entering edges. The nodes without leaving edges are called terminal nodes.
Definition 6. A k-decision tree is a finite tree with root, which has at least two nodes and in which
• The root and edges leaving the root are not labeled.
• Each terminal node is labeled with a decision from the setω.
• Each node, which is neither the root nor a terminal node, is labeled with an attribute from the set P. Each edge leaving such a node is labeled with a number from the set Ek.
Example 3. Figures 3 and 4 show 2-decision trees.
We denote by Tk the set of all k-decision trees. Let Γ ∈ Tk. We denote by P(Γ) the set of attributes attached to nodes of Γ that are neither the root nor terminal nodes. A complete path of Γ is a sequence τ = v1, d1, . . . , vm, dm, vm+1 of nodes and edges of Γ in which v1 is the root of Γ, vm+1 is a terminal node of Γ and, for j = 1, . . . , m, the edge dj leaves the node vj and enters the node vj+1. Let T ∈ Mk. If P(Γ) ⊆ P(T), then we correspond to the table T and the complete path τ a decision table T(τ). If m = 1, then T(τ) = T. If m > 1 and, for j = 2, . . . , m, the node vj is labeled with the attribute fij and the edge dj is labeled with the number δj, then T(τ) = T(fi2, δ2)···(fim, δm).
Figure 4: A nondeterministic decision tree for the decision table shown in Fig. 1
Definition 7. Let T ∈ Mk \ {Λ}. A deterministic decision tree for the table T is a k-decision tree Γ satisfying the following conditions:
• Only one edge leaves the root of Γ.
• For any node, which is neither the root nor a terminal node, edges leaving this node are labeled with pairwise different numbers.
• P(Γ) ⊆ P(T).
• For any row of T, there exists a complete path τ of Γ such that the considered row belongs to the table T(τ).
• For any complete path τ of Γ, either T(τ) = Λ or all rows of T(τ) are labeled with the decision attached to the terminal node of τ.
Example 4. The 2-decision tree shown in Fig. 3 is a deterministic decision tree for the decision table shown in Fig. 1.
Definition 8. Let T ∈ Mk \ {Λ}. A nondeterministic decision tree for the table T is a k-decision tree Γ satisfying the following conditions:
• P(Γ) ⊆ P(T).
• For any row of T, there exists a complete path τ of Γ such that the considered row belongs to the table T(τ).
• For any complete path τ of Γ, either T(τ) = Λ or all rows of T(τ) are labeled with the decision attached to the terminal node of τ.
Example 5. The 2-decision tree shown in Fig. 4 is a nondeterministic decision tree for the decision table shown in Fig. 1.
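As noted in the Introduction, a nondeterministic decision tree can be viewed as a system of true decision rules that cover all rows. The following sketch checks the truth and coverage conditions for a rule system for the Fig. 1 table; the three rules are constructed here for illustration only and are not claimed to be the tree of Fig. 4.

```python
# Fig. 1 table: rows -> decisions over columns f2, f4, f3.
ATTRS = ("f2", "f4", "f3")
ROWS = {(1, 1, 1): 1, (0, 1, 1): 3, (1, 1, 0): 1,
        (1, 0, 1): 2, (0, 0, 1): 2, (1, 0, 0): 2, (0, 0, 0): 2}

# A decision rule: a set of (attribute, value) conditions and a decision.
# Hypothetical rule system: f4=0 -> 2, f2=1 & f4=1 -> 1, f2=0 & f4=1 -> 3.
RULES = [({("f4", 0)}, 2),
         ({("f2", 1), ("f4", 1)}, 1),
         ({("f2", 0), ("f4", 1)}, 3)]

def matches(row, conds):
    """True if the row satisfies every condition of the rule."""
    return all(row[ATTRS.index(f)] == v for f, v in conds)

# Truth: every row matching a rule carries that rule's decision.
assert all(ROWS[row] == dec
           for conds, dec in RULES for row in ROWS if matches(row, conds))
# Coverage: every row matches at least one rule.
assert all(any(matches(row, conds) for conds, _ in RULES) for row in ROWS)
```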
2.3 Complexity Measures
Denote by B the set of all finite words over the alphabet P = {fi : i ∈ ω}, which contains the empty word λ and on which the word concatenation operation is defined.
Definition 9. A complexity measure is an arbitrary function ψ : B → ω that has the following properties: for any words α1, α2 ∈ B,
• ψ(α1) = 0 if and only if α1 = λ – the positivity property.
• ψ(α1) = ψ(α′1) for any word α′1 obtained from α1 by permutation of letters – the commutativity property.
• ψ(α1) ≤ ψ(α1α2) – the nondecreasing property.
• ψ(α1α2) ≤ ψ(α1) + ψ(α2) – the boundedness from above property.
The following functions are complexity measures:
• The function h for which, for any word α ∈ B, h(α) = |α|, where |α| is the length of the word α.
• An arbitrary function φ : B → ω such that φ(λ) = 0, for any fi ∈ B, φ(fi) > 0 and, for any nonempty word fi1···fim ∈ B,
φ(fi1···fim) = φ(fi1) + ··· + φ(fim). (1)
• An arbitrary function ρ : B → ω such that ρ(λ) = 0, for any fi ∈ B, ρ(fi) > 0, and, for any nonempty word fi1···fim ∈ B, ρ(fi1···fim) = max{ρ(fij) : j = 1, . . . , m}.
Definition 10. A bounded complexity measure is a complexity measure ψ, which has the boundedness from below property: for any word α ∈ B, ψ(α) ≥ |α|.
Any complexity measure satisfying the equality (1), in particular the function h, is a bounded complexity measure. One can show that if the functions ψ1 and ψ2 are complexity measures, then the functions ψ3 and ψ4 are complexity measures, where, for any α ∈ B, ψ3(α) = ψ1(α) + ψ2(α) and ψ4(α) = max(ψ1(α), ψ2(α)). If the function ψ1 is a bounded complexity measure, then the functions ψ3 and ψ4 are bounded complexity measures.
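The defining properties of complexity measures can be verified exhaustively on short words. In this sketch words are tuples of attribute indices, the empty tuple plays the role of λ, and the weights chosen for φ and ρ are arbitrary.

```python
import itertools

# Words over P are modeled as tuples of attribute indices.
h = len                                           # h(alpha) = |alpha|
w = {0: 2, 1: 1, 2: 3}                            # hypothetical positive weights
phi = lambda word: sum(w[i] for i in word)        # additive measure, cf. (1)
rho = lambda word: max((w[i] for i in word), default=0)   # max-type measure

words = [t for n in range(3) for t in itertools.product(range(3), repeat=n)]
for psi in (h, phi, rho):
    for a in words:
        assert (psi(a) == 0) == (a == ())                  # positivity
        for p in itertools.permutations(a):
            assert psi(p) == psi(a)                        # commutativity
        for b in words:
            assert psi(a) <= psi(a + b)                    # nondecreasing
            assert psi(a + b) <= psi(a) + psi(b)           # boundedness from above

# h and phi (positive integer weights) are bounded: psi(alpha) >= |alpha|.
assert all(phi(a) >= len(a) for a in words)
```

Note that ρ need not be bounded from below: with the weights above, ρ((1, 1)) = 1 < 2, in agreement with the text, which claims boundedness only for measures satisfying (1).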
Definition 11. Let ψ be a complexity measure. We extend it to the set of all finite subsets of the set P. Let D be a finite subset of the set P. If D = ∅, then ψ(D) = 0. Let D = {fi1, . . . , fim} and m ≥ 1. Then ψ(D) = ψ(fi1···fim).
2.4 Parameters of Decision Trees and Tables
Let Γ ∈ Tk and τ = v1, d1, . . . , vm, dm, vm+1 be a complete path of Γ. We correspond to the path τ a word F(τ) ∈ B: if m = 1, then F(τ) = λ, and if m > 1 and, for j = 2, . . . , m, the node vj is labeled with the attribute fij, then F(τ) = fi2···fim.
Definition 12. Let ψ be a complexity measure. We extend the function ψ to the set Tk. Let Γ ∈ Tk. Then ψ(Γ) = max{ψ(F(τ))}, where the maximum is taken over all complete paths τ of the decision tree Γ. For a given complexity measure ψ, the value ψ(Γ) will be called the complexity of the decision tree Γ. The value h(Γ) will be called the depth of the decision tree Γ.
Let ψ be a complexity measure. We now describe the functions ψd, ψa, ∆, Wψ, Sψ, Ŝψ, Mψ, and N defined on the set Mk and taking values from the set ω. By definition, the value of each of these functions for Λ is equal to 0. Let T ∈ Mk \ {Λ}.
• ψd(T) = min{ψ(Γ)}, where the minimum is taken over all deterministic decision trees Γ for the table T.
• ψa(T) = min{ψ(Γ)}, where the minimum is taken over all nondeterministic decision trees Γ for the table T.
• A set D ⊆ P(T) is called a separating set for the table T if the rows of the table T are pairwise different in the set of columns labeled with attributes from D. Then ∆(T) is the minimum cardinality of a separating set for the table T.
• Wψ(T) =ψ(P(T)).
• Let ¯δ be a row of the table T. Denote Sψ(T, ¯δ) = min{ψ(D)}, where the minimum is taken over all subsets D of the set P(T) such that in the set of columns of T labeled with attributes from D the row ¯δ is different from all other rows of the table T. Then Sψ(T) = max{Sψ(T, ¯δ)}, where the maximum is taken over all rows ¯δ of the table T.
• Ŝψ(T) = max{Sψ(T∗) : T∗ ∈ [T]}.
• If T ∈ MkC, then Mψ(T) = 0. Let T ∉ MkC, |P(T)| = n, and the columns of the table T be labeled with the attributes ft1, . . . , ftn. Let ¯δ = (δ1, . . . , δn) ∈ Ekn. Denote by Mψ(T, ¯δ) the minimum number p ∈ ω for which there exist attributes fti1, . . . , ftim ∈ P(T) such that T(fti1, δi1)···(ftim, δim) ∈ MkC and ψ(fti1···ftim) = p. Then Mψ(T) = max{Mψ(T, ¯δ) : ¯δ ∈ Ekn}.
• N(T) is the number of rows in the table T.
For the complexity measure h, we denote W(T) = Wh(T), S(T) = Sh(T), Ŝ(T) = Ŝh(T), and M(T) = Mh(T). Note that W(T) is the number of columns in the table T.
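For the table from Fig. 1 and the complexity measure h, the parameters N(T), W(T), S(T), and ∆(T) are small enough to compute by brute force. This illustrative sketch also checks, on this one table, the bounds stated in Lemmas 4 and 6 of Sect. 4.

```python
from itertools import combinations

# Decision table from Fig. 1: columns f2, f4, f3 (k = 2).
ROWS = {(1, 1, 1): 1, (0, 1, 1): 3, (1, 1, 0): 1,
        (1, 0, 1): 2, (0, 0, 1): 2, (1, 0, 0): 2, (0, 0, 0): 2}
N, W, k = len(ROWS), 3, 2

def separates(cols, row):
    """True if row differs from every other row in some column of cols."""
    return all(any(row[c] != other[c] for c in cols)
               for other in ROWS if other != row)

def S_row(row):
    """S(T, row) for psi = h: minimum number of distinguishing columns."""
    return min(len(cols) for r in range(W + 1)
               for cols in combinations(range(W), r) if separates(cols, row))

S = max(S_row(row) for row in ROWS)
# Delta(T): minimum size of a column set on which all rows are pairwise different.
Delta = min(r for r in range(W + 1)
            if any(len({tuple(row[c] for c in cols) for row in ROWS}) == N
                   for cols in combinations(range(W), r)))

assert (N, W, S, Delta) == (7, 3, 3, 3)
assert N <= (k * W) ** S       # Lemma 4 (Sect. 4): N(T) <= (kW(T))^S(T)
assert Delta <= N - 1          # Lemma 6 (Sect. 4): Delta(T) <= N(T) - 1
```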
3 Main Results
In this section, we consider results obtained for the functions Fψ,A and Gψ,A and discuss closed classes of decision tables generated by information systems.
3.1 Function Fψ,A
Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from Mk. We now define a function Fψ,A : ω → ω. Let n ∈ ω. Then
Fψ,A(n) = max{ψd(T) : T ∈ A, Wψ(T) ≤ n}.
The function Fψ,A characterizes the growth in the worst case of the minimum complexity of a deterministic decision tree for a decision table from A with the growth of the complexity of the set of attributes attached to columns of this table.
Let D = {ni : i ∈ ω} be an infinite subset of the set ω in which, for any i ∈ ω, ni < ni+1. Let us define a function HD : ω → ω. Let n ∈ ω. If n < n0, then HD(n) = 0. If, for some i ∈ ω, ni ≤ n < ni+1, then HD(n) = ni.
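A sketch of HD for a sample set D; the initial elements n0 < n1 < n2 below are an arbitrary choice, and only an initial segment of the infinite set D matters for small n.

```python
# H_D(n): 0 if n < n_0, otherwise the largest element n_i of D with n_i <= n.
def H(D, n):
    below = [m for m in D if m <= n]
    return max(below) if below else 0

D = [1, 3, 7]   # hypothetical initial elements n_0 = 1, n_1 = 3, n_2 = 7
assert [H(D, n) for n in range(9)] == [0, 1, 1, 3, 3, 3, 3, 7, 7]
```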
Theorem 1. Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from Mk. Then Fψ,A is an everywhere defined nondecreasing function such that Fψ,A(n) ≤ n for any n ∈ ω and Fψ,A(0) = 0. For this function, one of the following statements holds:
(a) If the functions Sψ and N are bounded from above on the class A, then there exists a positive constant c0 such that Fψ,A(n) ≤ c0 for any n ∈ ω.
(b) If the function Sψ is bounded from above on the class A and the function N is not bounded from above on the class A, then there exist positive constants c1, c2, c3, c4 such that c1 log2 n − c2 ≤ Fψ,A(n) ≤ c3 log2 n + c4 for any n ∈ ω \ {0}.
(c) If the function Sψ is not bounded from above on the class A, then there exists an infinite subset D of the set ω such that HD(n) ≤ Fψ,A(n) for any n ∈ ω.
3.2 Function Gψ,A
Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from Mk. We now define a function Gψ,A. Let n ∈ ω. Then
Gψ,A(n) = max{ψa(T) :T ∈A, Wψ(T)≤n}.
The function Gψ,A characterizes the growth in the worst case of the minimum complexity of a nondeterministic decision tree for a decision table from A with the growth of the complexity of the set of attributes attached to columns of this table.
Theorem 2. Let ψ be a bounded complexity measure and A be a nonempty closed class of decision tables from Mk. Then Gψ,A is an everywhere defined nondecreasing function such that Gψ,A(n) ≤ n for any n ∈ ω and Gψ,A(0) = 0. For this function, one of the following statements holds:
(a) If the function Sψ is bounded from above on the class A, then there exists a positive constant c such that Gψ,A(n)≤c for any n∈ω.
(b) If the function Sψ is not bounded from above on the class A, then there exists an infinite subset D of the set ω such that HD(n) ≤ Gψ,A(n) for any n ∈ ω.
3.3 Family of Closed Classes of Decision Tables
Let U be a set and Φ = {f0, f1, . . .} be a finite or countable set of functions (attributes) defined on U and taking values from Ek. The pair (U, Φ) is called a k-information system. A problem over (U, Φ) is an arbitrary tuple z = (U, ν, fi1, . . . , fin), where n ∈ ω \ {0}, ν : Ekn → ω, and fi1, . . . , fin are functions from Φ with pairwise different indices i1, . . . , in. The problem z is to determine the value ν(fi1(u), . . . , fin(u)) for a given u ∈ U. Various examples of k-information systems and problems over these systems can be found in [15].
We denote by T(z) a decision table from Mk with n columns labeled with attributes fi1, . . . , fin. A row (δ1, . . . , δn) ∈ Ekn belongs to the table T(z) if and only if the system of equations {fi1(x) =δ1, . . . , fin(x) =δn} has a solution from the set U. This row is labeled with the decisionν(δ1, . . . , δn).
We consider algorithms for solving the problem z in which each elementary operation consists in computing the value of some attribute from the set {fi1, . . . , fin} on a given element u ∈ U. Then, as a model of the problem z, we can use the decision table T(z), and, as models of algorithms solving the problem z, deterministic and nondeterministic decision trees for the table T(z).
Denote by Z(U,Φ) the set of problems over (U,Φ) and A(U,Φ) ={T(z) :z∈ Z(U,Φ)}.
One can show that A(U, Φ) = [A(U, Φ)], i.e., A(U, Φ) is a closed class of decision tables from Mk generated by the information system (U, Φ).
Closed classes of decision tables generated by k-information systems are the most natural examples of closed classes. However, the notion of a closed class is essentially wider. In particular, the union A(U1, Φ1) ∪ A(U2, Φ2), where (U1, Φ1) and (U2, Φ2) are k-information systems, is a closed class, but, generally, we cannot find an information system (U, Φ) such that A(U, Φ) = A(U1, Φ1) ∪ A(U2, Φ2).
3.4 Example of Information System
Let R be the set of real numbers and F = {fi : i ∈ ω} be the set of functions defined on R and taking values from the set E2 such that, for any i ∈ ω and a ∈ R, fi(a) = 0 if a < i and fi(a) = 1 if a ≥ i.
Let ψ be a bounded complexity measure and A = A(R, F). One can prove the following statements:
• The function N is not bounded from above on the set A.
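For this information system, a problem with attributes fi1, . . . , fin (increasing thresholds) yields a table T(z) whose rows are exactly the monotone tuples 1···10···0, so N(T(z)) = n + 1, which is one way to see that N is unbounded on A. A sketch (the function name is an illustrative choice):

```python
# Attributes on the universe R: f_i(a) = 0 if a < i and f_i(a) = 1 if a >= i.
# For increasing thresholds t_1 < ... < t_n, the realizable rows of T(z) are
# the monotone tuples (1, ..., 1, 0, ..., 0), n + 1 of them in total.
def realizable_rows(thresholds):
    """Rows (f_{t_1}(a), ..., f_{t_n}(a)) that occur for some real a:
    probe one point below all thresholds and one point at each threshold."""
    probes = [min(thresholds) - 1] + list(thresholds)
    return sorted({tuple(1 if a >= t else 0 for t in thresholds) for a in probes})

rows = realizable_rows([1, 2, 3])
assert rows == [(0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1)]
assert len(rows) == 3 + 1   # N(T(z)) = n + 1 grows without bound with n
```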
4 Auxiliary Statements
This section contains auxiliary statements.
It is not difficult to prove the following upper bound on the minimum complexity of deterministic decision trees for a table.
Lemma 1. For any complexity measure ψ and any table T from Mk, ψd(T) ≤ Wψ(T).
The notions of a decision table and a deterministic decision tree used in this paper are somewhat different from the corresponding notions used in [12]. Taking into account these differences, it is easy to prove the following statement, which follows almost directly from Lemma 1.3 and Theorem 2.2 from [12].
Lemma 2. For any complexity measure ψ and any table T from Mk, ψd(T) ≤ 0 if Mψ(T) = 0, and ψd(T) ≤ Mψ(T) log2 N(T) if Mψ(T) ≥ 1.
The following two statements are simple generalizations of similar results obtained in [20]
for decision tables with 0-1-decisions. For the sake of completeness, we present their proofs.
Lemma 3. For any complexity measure ψ and any table T from Mk, Mψ(T) ≤ 2Ŝψ(T).
Proof. Let T ∈ MkC. Then Mψ(T) = 0. Therefore Mψ(T) ≤ 2Ŝψ(T).
Let T ∉ MkC, W(T) = n, and ft1, . . . , ftn be the attributes attached to columns of the table T. Denote D = {fi : fi ∈ P(T), ψ(fi) ≤ Sψ(T)} and T∗ = I(P(T) \ D, T). Evidently, T∗ ∈ [T]. Taking into account that the function ψ has the nondecreasing property, we obtain that any two rows of T are different in columns labeled with attributes from the set D. For definiteness, let D = {ft1, . . . , ftm}.
Let ¯δ = (δ1, . . . , δn) ∈ Ekn. Let (δ1, . . . , δm) be a row of T∗. Since T∗ ∈ [T], there exist attributes ftj1, . . . , ftjs of the table T∗ such that the row (δ1, . . . , δm) is different from all other rows of T∗ in columns labeled with these attributes and ψ(ftj1···ftjs) ≤ Ŝψ(T). It is clear that T(ftj1, δj1)···(ftjs, δjs) ∈ MkC. Therefore Mψ(T, ¯δ) ≤ Ŝψ(T).
Let (δ1, . . . , δm) be not a row of T∗. We consider m tables T∗(ft1, δ1), T∗(ft1, δ1)(ft2, δ2), . . . , T∗(ft1, δ1)···(ftm, δm). If T∗(ft1, δ1) = Λ, then T(ft1, δ1) = Λ. Since ft1 ∈ D, ψ(ft1) ≤ Sψ(T). Taking into account that Sψ(T) ≤ Ŝψ(T), we obtain Mψ(T, ¯δ) ≤ Ŝψ(T). Let T∗(ft1, δ1) ≠ Λ. Then there exists p ∈ {1, . . . , m−1} such that T∗(ft1, δ1)···(ftp, δp) ≠ Λ and T∗(ft1, δ1)···(ftp+1, δp+1) = Λ. Denote C = {ftp+1, ftp+2, . . . , ftn} and T0 = I(C, T). Evidently, T0 ∈ [T]. Therefore in T0 there are attributes fti1, . . . , ftil such that the row (δ1, . . . , δp) is different from all other rows of the table T0 in columns labeled with these attributes and ψ(fti1···ftil) ≤ Ŝψ(T). Each row of T∗ that has the numbers δi1, . . . , δil in the columns fti1, . . . , ftil coincides with (δ1, . . . , δp) in the columns ft1, . . . , ftp. Therefore T∗(fti1, δi1)···(ftil, δil)(ftp+1, δp+1) = Λ. Hence
T(fti1, δi1)···(ftil, δil)(ftp+1, δp+1) = Λ.
Since ftp+1 ∈ D, ψ(ftp+1) ≤ Sψ(T) ≤ Ŝψ(T). Using the boundedness from above property of the function ψ, we obtain ψ(fti1···ftilftp+1) ≤ 2Ŝψ(T). Therefore Mψ(T, ¯δ) ≤ 2Ŝψ(T).
Thus, for any ¯δ ∈ Ekn, Mψ(T, ¯δ) ≤ 2Ŝψ(T). As a result, we obtain Mψ(T) ≤ 2Ŝψ(T).
Lemma 4. For any table T from Mk \ {Λ},
N(T) ≤ (kW(T))^S(T).
Proof. If N(T) = 1, then S(T) = 0 and the considered inequality holds. Let N(T) > 1. Then S(T) > 0. Denote m = S(T). Evidently, for any row ¯δ of the table T, there exist attributes fi1, . . . , fim ∈ P(T) and numbers σ1, . . . , σm ∈ Ek such that the table T(fi1, σ1)···(fim, σm) contains only the row ¯δ. Therefore there is a one-to-one mapping of rows of the table T onto some set G of pairs of tuples of the form ((fi1, . . . , fim), (σ1, . . . , σm)), where fi1, . . . , fim ∈ P(T) and σ1, . . . , σm ∈ Ek. Evidently, |G| ≤ W(T)^m k^m. Therefore N(T) ≤ (kW(T))^S(T).
Lemma 5. For any table T from Mk \ {Λ}, there exists a mapping ν : EkW(T) → ω such that
hd(J(ν, T)) ≥ logk N(T).
Proof. Let T ∈ Mk \ {Λ} and ν : EkW(T) → ω be a mapping for which ν(¯δ) ≠ ν(¯σ) for any ¯δ, ¯σ ∈ EkW(T) such that ¯δ ≠ ¯σ. Denote T∗ = J(ν, T). Let Γ be a deterministic decision tree for the table T∗ such that h(Γ) = hd(T∗). Denote by Lt(Γ) the number of terminal nodes of Γ. Evidently, N(T) ≤ Lt(Γ). One can show that Lt(Γ) ≤ k^h(Γ). Therefore N(T) ≤ k^h(Γ). Since T ≠ Λ, N(T) > 0. Hence h(Γ) ≥ logk N(T). Taking into account that h(Γ) = hd(T∗), we obtain hd(T∗) ≥ logk N(T).
It is not difficult to prove the following upper bound on the minimum cardinality of a separating set for a table by the induction on the number of rows in the table.
Lemma 6. For any table T from Mk \ {Λ},
∆(T) ≤ N(T) − 1.
Lemma 7. For any complexity measure ψ and any table T from Mk, ψa(T) ≤ ψd(T).
Proof. Let T ∈ Mk. If T = Λ, then ψa(T) = ψd(T) = 0. Let T ∈ Mk \ {Λ}. It is clear that any deterministic decision tree for the table T is also a nondeterministic decision tree for this table. Therefore ψa(T) ≤ ψd(T).
Lemma 8. For any complexity measure ψ and any table T from Mk, which contains at least two rows, there exists a table T∗ ∈ [T] such that
ψa(T∗) = ψd(T∗) = Wψ(T∗) = Sψ(T∗) = Sψ(T).
Proof. Let ¯δ be a row of the table T such that Sψ(T, ¯δ) = Sψ(T). Let D be a subset of the set P(T) with the minimum cardinality such that ψ(D) = Sψ(T, ¯δ) and in the set of columns labeled with attributes from D the row ¯δ is different from all other rows of the table T. Let ¯σ be the tuple obtained from the row ¯δ by the removal of all numbers that are in the intersection with columns labeled with attributes from the set P(T) \ D. Let ν : Ek|D| → E2 and, for any ¯γ ∈ Ek|D|, if ¯γ = ¯σ, then ν(¯γ) = 1 and if ¯γ ≠ ¯σ, then ν(¯γ) = 0. Denote T∗ = J(ν, I(P(T) \ D, T)).
From the fact that D has the minimum cardinality and from the properties of the function ψ it follows that, for any attribute from the set D, there exists a row of T, which is different from the row ¯δ only in the column labeled with the considered attribute among the attributes from D. Therefore, for any attribute of the table T∗, there exists a row of T∗, which is different from the row ¯σ only in the column labeled with this attribute. Thus,
Sψ(T∗, ¯σ) = Wψ(T∗). (2)
Using properties of the function ψ, we obtain Sψ(T∗) ≤ Wψ(T∗). From this inequality and from (2) it follows that
Sψ(T∗) = Wψ(T∗). (3)
Let Γ be a nondeterministic decision tree for the table T∗ such that ψ(Γ) = ψa(T∗), τ be a complete path of Γ such that the row ¯σ belongs to the table T∗(τ), and F(τ) = fi1···fit. It is clear that ¯σ is the only row of the table T∗(τ). Therefore in columns labeled with attributes fi1, . . . , fit the row ¯σ is different from all other rows of the table T∗. Thus, ψ(F(τ)) ≥ Sψ(T∗, ¯σ) = Wψ(T∗). Therefore ψ(Γ) ≥ Wψ(T∗) and ψa(T∗) ≥ Wψ(T∗). Using Lemmas 1 and 7, we obtain ψa(T∗) ≤ ψd(T∗) ≤ Wψ(T∗). Therefore
ψa(T∗) = ψd(T∗) = Wψ(T∗). (4)
By the choice of the set D, Wψ(T∗) = Sψ(T). From this equality and from (3) and (4) it follows that ψa(T∗) = ψd(T∗) = Wψ(T∗) = Sψ(T∗) = Sψ(T).
It is not difficult to prove the following upper bounds on the minimum complexity of nondeterministic decision trees for a table.
Lemma 9. For any complexity measure ψ and any table T from Mk, ψa(T) ≤ Sψ(T) ≤ Wψ(T).
5 Proofs of Theorems 1 and 2
Proof of Theorem 1. Since A is a closed class, Λ ∈ A. By definition, Wψ(Λ) = 0. Using this fact and Lemma 1, we obtain that Fψ,A is an everywhere defined function and Fψ,A(n) ≤ n for any n ∈ ω. Evidently, Fψ,A is a nondecreasing function. Let T ∈ A and Wψ(T) ≤ 0. Using the positivity property of the function ψ, we obtain T = Λ. Therefore Fψ,A(0) = 0.
(a) Let the functions Sψ and N be bounded from above on the class A. Then there are constants a ≥ 1 and b ≥ 2 such that Sψ(T) ≤ a and N(T) ≤ b for any table T ∈ A. Let T ∈ A. Taking into account that A is a closed class, we obtain Ŝψ(T) ≤ a. By Lemma 3, Mψ(T) ≤ 2a. From this inequality, the inequality N(T) ≤ b, and Lemma 2 it follows that ψd(T) ≤ 2a log2 b. Denote c0 = 2a log2 b. Taking into account that T is an arbitrary table from the class A, we obtain that Fψ,A(n) ≤ c0 for any n ∈ ω.
(b) Let the function Sψ be bounded from above on the class A and the function N be not bounded from above on A. Then there exists a constant a ≥ 2 such that, for any table Q ∈ A,
Sψ(Q) ≤ a. (5)
Let n ∈ ω \ {0} and T be an arbitrary table from A such that Wψ(T) ≤ n. If T ∈ MkC, then, evidently, ψd(T) = 0. Let T ∉ MkC. Using the boundedness from below property of the function ψ, we obtain W(T) ≤ n and S(T) ≤ a. From these inequalities and Lemma 4 it follows that N(T) ≤ (kn)^a. From (5) and Lemma 3 it follows that Mψ(T) ≤ 2a. Using the last two inequalities and Lemma 2, we obtain
ψd(T) ≤ 2a^2 log2 n + 2a^2 log2 k. (6)
Denote c3 = 2a^2 and c4 = 2a^2 log2 k. Then ψd(T) ≤ c3 log2 n + c4. Taking into account that n is an arbitrary number from ω \ {0} and T is an arbitrary table from A such that Wψ(T) ≤ n, we obtain that, for any n ∈ ω \ {0},
Fψ,A(n) ≤ c3 log2 n + c4. (7)
Let n ∈ ω \ {0}. Denote c1 = 1/log2 k and c2 = logk a. We now show that
Fψ,A(n) ≥ c1 log2 n − c2. (8)
Denote m = ⌊n/a⌋. Taking into account that the function N is not bounded from above on the set A, we obtain that there exists a table T ∈ A such that N(T) ≥ k^m. Let C be a separating set for the table T with the minimum cardinality such that ψ(fi) ≤ a for any fi ∈ C. The existence of such a set follows from the inequality (5) and the commutativity and nondecreasing properties of the function ψ. Evidently, |C| ≥ m. Let D be a subset of the set C such that |D| = m. Denote T0 = I(P(T) \ D, T). One can show that, for any attribute of T0, there are two rows of the table T0 that differ only in the column labeled with this attribute. Therefore D is a separating set for the table T0 with the minimum cardinality.
By Lemma 6, N(T0) ≥ m + 1 ≥ n/a.
By Lemma 5, there exists a mapping ν : Ekm → ω such that hd(J(ν, T0)) ≥ logk N(T0). Denote T∗ = J(ν, T0). Since ψ is a bounded complexity measure, ψd(T∗) ≥ hd(T∗) ≥ logk N(T0) ≥ logk(n/a).
Using the boundedness from above property of the function ψ, we obtain Wψ(T∗) ≤ ⌊n/a⌋a ≤ n.
Hence the inequality (8) holds. From (7) and (8) it follows that c1 log2 n − c2 ≤ Fψ,A(n) ≤ c3 log2 n + c4 for any n ∈ ω \ {0}.
(c) Let the function Sψ be not bounded from above on the class A. Using Lemma 8, we obtain that the set D = {Wψ(T) : T ∈ A, ψd(T) = Wψ(T)} is infinite. Since the class A is closed, Λ ∈ A and, since ψd(Λ) = Wψ(Λ) = 0, 0 ∈ D. Evidently, for any n ∈ D, Fψ,A(n) ≥ n. Taking into account that Fψ,A is a nondecreasing function, we obtain that Fψ,A(n) ≥ HD(n) for any n ∈ ω.
Proof of Theorem 2. Since A is a closed class, Λ ∈ A. By definition, Wψ(Λ) = 0. Using this fact and Lemma 9, we obtain that Gψ,A is an everywhere defined function and Gψ,A(n) ≤ n for any n ∈ ω. Evidently, Gψ,A is a nondecreasing function. Let T ∈ A and Wψ(T) ≤ 0. Using the positivity property of the function ψ, we obtain T = Λ. Therefore Gψ,A(0) = 0.
(a) Let the function Sψ be bounded from above on the class A. Then there is a constant c > 0 such that Sψ(T) ≤ c for any T ∈ A. By Lemma 9, ψa(T) ≤ Sψ(T) ≤ c for any T ∈ A. Therefore, for any n ∈ ω, Gψ,A(n) ≤ c.
(b) Let the function Sψ be not bounded from above on the class A. Using Lemma 8, we obtain that the set D = {Wψ(T) : T ∈ A, ψa(T) = Wψ(T)} is infinite. Since the class A is closed, Λ ∈ A and, since ψa(Λ) = Wψ(Λ) = 0, 0 ∈ D. Evidently, for any n ∈ D, Gψ,A(n) ≥ n. Taking into account that Gψ,A is a nondecreasing function, we obtain that Gψ,A(n) ≥ HD(n) for any n ∈ ω \ {0}.
6 Conclusions
In this paper, we studied the complexity of deterministic and nondeterministic decision trees for tables from closed classes of conventional decision tables in which each row is labeled with a decision from ω. Future research will be devoted to the study of deterministic and nondeterministic decision trees for tables from closed classes of decision tables with many-valued decisions in which rows are labeled with finite sets of decisions.
Acknowledgements
Research reported in this publication was supported by King Abdullah University of Science and Technology (KAUST).
References
[1] AbouEisha, H., Amin, T., Chikalov, I., Hussain, S., Moshkov, M.: Extensions of Dynamic Programming for Combinatorial Optimization and Data Mining, Intelligent Systems Reference Library, vol. 146. Springer (2019)
[2] Alsolami, F., Azad, M., Chikalov, I., Moshkov, M.: Decision and Inhibitory Trees and Rules for Decision Tables with Many-valued Decisions, Intelligent Systems Reference Library, vol. 156. Springer (2020)
[3] Blum, M., Impagliazzo, R.: Generic oracles and oracle classes (extended abstract). In:
28th Annual Symposium on Foundations of Computer Science, Los Angeles, California, USA, 27-29 October 1987, pp. 118–126. IEEE Computer Society (1987)
[4] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A.: Logical analysis of numerical data.
Math. Program. 79, 163–190 (1997)
[5] Boros, E., Hammer, P.L., Ibaraki, T., Kogan, A., Mayoraz, E., Muchnik, I.B.: An implementation of logical analysis of data. IEEE Trans. Knowl. Data Eng. 12(2), 292–306 (2000)
[6] Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth and Brooks (1984)
[7] Buhrman, H., de Wolf, R.: Complexity measures and decision tree complexity: a survey.
Theor. Comput. Sci. 288(1), 21–43 (2002)
[8] Chikalov, I., Lozin, V.V., Lozina, I., Moshkov, M., Nguyen, H.S., Skowron, A., Zielosko, B.: Three Approaches to Data Analysis - Test Theory, Rough Sets and Logical Analysis of Data, Intelligent Systems Reference Library, vol. 41. Springer (2013)
[9] Fürnkranz, J., Gamberger, D., Lavrac, N.: Foundations of Rule Learning. Cognitive Technologies. Springer (2012)
[10] Hartmanis, J., Hemachandra, L.A.: One-way functions, robustness, and the non-isomorphism of NP-complete sets. In: Proceedings of the Second Annual Conference on Structure in Complexity Theory, Cornell University, Ithaca, New York, USA, June 16-19, 1987. IEEE Computer Society (1987)
[11] Molnar, C.: Interpretable Machine Learning. A Guide for Making Black Box Models Explainable, 2 edn. (2022). URL christophm.github.io/interpretable-ml-book/
[12] Moshkov, M.: Conditional tests. In: S.V. Yablonskii (ed.) Problemy Kibernetiki (in Russian), vol. 40, pp. 131–170. Nauka Publishers, Moscow (1983)
[13] Moshkov, M.: On depth of conditional tests for tables from closed classes. In: A.A.
Markov (ed.) Combinatorial-Algebraic and Probabilistic Methods of Discrete Analysis (in Russian), pp. 78–86. Gorky University Press, Gorky (1989)
[14] Moshkov, M.: About the depth of decision trees computing Boolean functions. Fundam.
Informaticae 22(3), 203–215 (1995)
[16] Moshkov, M.: Comparative Analysis of Deterministic and Nondeterministic Decision Trees, Intelligent Systems Reference Library, vol. 179. Springer (2020)
[17] Moshkov, M., Piliszczuk, M., Zielosko, B.: Partial Covers, Reducts and Decision Rules in Rough Sets - Theory and Applications, Studies in Computational Intelligence, vol.
145. Springer (2008)
[18] Moshkov, M., Zielosko, B.: Combinatorial Machine Learning - A Rough Set Approach, Studies in Computational Intelligence, vol. 360. Springer (2011)
[19] Ostonov, A., Moshkov, M.: Comparative analysis of deterministic and nondeterministic decision trees for decision tables from closed classes. arXiv:2304.10594 [cs.CC] (2023). URL https://doi.org/10.48550/arXiv.2304.10594
[20] Ostonov, A., Moshkov, M.: Deterministic and strongly nondeterministic decision trees for decision tables from closed classes. arXiv:2305.06093 [cs.CC] (2023). URL https://doi.org/10.48550/arXiv.2305.06093
[21] Pawlak, Z.: Information systems theoretical foundations. Inf. Syst. 6(3), 205–218 (1981)
[22] Pawlak, Z.: Rough Sets - Theoretical Aspects of Reasoning about Data, Theory and Decision Library: Series D, vol. 9. Kluwer (1991)
[23] Pawlak, Z., Skowron, A.: Rudiments of rough sets. Inf. Sci. 177(1), 3–27 (2007)
[24] Post, E.: Two-valued Iterative Systems of Mathematical Logic, Annals of Math. Studies, vol. 5. Princeton Univ. Press, Princeton-London (1941)
[25] Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann (1993)
[26] Robertson, N., Seymour, P.D.: Graph minors. XX. Wagner's conjecture. J. Comb. Theory, Ser. B 92(2), 325–357 (2004)
[27] Rokach, L., Maimon, O.: Data Mining with Decision Trees - Theory and Applications, Series in Machine Perception and Artificial Intelligence, vol. 69. World Scientific (2007)
[28] Tardos, G.: Query complexity, or why is it difficult to separate NP^A ∩ coNP^A from P^A by random oracles A? Comb. 9(4), 385–392 (1989)