0% found this document useful (0 votes)
49 views

Slide 1

This document discusses computational algebraic statistics and its applications. It will cover three topics: 1) an introduction to Gröbner bases of polynomial rings, 2) applications of Gröbner bases theory in design of experiments, and 3) applications in sampling problems of contingency tables. Gröbner bases provide a systematic method for solving systems of polynomial equations without relying on factorization, as they allow variables to be systematically eliminated to obtain a single-variable equation. The document cites several references that provide introductions and applications of Gröbner bases theory in statistics.

Uploaded by

randomrandom221
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
49 views

Slide 1

This document discusses computational algebraic statistics and its applications. It will cover three topics: 1) an introduction to Gröbner bases of polynomial rings, 2) applications of Gröbner bases theory in design of experiments, and 3) applications in sampling problems of contingency tables. Gröbner bases provide a systematic method for solving systems of polynomial equations without relying on factorization, as they allow variables to be systematically eliminated to obtain a single-variable equation. The document cites several references that provide introductions and applications of Gröbner bases theory in statistics.

Uploaded by

randomrandom221
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 78

' $

Computational Algebraic Statistics


and its Applications

Satoshi Aoki (Kobe Univ.)

RIKEN iTHEMS
26 June, 2018

& %

1
' $

Contents

1. An introduction of Gröbner bases of polynomial rings

2. Gröbner bases theory in design of experiments

3. Gröbner bases theory in sampling problems of contingency tables

& %

2
' $

• References
1. Cox, D., Little, J. and O’Shea, D. (2000). Ideals, Varieties, and
Algorithms, An Introduction to Computational Algebraic Geometry
and Commutative Algebra. 3rd ed. Springer.
· · · An introduction of Gröbner basis theory.
2. Hibi, T. (ed). (2013). Gröbner Bases, Statistics and Software
Systems. Springer.
· · · Translation of [3]. Chap 1 by Hibi: An introduction of Gröbner
basis theory. Chap 4 by Aoki and Takemura: An application to
the analysis of contingency tables.
3. JST CREST 日比チーム (編). (2011). グレブナー道場. 共立出版.
4. 青木敏.計算代数統計 — グレブナー基底と実験計画法 —.統計学
One Point 第 9 巻.2018 年 8 月刊行予定.
· · · The slides 1 and 2 today are partially based on this book.

& %

3
' $
5. Pistone, G. and Wynn, H. P. (1996). Generalized confounding with
Gröbner bases. Biometrika, 83, 653–666.
· · · An application to the design of experiments. (One of the two
origins of computational algebraic statistics.)
6. Pistone, G., Riccomagno, E. and Wynn, H. P. (2001). Algebraic
statistics: Computational commutative algebra in statistics.
Chapman & Hall Ltd, Boca Raton.
· · · The first textbook of computational algebraic statistics.
7. Diaconis, P. and Sturmfels, B. (1998). Algebraic algorithms for
sampling from conditional distributions. Annals of Statistics. 26,
363–397.
· · · Application to the analysis of contingency tables. (One of the
two origins of computational algebraic statistics.)
8. Aoki, S., Hara, H. and Takemura, A. (2012). Markov bases in
statistics. Springer Series in Statistics.
· · · Application to the analysis of contingency tables.
& %

4
' $

1. An introduction of Gröbner bases of polynomial rings

& %

5
' $

1. Gröbner bases and algebraic equations

• The stage of the Gröbner bases is an ideal of a polynomial ring.


Prof. Takayama says, “Ideal is an algebraic equation.”

• Prob. 1 Solve the simultaneous linear equations:




⎨ x + 2y − z = 2

x + y − 4z = 3



x + 3y + 3z = 0

& %

6
' $

• Easy. (Linear algebra)




⎨ x + 2y − z
⎪ = 2
y + 3z = −1



z = −1

• Prob. 2 Solve the simultaneous algebraic equations:




⎪ x 2
+ y 2
+ 4z 2
= 81

x − y + z 2 = 13



xz − 2y = 18

& %

7
' $

• Answer: ⎧

⎪ x 2
+ y 2
+ 4z 2
= 81

x − y + z 2 = 13 (1)



xz − 2y = 18 (2)
From (2) − 2 × (1), we can eliminate y as

xz − 2x − 2z 2 = −8.

The answer is given from the factorization

(z − 2)(x − 2z − 4) = 0.

& %

8
' $

• Prob. 3 Solve the simultaneous algebraic equations:




⎪ x 2
+ y 2
+ 4z 2
= 90

x − y + z = 12



xz − 3y = 28

• Similarly to Prob. 2, we can eliminate y as

xz − 3x − 3z + 8 = 0,

however, we cannot factorize this.

& %

9
' $

• It is quite difficult to solve Prob. 3 by hand, though it seems to


be similar to Prob. 2.
• Prob. 2 is easy to solve because we can use the factorization of
the equation.

What is a general method to obtain an equation of one


variable from algebraic simultaneous equations, by not us-
ing factorization?

• The calculation of the Gröbner bases is an effective answer.

& %

10
' $

• ”Gröbner bases like” solution of Prob. 3




⎪ f1 = x 2
+ y 2
+ 4z 2
− 90 = 0

f2 = x − y + z − 12 = 0



f3 = xz − 3y − 28 = 0

◦ Aim: eliminate x, y and obtain the equation of z.


◦ From f2 = 0, substitute x = y − z + 12 into f3 , we have

f3 = z(y − z + 12) − 3y − 28 = yz − 3y − z 2 + 12z − 28.

We have
f4 = yz − 3y − z 2 + 12z − 28.

& %

11
' $

f1 = x2 + y 2 + 4z 2 − 90 = 0
f2 = x − y + z − 12 = 0
f4 = yz − 3y − z 2 + 12z − 28 = 0

◦ Similarly, substitute x = y − z + 12 into f1 , we have

f1 = (y − z + 12)2 + y 2 + 4z 2 − 90
= 2y 2 − 2yz + 24y + 5z 2 − 24z + 54. (∗)

We already have f4 = 0. Substitute yz = 3y + z 2 − 12z + 28 as

(∗) = 2y 2 − 2(3y + z 2 − 12z + 28) + 24y + 5z 2 − 24z + 54


= 2y 2 + 18y + 3z 2 − 2.

We have
f5 = y 2 + 9y + 3z 2 /2 − 1.
& %

12
' $

◦ Now we have the equivalent simultaneous equation




⎨ f2 = x − y + z − 12 = 0

f4 = yz − 3y − z 2 + 12z − 28 = 0



f5 = y 2 + 9y + 3z 2 /2 − 1 = 0.

◦ Next we want to eliminate y from f4 = f5 = 0. But we do not


use rational expression
z 2 − 12z + 28
y= .
z−3
We only consider polynomials.
Eliminate yz and y 2 , the leading terms of f4 , f5 .

& %

13
' $

◦ Calculate yf4 − zf5 :

yf4 − zf5
= y(yz − 3y − z 2 + 12z − 28) − z(y 2 + 9y + 3z 2 /2 − 1)
= −3y 2 − yz 2 + 3yz − 28y − 3z 3 /2 + z. (∗∗)

yz and y 2 can be replaced from f4 = f5 = 0 as

(∗∗) = −3(−9y − 3z 2 /2 + 1) − (z − 3)(3y + z 2 − 12z + 28)


−28y − 3z 3 /2 + z
= −3yz + 8y − 5z 3 /2 + 39z 2 /2 − 63z + 81
= −3(3y + z 2 − 12z + 28) + 8y − 5z 3 /2 + 39z 2 /2 − 63z + 81
= −y − 5z 3 /2 + 33z 2 /2 − 27z − 3.

We have f6 = y + 5z 3 /2 − 33z 2 /2 + 27z + 3.

& %

14
' $

◦ Now we have the equivalent simultaneous equations




⎨ f2 = x − y + z − 12 = 0

f4 = yz − 3y − z 2 + 12z − 28 = 0



f6 = y + 5z 3 /2 − 33z 2 /2 + 27z + 3 = 0.

◦ Now it is easy to obtain




⎪ x + 5z 3
/2 − 33z 2
/2 + 28z − 9 = 0

y + 5z 3 /2 − 33z 2 /2 + 27z + 3 = 0


⎩ 4
z − 48z 3 /5 + 31z 2 − 36z + 38/5 = 0.

The last one is an equation of z.

& %

15
' $

◦ Note: The third equation can be factorized as

(z 2 − 4z + 1)(z 2 − 28z/5 + 38/5) = 0.

The solution of Prob. 3 is


 √ √ 
7 ± 3 −13 ± 3 3 √
(x, y, z) = , ,2 ± 3 ,
2 2
 √ √ 
√ −26 ± 6 6 14 ± 6
4 ± 6, , .
5 5

& %

16
' $

• The set of the polynomials we have

x + 5z 3 /2 − 33z 2 /2 + 28z − 9,
y + 5z 3 /2 − 33z 2 /2 + 27z + 3,
z 4 − 48z 3 /5 + 31z 2 − 36z + 38/5

is a Gröbner basis.
• The polynomial deformation we done is a Buchberger algorithm.
• For general simultaneous algebraic equations, we can obtain the
polynomial of z in similar way, if x, y can be eliminated.
• Prof. Hibi says, “Gröbner basis is a powerful technique for
solving simultaneous equations”.

& %

17
' $

2. Ideals of polynomial rings

The stage of the Gröbner bases: polynomial rings in n variables.


• Monomial of the variables x1 , . . . , xn :
n

n

xai i = xa1 1 · · · xann , a1 , . . . , an ∈ Z≥0 . ai : degree
i=1 i=1

◦ Term: monomial with a nonzero coefficient


◦ Polynomial: finite sum of terms
2
◦ Example: f = −5x21 x2 x23 + x2 x34 x25 − x33 − 7 is a polynomial
3
2
with 4 terms −5x21 x2 x23 , x2 x34 x25 , −x33 , −7.
3
The monomials appearing in f are x21 x2 x23 , x2 x34 x25 , x33 , 1.

& %

18
' $

• K: field. (Example: K = Q or R or C.)


• K[x1 , . . . , xn ] : the set of all polynomials in the variables
x1 , . . . , xn with coefficients in K.
◦ Example:

x21 − 2x2 x3 ∈ R[x1 , x2 , x3 ],
2
2x21 x2 − x2 x43 + 1 ∈ Q[x1 , x2 , x3 ](⊂ R[x1 , x2 , x3 ])
3
• K[x1 , . . . , xn ] has the structure of a ring (i.e., sum and product),
and is called a polynomial ring in n variables over K.

& %

19
' $
• To introduce an ideal of K[x1 , . . . , xn ], consider Prof.
Takayama’s words: “Ideal is an algebraic equations”.
◦ For f1 (x1 , . . . , xn ), . . . , fr (x1 , . . . , xn ) ∈ K[x1 , . . . , xn ],
consider the simultaneous equation


⎪ f1 (x1 , . . . , xn ) = 0




⎨ f2 (x1 , . . . , xn ) = 0
⎪ ..

⎪ .



⎩ fr (x1 , . . . , xn ) = 0.

◦ Affine variety of f1 , . . . , fr :

V(f1 , . . . , fr ) = {(a1 , . . . , an ) ∈ K n | ∀i, fi (a1 , . . . , an ) = 0}.

◦ Solve the equation f1 = · · · = fr = 0


⇔ Derive the affine variety V(f1 , . . . , fr ).
& %

20
' $

◦ In Prob. 3, we have
 √ √ 
7 ± 3 −13 ± 3 3 √
V(f1 , f2 , f3 ) = , ,2 ± 3 ,
2 2
 √ √ 
√ −26 ± 6 6 14 ± 6
4 ± 6, ,
5 5

for K = R and

f1 = x2 + y 2 + 4z 2 − 90, f2 = x − y + z − 12, f3 = xz − 3y − 28.

How did we get it?

& %

21
' $

f1 = x2 + y 2 + 4z 2 − 90 = 0
f2 = x − y + z − 12 = 0
f3 = xz − 3y − 28 = 0

◦ From f1 = f2 = f3 = 0, “solve f2 = 0 for x and substitute it


into f3 ”, we have


⎪ f 1 = x 2
+ y 2
+ 4z 2
− 90 = 0

f2 = x − y + z − 12 = 0



f4 = yz − 3y − z 2 + 12z − 28 = 0.

Note that f3 − zf2 = f4 .

& %

22
' $

f1 = x2 + y 2 + 4z 2 − 90 = 0
f2 = x − y + z − 12 = 0
f4 = yz − 3y − z 2 + 12z − 28 = 0

◦ Next, “solve f2 = 0 for x and substitute it into f1 ”, and into


this, substitute the result of “solve f4 = 0 for yz”, and
multiply the result by 1/2, we have


⎨ f2 = x − y + z − 12 = 0

f4 = yz − 3y − z 2 + 12z − 28 = 0



f5 = y 2 + 9y + 3z 2 /2 − 1 = 0.

Note that
1 x + y − z + 12
f1 − f2 + f4 = f5 .
2 2
& %

23
' $
◦ Next, to eliminate yz of f4 and y 2 of f5 , consider yf4 − zf5 ,
and substitute yz = · · · from f4 = 0 and y 2 = · · · from f5 = 0
into it. Then multiply it by −1, we have


⎨ f2 = x − y + z − 12 = 0

f4 = yz − 3y − z 2 + 12z − 28 = 0



f6 = y + 5z 3 /2 − 33z 2 /2 + 27z + 3 = 0.
Note that
−1 × (yf4 − zf5 + 3f5 + (z − 3)f4 + 3f4 )
= (−y − z)f4 + (z − 3)f5 = f6 .

◦ Finally we have f6 = f7 = f8 = 0 as

f7 = f2 + f6 = x + 5z 3 /2 − 33z 2 /2 + 28z − 9
2 2z − 6 48 38
f8 = − f4 + f6 = z 4 − z 3 + 31z 2 − 36z + .
5 5 5 5
& %

24
' $

◦ Point: The deformations of polynomials above are expressed


as “multiplied by polynomials” and “addition” of polynomials.
• Now we define the set

f1 , . . . , fr
= {h1 f1 + · · · + hr fr | h1 , . . . , hr ∈ K[x1 , . . . , xr ]},

i.e., the set of polynomials that can be obtained from f1 , . . . , fr


by “multiplied by polynomials” and “addition”.
• In other words, f1 , . . . , fr
is the set of all polynomials that can
appear to solve the equation f1 = · · · = fr = 0.
• f1 , . . . , fr
is an important example of an ideal.

& %

25
' $

Definition A nonempty subset I of K[x1 , . . . , xn ] is called an ideal


of K[x1 , . . . , xn ] if the following conditions are satisfied.
• If f ∈ I, g ∈ I, then f + g ∈ I;
• If f ∈ I, h ∈ K[x1 , . . . , xn ], then hf ∈ I.

Proposition I = f1 , . . . , fr
is an ideal of K[x1 , . . . , xn ].

& %

26
' $

• f1 , . . . , fr
is the ideal generated by {f1 , . . . , fr }, i.e., finitely
generated ideal.
◦ In general, let {fλ | λ ∈ Λ} be a nonempty subset of
K[x1 , . . . , xn ], then we can show that


{fλ | λ ∈ Λ}
= finite sum hλ fλ , hλ ∈ K[x1 , . . . , xn ]
λ∈Λ

is an ideal of K[x1 , . . . , xn ].
◦ Conversely, for an ideal I ⊂ K[x1 , . . . , xn ], there exists
{fλ | λ ∈ Λ} ⊂ K[x1 , . . . , xn ] satisfying I = {fλ | λ ∈ Λ}
.
The subset {fλ | λ ∈ Λ} is called a system of generators of I.

& %

27
' $

• Fundamental problems for ideal:


1. Ideal description problem
Can every ideal I ⊂ K[x1 , . . . , xn ] be written as

I = f1 , . . . , fr

for some finite set {f1 , . . . , fr } ⊂ K[x1 , . . . , xn ]?


2. Ideal membership problem
For an ideal I ⊂ K[x1 , . . . , xn ], is there an algorithm to decide
whether a given f ∈ K[x1 , . . . , xn ] lies in I?
• The answers for both are “YES”.
• For the case of one variable polynomial ring K[x], both problems
can be solved by high school mathematics.
A key is a division algorithm.
& %

28
' $

Prob. 4
Divide f (x) = x4 + 2x3 − x2 + 4x − 1 by g(x) = x2 − 3x + 1.
Answer:
x2 + 5x + 13
x2 − 3x + 1 x4 +2x3 − x2 + 4x − 1
x4 −3x3 + x2
5x3 − 2x2 + 4x − 1
5x3 −15x2 + 5x
13x2 − x − 1
13x2 −39x+13
38x−14
The quotient is x2 + 5x + 13 and the remainder is 38x − 14.
& %

29
' $

• The expression we have is

f (x) = (x2 + 5x + 13)g(x) + 38x − 14.

• A division of f ∈ K[x] by g ∈ K[x] is to express f in the form

f (x) = q(x)g(x) + r(x),

where q(x) is the quotient and r(x) is the remainder satisfying

r(x) = 0 or deg(r(x)) < deg(g(x)).

q(x) and r(x) are decided uniquely.

& %

30
' $

• Point: “Divide by g(x) = x2 − 3x + 1“ means “replace x2 by


g(x) + 3x − 1 as many times as possible”.
x2 + 5x + 13
x2 − 3x + 1 x4 +2x3 − x2 + 4x − 1 x4 + 2x3 − x2 + 4x − 1
x4 −3x3 + x2 = x2 (g + 3x − 1) + 2x3 − x2 + 4x − 1
5x3 − 2x2 + 4x − 1 = x2 g + 5x3 − 2x2 + 4x − 1
5x3 −15x2 + 5x = x2 g + 5x(g + 3x − 1) − 2x2 + 4x − 1
13x2 − x − 1 = (x2 + 5x)g + 13x2 − x − 1
13x2 −39x+13 = (x2 + 5x)g + 13(g + 3x − 1) − x − 1
38x−14 = (x2 + 5x + 13)g + 38x − 14

& %

31
' $
• Ideal description problem for K[x]
◦ I ⊂ K[x]: ideal.
◦ g ∈ I: an element of I with a minimum degree.
◦ For each f ∈ I, we have the expression

f = qg + r,
where r = 0 or deg(r) < deg(g).

q ∈ K[x] and r ∈ K[x] are decided uniquely.


◦ From f, g ∈ I, we have r = f − qg ∈ I. Because g is an
element of I with a minimum degree, r = 0. Therefore each
f ∈ I can be expressed as f = qg, i.e., I is generated by g:

I = {qg | q ∈ K[x]} = g
.

• Each ideal I ⊂ K[x] is a principal ideal.


& %

32
' $

• Ideal membership problem for K[x]


To judge whether f ∈ K[x] is in g
⊂ K[x] or not, divide f by g,
and we have the expression

f = qg + r, q, r ∈ K[x],
where r = 0 or deg(r) < deg(g).

Then we have
r = 0 ⇔ f ∈ g
.

& %

33
' $

3. Monomial order and division algorithm

• To solve the ideal membership problem for n variables, consider


the division algorithm for K[x1 , . . . , xn ].
• Recall that “divide g = x2 − 3x + 1” means “replace x2 by
g + 3x − 1” for one variable case.
• How it can be generalized to n variable cases?
◦ “Divide f by h = x2 − yz” means “replace x2 in f with
h + yz”? or “replace yz in f with −h + x2 ”?
◦ Anyway, deg(f ) cannot be reduced. Also, we must define a
stopping rule.
• To generalize the division algorithm for K[x1 , . . . , xn ], we need
“order” on the set of monomials.
& %

34
' $

• Definition Let Mn be the set of monomials in the variables


x1 , . . . , xn . A monomial order on K[x1 , . . . , xn ] is a total order ≺
on Mn such that
(i) 1 ≺ u for all 1 = u ∈ Mn ;
(ii) If u, v ∈ Mn and u ≺ v, then uw ≺ vw for all w ∈ Mn .
◦ Example: An order by the degree

1 ≺ x ≺ x2 ≺ x 3 ≺ · · ·

is a monomial order on M1 .

& %

35
' $
• Examples of monomial orders:
a b
Definition Let u = x1 1 · · · xann , v = x11 · · · xbnn ∈ Mn .
(i) We define u ≺purelex v if the leftmost nonzero element of
(b1 − a1 , . . . , bn − an ) is positive. It follows ≺purelex is a
monomial order on K[x1 , . . . , xn ], called the pure
lexicographic order.
(ii) We define u ≺lex v if either (i) deg(v) > deg(u) or (ii)
deg(v) = deg(u) and the leftmost nonzero element of
(b1 − a1 , . . . , bn − an ) is positive. It follows ≺lex is a monomial
order on K[x1 , . . . , xn ], called the lexicographic order.
(iii) We define u ≺rev v if either (i) deg(v) > deg(u) or (ii)
deg(v) = deg(u) and the rightmost nonzero element of
(b1 − a1 , . . . , bn − an ) is negative. It follows ≺rev is a monomial
order on K[x1 , . . . , xn ], called the reverse lexicographic order.
& %

36
' $

• Prob. 5 Let n = 3 and x1 = x, x2 = y, x3 = z. List the


monomials of degree less than 5 with respect to
≺purelex , ≺lex , ≺rev induced by x y z, respectively.
Answer
◦ The pure lexicographic order:

x4 , x3 y, x3 z, x3 , x2 y 2 , x2 yz, x2 y, x2 z 2 , x2 z, x2 ,
xy 3 , xy 2 z, xy 2 , xyz 2 , xyz, xy, xz 3 , xz 2 , xz, x,
y 4 , y 3 z, y 3 , y 2 z 2 , y 2 z, y 2 , yz 3 , yz 2 , yz, y,
z 4 , z 3 , z 2 , z, 1

& %

37
' $

◦ The lexicographic order:

x4 , x3 y, x3 z, x2 y 2 , x2 yz, x2 z 2 , xy 3 , xy 2 z, xyz 2 , xz 3 ,
y 4 , y 3 z, y 2 z 2 , yz 3 , z 4 ,
x3 , x2 y, x2 z, xy 2 , xyz, xz 2 , y 3 , y 2 z, yz 2 , z 3 ,
x2 , xy, xz, y 2 , yz, z 2 , x, y, z, 1

◦ The reverse lexicographic order:

x4 , x3 y, x2 y 2 , xy 3 , y 4 , x3 z, x2 yz, xy 2 z, y 3 z,
x2 z 2 , xyz 2 , y 2 z 2 , xz 3 , yz 3 , z 4 ,
x3 , x2 y, xy 2 , y 3 , x2 z, xyz, y 2 z, xz 2 , yz 2 , z 3 ,
x2 , xy, y 2 , xz, yz, z 2 , x, y, z, 1

& %

38
' $

• We fix a monomial order ≺ on K[x1 , . . . , xn ].


• For each f ∈ K[x1 , . . . , xn ], we can define the initial monomial of
f with respect to ≺, in≺ (f ), as the largest monomial belonging
to f .
• We also write cf as the coefficient of in≺ (f ). The term
cf · in≺ (f ) is the initial term of f .
• Example: f = 2xy 4 − x3 z + 5x2 y 2 z + 1 ∈ K[x, y, z]
◦ For ≺purelex , in≺purelex (f ) = x3 z and the initial term is −x3 z.
◦ For ≺lex , in≺lex (f ) = x2 y 2 z and the initial term is 5x2 y 2 z.
◦ For ≺rev , in≺rev (f ) = xy 4 and the initial term is 2xy 4 .

& %

39
' $

• Division algorithm for K[x1 , . . . , xn ]


Let f, g1 , . . . , gs ∈ K[x1 , . . . , xn ]. Divide f by g1 , . . . , gs as
follows.
◦ Fix a monomial order ≺ on Mn .
◦ Suppose a monomial in f is divided by some
in≺ (g1 ), . . . , in≺ (gs ). Then replace in≺ (gi ) in f with

1 1
gi − gi − in≺ (gi ) .
cgi cgi

◦ Replace similarly as possible as we can.

& %

40
' $

• Theorem (Division algorithm) Fix a monomial order ≺ on Mn .


Let g1 , . . . , gs and f are nonzero polynomials in K[x1 , . . . , xn ].
Then there exists a standard form of f with respect to g1 , . . . , gs ,

f = q1 g1 + · · · + qs gs + r

for some q1 , . . . , qs , r ∈ K[x1 , . . . , xn ], satisfying


(i) If r = 0, then any monomial of r cannot be divided by any
in≺ (gi ).
(ii) If qi = 0, then in≺ (qi gi )  in≺ (f ) holds.
r is called a remainder of f with respect to g1 , . . . , gs .

& %

41
' $

• Prob. 6 For ≺lex on M3 , derive a standard form of


f = xyz + xz 2 − y 2 − 1 with respect to g1 = yz − x, g2 = xz − y 2 .

Answer. For the lexicographic order, the initial monomials of


g1 , g2 are
in≺lex (g1 ) = yz, in≺lex (g2 ) = xz.
Then, we “replace yz in f with g1 + x” or “replace xz in f with
g2 + y 2 ” as possible as we can.

& %

42
' $

g1 = yz − x, g2 = xz − y 2

f = xyz + xz 2 − y 2 − 1
= x(g1 + x) + xz 2 − y 2 − 1
= xg1 + x2 + xz 2 − y 2 − 1
= xg1 + x2 + z(g2 + y 2 ) − y 2 − 1
= xg1 + zg2 + x2 + y 2 z − y 2 − 1
= xg1 + zg2 + x2 + y(g1 + x) − y 2 − 1
= (x + y)g1 + zg2 + x2 + xy − y 2 − 1

This is a standard form of f with respect to g1 , g2 for ≺lex .

& %

43
' $

Another answer. Replace by g2 first, we have

f = xyz + xz 2 − y 2 − 1
= y(g2 + y 2 ) + xz 2 − y 2 − 1
= yg2 + y 3 + xz 2 − y 2 − 1
= yg2 + y 3 + z(g2 + y 2 ) − y 2 − 1
= (y + z)g2 + y 3 + y 2 z − y 2 − 1
= (y + z)g2 + y 3 + y(g1 + x) − y 2 − 1
= yg1 + (y + z)g2 + y 3 + xy − y 2 − 1.

This is also a standard form of f with respect to g1 , g2 for ≺lex .

For n variable cases, a standard form is not unique.

& %

44
' $

4. Gröbner bases and ideal membership problems

• Consider the ideal membership problem for K[x1 , . . . , xn ].


• For an ideal I = g1 , . . . , gs
⊂ K[x1 , . . . , xn ] and
f ∈ K[x1 , . . . , xn ], if we fix a monomial order ≺ on Mn , we have
a standard form of f with respect to g1 , . . . , gs by the division
algorithm,
f = q1 g1 + · · · + qs gs + r.

• If r = 0, then we know f ∈ I. However, r = 0 is not necessary


condition for f ∈ I.

& %

45
' $

• Example. For ≺lex on M3 ,

f = (x + y)g1 + zg2
= yg1 + (y + z)g2 + y 3 − x2

are standard forms of f = xyz + xz 2 − x2 − xy with respect to


g1 = yz − x, g2 = xz − y 2 .
• Here, write g3 = y 3 − x2 and express I = g1 , g2
as
I = g1 , g2 , g3
. Then we have standard forms of f with zero
remainder with respect to g1 , g2 , g3 ,

f = (x + y)g1 + zg2
= yg1 + (y + z)g2 + g3 .

• This {g1 , g2 , g3 } is a Gröbner basis of I.


& %

46
' $
• Consider the case that f ∈ I = g1 , . . . , gs
has a standard form

f = q1 g1 + · · · + qs gs + r,

where r = 0.
• Here, r = f − q1 g1 − · · · − qs gs ∈ I. Each monomial of r cannot
be divided by any of in≺ (g1 ), . . . , in≺ (gs ). In particular, in≺ (r)
cannot be divided by any of in≺ (g1 ), . . . , in≺ (gs ), i.e.,

in≺ (r) ∈ in≺ (g1 ), . . . , in≺ (gs )


.

• Conversely, for a monomial order ≺, if the set {g1 , . . . , gs }


satisfies the condition

f ∈ g1 , . . . , gs
⇒ in≺ (f ) ∈ in≺ (g1 ), . . . , in≺ (gs )
,

then it follows, for each f ∈ K[x1 , . . . , xn ], the remainder of f


with respect to g1 , . . . , gs is unique.
& %

47
' $

• The above consideration leads to the definition of the Gröbner


basis.
• Write the (monomial) ideal generated by {in≺ (f ) | f ∈ I} as

in≺ (I) = {in≺ (f ) | f ∈ I}


.

This is called an initial ideal of f with respect to ≺.

• Definition Fix a monomial order ≺ on Mn . Let I be an ideal


of K[x1 , . . . , xn ]. Then a Gröbner basis of I with respect to ≺ is
a finite set {g1 , . . . , gs } of polynomials in I satisfying

in≺ (I) = in≺ (g1 ), . . . , in≺ (gs )


.

& %

48
' $

• For I = g1 , . . . , gs
, in≺ (I) ⊃ in≺ (g1 ), . . . , in≺ (gs )
in general.
• The converse does not necessary hold.
Example.
For ≺lex on M3 , consider g1 = yz − x, g2 = xz − y 2 , I = g1 , g2
.
From y 3 − x2 = xg1 − yg2 ∈ I, in≺lex (y 3 − x2 ) = y 3 ∈ in≺lex (I)
holds. However, y 3 ∈ yz, xz
= in≺lex (g1 ), in≺lex (g2 )
, i.e.,
in≺lex (I) = in≺lex (g1 ), in≺lex (g2 )
.

& %

49
' $

• A finite Gröbner basis always exists.


⎛ ⎞
From Dickson’s lemma saying that “The set of
⎜ ⎟
⎜ minimal elements of a nonempty subset of Mn ⎟
⎝ ⎠
is at most finite”.

• A Gröbner basis of I is a system of generators of I.


⎛ ⎞
This yields the so-called Hilbert Basis Theorem.
⎜ ⎟
⎜ Therefore the ideal description problem reduces to ⎟
⎝ ⎠
a calculation of Gröbner basis for arbitrary fixed ≺.

& %

50
' $

• The remainder of the standard form with respect to the Gröbner


basis is unique.
⎛ ⎞
Therefore the ideal membership problem is solved
⎜ ⎟
⎜ by the division algorithm with respect to the ⎟
⎝ ⎠
Gröbner basis.

& %

51
' $

5. Buchberger criterion and Buchberger algorithm

• Given a system of generators of an ideal, how can we decide


whether they form its Gröbner basis or not?
• The answer is Buchberger criterion.
• Recall that {g1 , g2 } = {yz − x, xz − y 2 } is not a Gröbner basis of
I = g1 , g2
with respect to ≺lex because the initial monomial of
g3 = xg1 − yg2 ∈ I is not included in
in≺lex (g1 ), in≺lex (g2 )
= yz, xz
.
• Idea: check the element of I that can be produced from g1 , g2 by
“cancelling the initial monomials in≺ (g1 ), in≺ (g2 ) each other”.

& %

52
' $

• Write the least common multiple lcm(u, v) of two monomials


u = xa1 1 · · · xann and v = xb11 · · · xbnn as

lcm(u, v) = xc11 · · · xcnn , ci = max{ai , bi }, i = 1, . . . , n.

• Definition The polynomial

lcm(in≺ (f ), in≺ (g)) lcm(in≺ (f ), in≺ (g))


S(f, g) = f− g
cf · in≺ (f ) cg · in≺ (g)
is called the S-polynomial of f and g with respect to ≺, where
cf , cg are the coefficients of in≺ (f ), in≺ (g) in f, g, respectively.

& %

53
' $
• The S-polynomial of f and g is obtained by cancelling the initial
monomials of f and g.
• Example: g1 = yz − x, g2 = xz − y 2
Lexicographic order:

in≺lex (g1 ) = yz, in≺lex (g2 ) = xz,


S(g1 , g2 ) = xg1 − yg2 = −x2 + y 3
Reverse lexicographic order:

in≺lex (g1 ) = yz, in≺lex (g2 ) = y 2 ,


S(g1 , g2 ) = yg1 + zg2 = −xy + xz 2
Pure lexicographic order:

in≺lex (g1 ) = x, in≺lex (g2 ) = xz,


S(g1 , g2 ) = −zg1 − g2 = −yz 2 + y 2
& %

54
' $

• Theorem (Buchberger criterion)


Fix a monomial order ≺ on Mn . Let I be an ideal of
K[x1 , . . . , xn ] and G = {g1 , . . . , gs } is a system of generators of I.
Then G is a Gröber basis of I if and only if the following
condition is satisfied:
“For all i = j, the remainder of the standard form of S(gi , gj )
with respect to g1 , . . . , gs is 0.”

& %

55
' $

• Example:
g1 = yz − x, g2 = xz − y 2 , I = G
⊂ K[x, y, z], G = {g1 , g2 }
◦ Lexicographic order ≺lex
S(g1 , g2 ) = y 3 − x2 is a standard form w.r.t. G and is a
remainder itself. Therefore G is not a Gröbner basis of I.
Write g3 = y 3 − x2 and G = G ∪ {g3 }, then

S(g1 , g3 ) = x2 z − xy 2 = xg2
S(g2 , g3 ) = −y 2 g3 + x2 g2 (∗)

are standard forms w.r.t. G with 0 remainder. Therefore G


is a Gröbner basis of I w.r.t. ≺lex .

& %

56
' $

◦ Reverse lexicographic order ≺rev


S(g1 , g2 ) = xz 2 − xy is a standard form w.r.t. G and is a
remainder itself. Therefore G is not a Gröbner basis of I.
Write g3 = xz 2 − xy and G = G ∪ {g3 }, then

S(g1 , g3 ) = xy 2 − x2 z = −xg2
S(g2 , g3 ) = −xzg3 − xyg2 (∗)

are standard forms w.r.t. G with 0 remainder. Therefore G


is a Gröbner basis of I w.r.t. ≺rev .

& %

57
' $

◦ Pure lexicographic order ≺purelex


S(g1 , g2 ) = y 2 − yz 2 is a standard form w.r.t. G and is a
remainder itself. Therefore G is not a Gröbner basis of I.
Write g3 = y 2 − yz 2 and G = G ∪ {g3 }, then

S(g1 , g3 ) = −y 3 z + xyz 2 = −yzg3 − yz 2 g1 (∗)


S(g2 , g3 ) = xyz 3 − y 4 = −y 2 g3 + yz 2 g2 (∗)

are standard forms w.r.t. G with 0 remainder. Therefore G


is a Gröbner basis of I w.r.t. ≺purelex .

& %

58
' $

• As we have seen, Buchberger criterion is not only a criterion, but


supplies an element that is needed to be a Gröbner basis.
Buchberger algorithm
Fix a monomial order ≺. Let I = G
= g1 , . . . , gs
. If each
S-polynomial S(gi , gj ) has a standard form with 0 remainder
w.r.t. G, G is a Grb̈ner basis of I. Otherwise, if S(gi , gj ) has a
standard form with nonzero remainder rs+1 , add rs+1 to G.
Repeating this procedure, we can obtain a Gröbner basis after
finite number of steps.
 
The “finiteness” is important. This is from the fact
that the polynomial ring is Noetherian ring.

& %

59
' $

• To perform the Buchberger algorithm, the following lemma is


useful.
Lemma
Fix a monomial order ≺ on Mn . For f, g ∈ K[x1 , . . . , xn ],
suppose in≺ (f ) and in≺ (g) are relatively prime, i.e.,
lcm(in≺ (f ), in≺ (g)) = in≺ (f ) · in≺ (g). Then there exists a
standard form of S(f, g) with 0 remainder w.r.t. f, g.
 
Therefore the calculation of S-polynomials (∗) are
not needed in the previous Example in pages 56-58.

& %

60
' $

• Now we back to Prob. 3.


By the Buchberger algorithm, calculate the Gröbner basis of the
ideal I = f1 , f2 , f3
⊂ K[x, y, z] for ≺purelex , where

f1 = x2 + y 2 + 4z 2 − 90, f2 = x − y + z − 12, f3 = xz − 3y − 2.

& %

61
' $

• Let F = {f1 , f2 , f3 }. Check S(f1 , f2 ), S(f1 , f3 ), S(f2 , f3 ).

S(f2 , f3 ) = zf2 − f3
= −yz + 3y + z 2 − 12z + 28.

This is a standard form w.r.t. F and is a remainder itself. We


add
f4 = yz − 3y − z 2 + 12z − 28
to F . We have F = {f1 , f2 , f3 , f4 }.

& %

62
' $

S(f1 , f2 ) = f1 − xf2
= xy − xz + 12x + y 2 + 4z 2 − 90
= (y − z + 12)f2 − 2f4 + 2y 2 + 18y + 3z 2 − 2.

This is a standard form w.r.t. F . We add the remainder


2 3 2
f5 = y + 9y + z − 1
2
to F . We have F = {f1 , f2 , f3 , f4 , f5 }.

& %

63
' $

S(f1 , f3 ) = zf1 − xf3


= 3xy + 28x + y 2 z + 4z 3 − 90z
= (3y + 28)f2 − 12f4 + (z + 3)f5 + y + 5z 3 /2
−33z 2 /2 + 27z + 3.

This is a standard form w.r.t. F . We add the remainder


5 33
f6 = y + z 3 − z 2 + 27z + 3
2 2
to F . We have F = {f1 , f2 , f3 , f4 , f5 , f6 }.

& %

64
' $

• Now we have

f1 = x2 + y 2 + 4z 2 − 90,
f2 = x − y + z − 12,
f3 = xz − 3y − 2,
f4 = yz − 3y − z 2 + 12z − 28,
f5 = y 2 + 9y + 32 z 2 − 1,
f6 = y + 52 z 3 − 33 2
2 z + 27z + 3.

Checked: S(f1 , f2 ), S(f1 , f3 ), S(f2 , f3 ).


Check needed: S(f3 , f4 ), S(f4 , f5 ), S(f4 , f6 ), S(f5 , f6 ).

& %

65
' $

S(f3 , f4 ) = yf3 − zf4


= xyz − 3y 2 − yz 2 + 3yz − 28y + z 3 − 12z 2 + 28z
= (3y + 28)f2 + (z − 12)f3 + (x − z)f4 ,

S(f4 , f5 ) = yf4 − zf5


= −3y 2 − yz 2 + 3yz − 28y − 3z 3 /2 + z
= −zf4 − 3f5 − f6 .
These are standard forms w.r.t. F with 0 remainder.

& %

66
' $

S(f4 , f6 ) = f4 − zf6
= −3y − 5z 4 /2 + 33z 3 /2 − 28z 2 + 9z − 28
= −3f6 − 5z 4 /2 + 24z 3 − 155z 2 /2 + 90z − 19

This is a standard forms w.r.t. F . We add the remainder


48 3
4 2 38
f8 = z − z + 31z − 36z +
5 5
to F . We have F = {f1 , f2 , f3 , f4 , f5 , f6 , f8 }.

& %

67
' $

S(f5 , f6 ) = f5 − yf6
= −5yz 3 /2 + 33yz 2 /2 − 27yz + 6y + 3z 2 /2 − 1
= (−5z 2 /2 + 9z)f4 + 6f6 − (5/2)f8 .

This is a standard form w.r.t. F with 0 remainder.

& %

68
' $

• Now we have

f1 = x2 + y 2 + 4z 2 − 90,
f2 = x − y + z − 12,
f3 = xz − 3y − 2,
f4 = yz − 3y − z 2 + 12z − 28,
f5 = y 2 + 9y + 32 z 2 − 1,
f6 = y + 52 z 3 − 33 2
2 z + 27z + 3.,
48 3 38
f8 = z 4 − 5 z + 31z 2 − 36z + 5

Checked: S(f1 , f2 ), S(f1 , f3 ), S(f2 , f3 ), S(f3 , f4 ),


S(f4 , f5 ), S(f4 , f6 ), S(f5 , f6 ).
Check needed: S(f3 , f8 ), S(f4 , f8 ).

& %

69
' $

S(f3 , f8 ) = z 3 f3 − xf8
= 48xz 3 /5 − 31xz 2 + 36xz − 38x/5 − 3yz 3 − 28z 3
= −(38/5)f2 + (48z 2 /5 − 31z + 36)f3
+(−3z 2 + 99z/5 − 168/5)f4 − (2/5)f6 − 3f8 .

S(f4 , f8 ) = z 3 f4 − yf8
= 33yz 3 /5 − 31yz 2 + 36yz − 38y/5 − z 5 + 12z 4 − 28z 3
= (33z 2 /5 − 31z + 99z/5 − 57 + 297/5)f4
−(2/5)f6 − (z − 9)f8 .
These are standard forms w.r.t. F with 0 remainder.
Therefore F = {f1 , f2 , f3 , f4 , f5 , f6 , f8 } is a Gröbner basis of I
w.r.t. ≺purelex from the Buchberger criterion.
& %

70
' $

6. Elimination theory and solving simultaneous


equations

• Prof. Hibi says, “Gröbner basis is a powerful technique for


solving simultaneous equations”.
• This is due to the elimination theory. It is a fascinating result
which demonstrates the power of Gröbner bases.

& %

71
' $

• Recall that the Gröbner basis w.r.t. ≺purelex of


I = f1 , f2 , f3
⊂ Q[x, y, z], where

f1 = x2 + y 2 + 4z 2 − 90, f2 = x − y + z − 12, f3 = xz − 3y − 28

includes a polynomial of z,

5z 4 − 48z 3 + 155z 2 − 180z + 38,

as an element.
• In fact, the Gröbner basis w.r.t. the pure lexicographic order is
effective for solving simultaneous equations.

& %

72
' $

Proposition Let I = f1 , . . . , fr
is an ideal of K[x, y, z]. If
I ∩ K[z] = 0
, there is only one polynomial of K[z] in the
reduced Grob̈ner basis of I w.r.t. ≺purelex satisfying
x purelex y purelex z, and is unique. Write this element be
g ∗ ∈ G ∩ K[z], then I ∩ K[z] = g ∗
holds.

◦ Note that I ∩ K[z] is the set of elements in I with the variable


z. In other words, I ∩ K[z] is the set of polynomials that is
derived from the simultaneous equation
f1 (x, y, z) = · · · = fr (x, y, z) = 0 by eliminating x, y.
◦ I ∩ K[z] is an ideal of K[z].

& %

73
' $

◦ We have seen that each ideal of one variable polynomial ring


is a principal ideal, and the unique generator is the element
with the minimum degree. (see page 32)
◦ Therefore this Proposition means that g ∗ ∈ G ∩ K[z] is the
element of I ∩ K[z] with the minimum degree.
◦ In the previous example, we have
 4 3 2

I ∩ Q[z] = 5z − 48z + 155z − 180z + 38 .

& %

74
' $

• Extending Proposition.
• Let K[xi1 , xi2 , . . . , xim ] be the set of the polynomials in
K[x1 , . . . , xn ] with the variables xi1 , xi2 , . . . , xim
(1 ≤ i1 < i2 < · · · < im ≤ n). This is a polynomial ring.
• A monomial order ≺ on K[x1 , . . . , xn ] naturally induce the
monomial order ≺ on K[xi1 , xi2 , . . . , xim ].

& %

75
' $

• Theorem (The Elimination Theorem)


Let ≺ be a monomials order on K[x1 , . . . , xn ] and G a Gröbner
basis of an ideal I ⊂ K[x1 , . . . , xn ] w.r.t. ≺. Suppose that

g ∈ G, in≺ (g) ∈ K[xi1 , xi2 , . . . , xim ] ⇒ g ∈ K[xi1 , xi2 , . . . , xim ]

holds. Then G ∩ K[xi1 , xi2 , . . . , xim ] is a Gröbner basis of


I ∩ K[xi1 , xi2 , . . . , xim ] w.r.t. ≺ .

• The pure lexicographic order satisfies this condition.

& %

76
' $

• In the previous example, we have the Gröbner basis


G = {g1 , g2 , g3 },

g1 = 5z 4 − 48z 3 + 155z 2 − 180z + 38,


g2 = 2y + 5z 3 − 33z 2 + 54z + 6,
g3 = 2x + 5z 3 − 33z 2 + 56z − 18,

of I = g1 , g2 , g3
w.r.t. ≺purelex .
From the elimination theorem,
◦ {g1 } is a Gröbner basis of I ∩ K[z],
◦ {g1 , g2 } is a Gröbner basis of I ∩ K[y, z] w.r.t. y purelex z.

& %

77
' $

• Solving equation f1 = · · · = fr = 0 by the elimination theorem


1. Calculate the reduced Gröbner basis G of I = f1 , . . . , fr

w.r.t. x1 purelex x2 purelex · · · purelex xn .


2. If G ∩ K[xn ] = ∅, this is the reduced Gröbner basis of
I ∩ K[xn ]. Solve it for xn .
3. If G ∩ K[xn−1 , xn ] = ∅, this is the reduced Gröbner basis of
I ∩ K[xn−1 , xn ]. Substitute xn and solve it for xn−1 .
4. Similarly, obtain xn−2 , xn−3 , . . . , x1 in order.

& %

78

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy