Experimental2-Test-Persistence-Coins-Benford-Katona-Solutions

In [1]:

## loading python libraries

# necessary to display plots inline:
%matplotlib inline   

# load the libraries
import matplotlib.pyplot as plt # 2D plotting library
import numpy as np              # package for scientific computing  

import sympy as sympy             # package for symbolic computation
from sympy import *

from math import *              # package for mathematics (pi, arctan, sqrt, factorial ...)

Exercise 1. Playing with digits¶

Write a function ProductOfDigits(n) which returns the product of digits of $n$. For example, ProductOfDigits(2281)=32, since $2\times 2\times 8\times 1 =32$.
(Hint: Recall that in python 3 the euclidean division is obtained with //.)

In [9]:

def ProductOfDigits(n):
    # input: integer n
    # output: product of digits of n
    if n<10:
        return n
    else:
        return n%10*ProductOfDigits(n//10)
    
ProductOfDigits(2281)

#T=500
#plt.plot([ProductOfDigits(n) for n in range(T)],'.')
#plt.plot([k**(np.log(9)/np.log(10)) for k in range(T)])
#plt.show()

Out[9]:

For an integer $n$, we consider the following procedure. Multiply all the digits of $n$ by each other, repeating with the product until a single digit is obtained. Let us denote by $\mathrm{M}(n)$ the number of steps required to get one single digit number, starting from $n$. (If $n<10$ we set $\mathrm{M}(n)=0$.)

For example, for $n=2281$: $$ 2281 \stackrel{\text{1st step}}{\longrightarrow} 2\times 2\times 8\times 1= 32 \stackrel{\text{2d step}}{\longrightarrow} 3\times 2 =6. $$
Therefore $\mathrm{M}(2281)=2$.

Write a program which computes $\mathrm{M}(n)$.

In [13]:

def MultiplicativePersistence(n):
    # input: n=integers 
    # output: mult.persistence
    if n<10:
        return 0
    else:
        return MultiplicativePersistence(ProductOfDigits(n))+1

print(MultiplicativePersistence(2281))

Find the smallest integer $K$ such that $\mathrm{M}(K)\geq 7$.

In [12]:

n=1
PersistenceLargerThanSeven = False
while PersistenceLargerThanSeven == False:
    n=n+1
    if MultiplicativePersistence(n) > 6:
        PersistenceLargerThanSeven=True

print(str(n)+' has a persistence equal to '+str(MultiplicativePersistence(n)))

68889 has a persistence equal to 7

Let $\Pi(n)$ denote the product of digits of $n$. Prove that $\Pi(n) < n $ for every $n\geq 10$.
Deduce that $\mathrm{M}(n)<+\infty$ for every $n$.
(Bonus) Find a constant $c>0$ such that for every $n\geq 1$ one has $\mathrm{M}(n)\leq c\log(n)$.

Let $n\geq 1$ be an integer with $c+1$ digits and denote by $a_c,a_{c-1},a_{c-2},\dots a_1,a_0$ its digits (in basis $10$). Then $$ n=a_c10^c +a_{c-1}10^{c-1}+\dots +a_1\times 10+a_0 \geq a_c 10^c. $$ Besides $$ \Pi(n)=a_c\times a_{c-1}\times \dots \times a_0 \leq a_c\times9 \times 9 \times \dots \times 9 \leq a_c 9^{c} $$ (since each digit is less than $9$). If we combine our two inequalities for $c\geq 1$ we obtain $$ \Pi(n) < n. $$
This means that at each iteration of $\Pi$, the quantity is (strictly) decreasing. After at most $n-10$ steps, $n$ is mapped onto a integer $<10$.
The idea is that
- If $n$ is large then we prove that $\Pi(n)$ is way smaller than $n$
- If $n$ is small we use python.
Let us prove it rigously. Let $n\geq 1$. From above we have (with $c=\lfloor \log_{10}(n)\rfloor$): $$ \Pi(n)\leq a_c 9^{c} \leq \frac{n}{10^c}9^c = n(9/10)^{\lfloor \log_{10}(n)\rfloor} $$ i.e. $$ \frac{\Pi(n)}{n} \leq (9/10)^{\lfloor \log_{10}(n)\rfloor}. $$ Now, if $n>10^7$ (see script A below) then the right-hand side is $<0.5$ and therefore we have $\Pi(n) < n/2$. This means that:
- As long as $ n > 10^7 $, $ \Pi(n) $ decreases at least by a factor $2$ at each iteration. So after $s$ steps we are below $n \times (1/2)^s$. This is less than $10^7$ if $s\geq \log_2(n)$.
Finally, it takes less than $\log_2(n)$ steps to reach $10^7$ and then at most $8$ steps to reach $0$. We have proved $$ \mathrm{Mp}(n)\leq \log_2(n)+8. $$

In [8]:

# Script A
# Computing the first bound

n=1
u=(9/10)**(np.floor(np.log(n)/np.log(10)))
while u>0.5:
    u=(9/10)**(np.floor(np.log(n)/np.log(10)))
    n=n+1
    
print('Above ',n,' the ratio is less than 0.5')

Above  10000001  the ratio is less than 0.5

In [14]:

# Script B
# maximal persistence up to M

n=1
M=10**7
record=0

while n<= M:
    record=max(record,MultiplicativePersistence(n))
    n=n+1
    
print(record)

Exercise 2. $L$-decompositions (the coin-changing problem)¶

This problem is defined as follows. Let $L=[a_1,a_2,\dots,a_k]$ be a list of distinct positive integers.

For a fixed integer $n\geq 1$ we want to find the number of solutions $(x_1,\dots,x_k)$ to the equation

$$ a_1 x_1 +a_2 x_2 + \dots a_k x_k=n, \qquad\qquad (\star) $$
where $x_i$'s are non-negative integers. We denote by $H_n(L)$ be the number of such solutions. For example if $L=[1,2,5]$ then $H_6([1,2,5])=5$, since

\begin{align*} 6&=5+1 \\ &=2+2+2\\ &=2+ 2+1+1\\ &=2+1+1+1+1\\ &=1+1+1+1+1+1 \end{align*}

In other words, the solutions to $(\star)$ are: $$ (x_1,x_2,x_3)=(1,0,1),\quad (0,3,0),\quad (2,2,0),\quad (4,1,0),\quad (6,0,0) $$

The typical questions we will ask are:

How to compute $H_n(L)$ ?
How does $H_n(L)$ grow when $n\to +\infty$? (Can we find a simple equivalent?)

Throughout the exercise we focus on the case $L=[1,2,5]$.

Justify that for every $n\geq 3$:

$$ H_n([1,2])=1+ H_{n-2}([1,2]). $$
Prove a similar recurrence formula for $H_n([1,2,5])$: write $H_n([1,2,5])$ as a function of $H_1([1,2]),\dots,H_n([1,2])$ and $H_1([1,2,5]),\dots,H_{n-1}([1,2,5])$ (justify your formula).
Deduce from these recurrence formulas:
- A function `H_125(n)` which computes $H_n([1,2,5])$.(To check your result: $H_{30}([1,2,5])=58$.)
Show a numerical experiment which shows that $H_n([1,2,5])\sim cn^2$ for some positive constant $c$ (and find an approximation of $c$).

A solution in $H_n([1,2])$ ends either with a $1$ (in that case the solution is $1,1,\dots,1$) or a $2$. Hence: $$ H_n([1,2])=1+ H_{n-2}([1,2]). $$
A solution in $H_n([1,2,5])$ ends either with a $1$ , a $2$ or a $5$. Hence: $$ H_n([1,2,5])=1+ H_{n-2}([1,2])+H_{n-5}([1,2,5]) $$

In [3]:

def H_12(n):
    # return the number of solutions H_n([1,2])
    if n<=1:
        return 1
    elif n==2:
        return 2 # (2 solutions: 1+1 or 2)
    else:
        return 1+H_12(n-2)

print([H_12(k) for k in range(14)])

def H_125(n):
    # return the number of solutions H_n([1,2,5])
    if n<=4:
        return H_12(n) # one cannot use 5
    elif n==5:
        return H_12(n)+1 # one other solution: 5
    else:
        return 1+H_12(n-2)+H_125(n-5)
    
print(H_125(30))

NN=range(300,1000,2)
Y=[H_125(n)/(n**2/20) for n in NN]

plt.plot(NN,Y,'o-')
plt.show()

[1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6, 7, 7]
58

Exercise 3. First digit and powers of two¶

Write a function FirstDigit(n) which returns the leftmost digit of $n$:

FirstDigit(238) 2
Hint: Think recursive!

In [6]:

def FirstDigit(n):
    # input: integer n
    # output: First digit (in {1,...,9})
    if n<10:
        return n
    else:
        return FirstDigit(n//10)

# test
print(FirstDigit(516))

The Benford distribution is the probability distribution $(p_k)_{1\leq k\leq 9}$ on $\{1,2,\dots, 9\}$ defined by $$ p_k=\log_{10}(k+1)-\log_{10}(k), $$ where $\log_{10}$ stands for the logarithm in basis $10$. Benford's law of anomalous numbers states that for many data sets the leftmost digit is not uniformly distributed but follows rather the Benford distribution. We want to illustrate the fact that the data set $2,2^2,2^3,2^4,2^5,\dots,2^n$ satisfies Benford's law (when $n$ is large).

(Theory) Justify briefly that $(p_k)_{1\leq k\leq 9}$ is indeed a probability distribution.
Fix $n=1000$. Plot on the same figure:
- The histogram of the frequencies of $\{1,2,\dots,9\}$ among the leftmost digit of $2,2^2,2^3,2^4,2^5,\dots,2^n$.
- The distribution $k\mapsto (p_k)_{1\leq k\leq 9}$

To plot a normalized histogram of a list of data in $\{1,2,\dots,9\}$, use:
plt.hist(MyData, bins= [k+0.5 for k in range(10)], density=True, ec='k')

In [11]:

n=80

FirstDigitPowersOfTwo=[FirstDigit(2**i) for i in range(n)]
Benford=[np.log((k+1.0)/k)/np.log(10) for k in range(1,10)]

#print(FirstDigitPowersOfTwo)

plt.plot(np.arange(1,10),Benford,'ro',label='Benford distribution')
plt.hist(FirstDigitPowersOfTwo, bins= [k+0.5 for k in range(10)], density=True, ec='k')#,facecolor='g', alpha=0.2,label='Frequency of digits') # Histogramme normalise
plt.legend()
plt.show()

1. As $x\mapsto \log_{10}(x)$ is increasing the $p_k$'s are positive. Moreover \begin{align*} p_1+\dots +p_{9}&=\log_{10}(2)-\log_{10}(1)+\log_{10}(3)-\log_{10}(2)+\dots +\log_{10}(10)-\log_{10}(9)\\ &=\log_{10}(10)-\log_{10}(1)=1-0=1. \end{align*}

It can be proved with ergodic theory that the limiting frequencies (when $n\to +\infty$) are governed by the Benford distribution $B$: $$ \text{Frequency of }i \stackrel{n\to +\infty}{\to} B(i) :=\log_{10}((i+1)/i). $$ (See this link for a nice account on this problem.)

Exercise 4. Kruskal–Katona decomposition¶

The Kruskal–Katona decomposition states that for every integers $n,\ell\geq 1$ there exists an integer $t$ and $a_\ell>a_{\ell-1}>a_{\ell-2}>\dots >a_1$ with $$ n=\binom{a_\ell}{\ell} + \binom{a_{\ell-1}}{\ell-1} + \dots + \binom{a_t}{t}. $$ The decomposition is explicit and given by the following greedy recursive algorithm:

If $\ell=1$ then write $n=\binom{n}{1}$.
Otherwise, let $a_\ell$ be the largest integer such that $n\geq \binom{a_\ell}{\ell}$. Apply the algorithm recursively to decompose $n-\binom{a_\ell}{\ell}$.

For example for $n=62$, $\ell=5$ one gets \begin{align*} 62&=\binom{8}{5}+\binom{5}{4}+\binom{3}{3}\\ &= 56+5+1 \end{align*}

1. Write a function `a_max(n,l)` which returns the largest integer such that $n\geq \binom{a_\ell}{\ell}$. You can use
import scipy.special scipy.special.binom(n,l) 2.Write a function Katona(n,l) which returns the Kruskal-Katona decomposition of the pair $(n,\ell)$. The output must be a list of list of sizes $2$:

In [2]:

import scipy.special


int(scipy.special.binom(8, 5))

def a_max(n,l):
    # input: integers n,l
    # output: largest a such that \binom{a}{l} \leq n
    a=l
    #while int(scipy.special.binom(a,l))<=n:
    while scipy.special.binom(a,l)<=n:
        a=a+1
    return a-1

#print(scipy.special.binom(8,5))
#print(scipy.special.binom(21,2))

print(a_max(0,5))

def Katona(n,l):
    # input: integers n,l \geq 1
    # output: Katona's decomposition of n,l
    #if l>n:
    #    return []
    if n==0:
        return []
    elif l==1:
        return [[n,1]]
    else:
        a=a_max(n,l)
        #return [[a,l]]+Katona(n-int(scipy.special.binom(a,l)),l-1)
        return [[a,l]]+Katona(n-scipy.special.binom(a,l),l-1)
        
print(Katona(300,11))

4
[[13, 11], [12, 10], [11, 9], [10, 8], [9, 7], [7, 6], [6, 5], [5, 4], [3, 3], [2, 2]]

APM_2F005 - Algorithms For Discrete Mathematics Bachelor 2
Lecturer: Lucas Gerin

Experimental Maths 2: Multiplicative persistence, Benford distribution, coin-changing problem, Kruskal-Katona decomposition, ...¶

Table of contents¶

Exercise 1. Playing with digits¶

Exercise 2. $L$-decompositions (the coin-changing problem)¶

Exercise 3. First digit and powers of two¶

Exercise 4. Kruskal–Katona decomposition¶

APM_2F005 - Algorithms For Discrete Mathematics Bachelor 2Lecturer: Lucas Gerin

Experimental Maths 2: Multiplicative persistence, Benford distribution, coin-changing problem, Kruskal-Katona decomposition, ...¶

Table of contents¶

Exercise 1. Playing with digits¶

Exercise 2. $L$-decompositions (the coin-changing problem)¶

Exercise 3. First digit and powers of two¶

Exercise 4. Kruskal–Katona decomposition¶

APM_2F005 - Algorithms For Discrete Mathematics Bachelor 2
Lecturer: Lucas Gerin