Unit 2
Unit 2
UNIT - II
GREEDY METHOD
DEFINITION:
A problem with N inputs will have some constraints .any subsets that satisfy these constraints are
called a feasible solution.
A feasible solution that either maximize can minimize a given objectives function is called an
optimal solution.
* The function select an input from a[] and removes it. The select input value is assigned to X.
Feasible is a Boolean value function that determines whether X can be included into the solution
vector.
The function Union combines X with The solution and updates the objective function.
The function Greedy describes the essential way that a greedy algorithm will once a particular
problem is chosen ands the function subset, feasible & union are properly implemented.
Example
Suppose we have in a country the following coins are available :
Dollars(100 cents)
Quarters(25 cents)
Dimes( 10 cents)
Nickel(5 Cents)
Pennies(1 cent)
Our aim is paying a given amount to a customer using the smallest possible number of coins.
For example if we must pay 276 cents possible solution then,
KNAPSACK PROBLEM
we are given n objects and knapsack or bag with capacity M object I has a weight Wi where I
varies from 1 to N.
The problem is we have to fill the bag with the help of N objects and the resulting profit has to be
maximum.
There are so many ways to solve this problem, which will give many feasible solution for which we
have to find the optimal solution.
But in this algorithm, it will generate only one solution which is going to be feasible as well as
optimal.
First, we find the profit & weight rates of each and every object and sort it according to the
descending order of the ratios.
Select an object with highest p/w ratio and check whether its height is lesser than the capacity of the
bag.
If so place 1 unit of the first object and decrement .the capacity of the bag by the weight of the
object you have placed.
Repeat the above steps until the capacity of the bag becomes less than the weight of the object you
have selected .in this case place a fraction of the object and come out of the loop.
ALGORITHM:
Example:
Capacity=20
N=3 ,M=20
Wi=18,15,10
Pi=25,24,15
Pi/Wi=25/18=1.36,24/15=1.6,15/10=1.5
X1 X2 X3 WiXi PiXi
½ 1/3 ¼ 16.6 24.25
1 2/5 0 20 18.2
0 2/3 1 20 31
0 1 ½ 20 31.5
Of these feasible solution Solution 4 yield the Max profit .As we shall soon see this solution is optimal for
the given problem instance.
The problem is the number of jobs, their profit and deadlines will be given and we have to find a sequence
of job, which will be completed within its deadlines, and it should yield a maximum profit.
Points To remember:
To complete a job, one has to process the job or a action for one unit of time.
Only one machine is available for processing jobs.
A feasible solution for this problem is a subset of j of jobs such that each job in this subject can be
completed by this deadline.
If we select a job at that time ,
Since one job can be processed in a single m/c. The other job has to be in its waiting state until the job is
completed and the machine becomes free.
So the waiting time and the processing time should be less than or equal to the dead line of the job.
ALGORITHM:
Algorithm JS(d,j,n)
//The job are ordered such that p[1]>p[2]…>p[n]
//j[i] is the ith job in the optimal solution
// Also at terminal d [ J[ i]<=d[ J {i+1],1<i<k
{
d[0]= J[0]=0;
J[1]=1;
K=1;
For I =1 to n do
{ // consider jobs in non increasing order of P[I];find the position for I and check feasibility insertion
r=k;
while((d[J[r]]>d[i] )and
(d[J[r]] = r)do r =r-1;
if (d[J[r]]<d[I])and (d[I]>r))then
{
for q=k to (r+1) step –1 do J [q+1]=j[q]
J[r+1]=i;
K=k+1;
}
}
return k;
Example :
1. n=5 (P1,P2,…P5)=(20,15,10,5,1)
(d1,d2….d3)=(2,2,1,3,3)
(1) (1) 20
(2) (2) 15
(3) (3) 10
(4) (4) 5
(5) (5) 1
(1,2) (2,1) 35
(1,3) (3,1) 30
(1,4) (1,4) 25
(1,5) (1,5) 21
(2,3) (3,2) 25
(2,4) (2,4) 20
(2,5) (2,5) 16
(1,2,3) (3,2,1) 45
(1,2,4) (1,2,4) 40
2. n=4 (P1,P2,…P4)=(100,10,15,27)
(d1,d2….d4)=(2,1,2,1)
Let G(V,E) be an undirected connected graph with vertices ‘v’ and edge ‘E’.
A sub-graph t=(V,E’) of the G is a Spanning tree of G iff ‘t’ is a tree.3
The problem is to generate a graph G’= (V,E) where ‘E’ is the subset of E,G’ is a Minimum
spanning tree.
Each and every edge will contain the given non-negative length .connect all the nodes with edge
present in set E’ and weight has to be minimum.
NOTE:
We have to visit all the nodes.
The subset tree (i.e) any connected graph with ‘N’ vertices must have at least N-1 edges and also it
does not form a cycle.
Definition:
A spanning tree of a graph is an undirected tree consisting of only those edge that are necessary to
connect all the vertices in the original graph.
A Spanning tree has a property that for any pair of vertices there exist only one path between them
and the insertion of an edge to a spanning tree form a unique cycle.
1. Kruskal’s Algorithm
2. Prom’s Algorithm.
KRUSKAL’S ALGORITHM:
In kruskal's algorithm the selection function chooses edges in increasing order of length without
worrying too much about their connection to previously chosen edges, except that never to form a cycle.
The result is a forest of trees that grows until all the trees in a forest (all the components) merge in a single
tree.
Algorithm:
Algorithm kruskal(E,cost,n,t)
//Eset of edges in G has ‘n’ vertices.
//cost[u,v]cost of edge (u,v).tset of edge in minimum cost spanning tree
// the first cost is returned.
{
for i=1 to n do parent[I]=-1;
I=0;mincost=0.0;
While((I<n-1)and (heap not empty)) do
{
j=find(n);
k=find(v);
if(j not equal k) than
{
i=i+1
t[i,1]=u;
t[i,2]=v;
mincost=mincost+cost[u,v];
union(j,k);
}
}
if(i notequal n-1) then write(“No spanning tree”)
else return minimum cost;
}
Analysis
The time complexity of minimum cost spanning tree algorithm in worst case is O(|E|log|E|),
where E is the edge set of G.
Step 1. In the graph, the Edge(g, h) is shortest. Either vertex g or vertex h could be representative. Lets
choose vertex g arbitrarily.
Step 2. The edge (c, i) creates the second tree. Choose vertex c as representative for second tree.
Step 3. Edge (g, g) is the next shortest edge. Add this edge and choose vertex g as representative.
Step 5. Add edge (c, f) and merge two trees. Vertex c is chosen as the representative.
Step 6. Edge (g, i) is the next next cheapest, but if we add this edge a cycle would be created. Vertex c is
the representative of both.
Step 10. Again, if we add edge (b, c), it would create a cycle. Add edge (d, e) instead to complete the
spanning tree. In this spanning tree all trees joined and vertex c is a sole representative.
PRIM'S ALGORITHM
Start from an arbitrary vertex (root). At each stage, add a new branch (edge) to the tree already
constructed; the algorithm halts when all the vertices in the graph have been reached.
Algorithm prims(e,cost,n,t)
{
Let (k,l) be an edge of minimum cost in E;
Mincost :=cost[k,l];
T[1,1]:=k; t[1,2]:=l;
For I:=1 to n do
If (cost[i,l]<cost[i,k]) then near[i]:=l;
Else near[i]:=k;
Near[k]:=near[l]:=0;
B.BalaKonda reddy KKCITE Page 10
Design and Analysis of Algorithms Computer Science and Engg.
The prims algorithm will start with a tree that includes only a minimum cost edge of G.
Then, edges are added to the tree one by one. the next edge (i,j) to be added in such that I is a vertex
included in the tree, j is a vertex not yet included, and cost of (i,j), cost[i,j] is minimum among all the
edges.
Step 1: Step 2:
Step 3: Step 4:
Step 5: Step 6:
The problems defined by these questions are special case of the path problem we study in this section. The
length of a path is now defined to be the sum of the weights of the edges on that path. The starting vertex of
the path is referred to as the source and the last vertex the destination. The graphs are digraphs representing
streets. Consider a digraph G=(V,E), with the distance to be traveled as weights on the edges. The problem
is to determine the shortest path from v0 to all the remaining vertices of G. It is assumed that all the
weights associated with the edges are positive. The shortest path between v0 and some other node v is an
ordering among a subset of the edges. Hence this problem fits the ordering paradigm.
Example:
Consider the digraph of fig 7-1. Let the numbers on the edges be the costs of travelling along that route. If
a person is interested travel from v1 to v2, then he encounters many paths. Some of them are
1. v1 v2 = 50 units
2. v1 v3 v4 v2 = 10+15+20=45 units
3. v1 v5 v4 v2 = 45+30+20= 95 units
4. v1 v3 v4 v5 v4 v2 = 10+15+35+30+20=110 units
The cheapest path among these is the path along v1 v3 v4 v2. The cost of the path is 10+15+20 = 45
units. Even though there are three edges on this path, it is cheaper than travelling along the path connecting
v1 and v2 directly i.e., the path v1 v2 that costs 50 units. One can also notice that, it is not possible to
travel to v6 from any other node.
To formulate a greedy based algorithm to generate the cheapest paths, we must conceive a multistage
solution to the problem and also of an optimization measure. One possibility is to build the shortest paths
one by one. As an optimization measure we can use the sum of the lengths of all paths so far generated. For
this measure to be minimized, each individual path must be of minimum length. If we have already
constructed i shortest paths, then using this optimization measure, the next path to be constructed should be
the next shortest minimum length path. The greedy way to generate these paths in non-decreasing order of
path length. First, a shortest path to the nearest vertex is generated. Then a shortest path to the second
nearest vertex is generated, and so on.
A much simpler method would be to solve it using matrix representation. The steps that should be followed
is as follows,
Step 1: find the adjacency matrix for the given graph. The adjacency matrix for fig 7.1 is given below
V1 V2 V3 V4 V5 V6
V1 - 50 10 Inf 45 Inf
Step 2: consider v1 to be the source and choose the minimum entry in the row v1. In the above table the
minimum in row v1 is 10.
Step 3: find out the column in which the minimum is present, for the above example it is column v3.
Hence, this is the node that has to be next visited.
Step 4: compute a matrix by eliminating v1 and v3 columns. Initially retain only row v1. The second row is
computed by adding 10 to all values of row v3.
The resulting matrix is
V2 V4 V5 V6
Minimum 50 25 45 inf
Step 5: find the minimum in each column. Now select the minimum from the resulting row. In the above
example the minimum is 25. Repeat step 3 followed by step 4 till all vertices are covered or single column
is left.
The solution for the fig 7.1 can be continued as follows
V2 V5 V6
V1 Vw 50 45 Inf
Minimum 45 45 inf
V5 V6
V1 Vw 45 Inf
Minimum 45 inf
Minimum inf
DYNAMIC PROGRAMMING
General method-multistage graphs-all pair shortest path algorithm-0/1 knapsack and traveling
salesman problem-chained matrix multiplication-approaches using recursion-memory functions
DYNAMIC PROGRAMING
The idea of dynamic programming is thus quit simple: avoid calculating the same thing twice,
usually by keeping a table of known result that fills up a sub instances are solved.
When a problem is solved by divide and conquer, we immediately attack the complete instance,
which we then divide into smaller and smaller sub-instances as the algorithm progresses.
We usually start with the smallest and hence the simplest sub- instances.
By combining their solutions, we obtain the answers to sub-instances of increasing size, until finally
we arrive at the solution of the original instances.
The essential difference between the greedy method and dynamic programming is that the greedy
method only one decision sequence is ever generated.
Let G=<N,A> be a directed graph ’N’ is a set of nodes and ‘A’ is the set of edges.
We want to calculate the length of the shortest path between each pair of nodes.
Suppose the nodes of G are numbered from 1 to n, so N={1,2,...N},and suppose G matrix L gives the
length of each edge, with L(i,j)=0 for i=1,2...n,L(i,j)>=for all i & j, and L(i,j)=infinity, if the edge (i,j)
does not exist.
The principle of optimality applies: if k is the node on the shortest path from i to j then the part of
the path from i to k and the part from k to j must also be optimal, that is shorter.
Copy the above matrix-to-matrix D, which will give the direct distance between nodes.
We have to perform N iteration after iteration k.the matrix D will give you the distance between
nodes with only (1,2...,k)as intermediate nodes.
At the iteration k, we have to check for each pair of nodes (i,j) whether or not there exists a path from
i to j passing through node k.
D0 =L= 0 5
50 0 15 5
30 0 15
15 5 0
1 75 11 12 - -
2 72 21 - - 24
3 3 - 32 - -
4 41 41 – 43 -
vertex 1:
7 5 11 12 - -
7 12 2 21 212 - 24
3 - 32 - -
4 9 1 41 412 43 –
vertex 2:
7 5 7 11 12 - 124
7 12 2 21 212 - 24
10 3 5 321 32 - 324
4 9 1 11 41 412 43 4124
vertex 3:
7 5 7 11 12 - 124
7 12 2 21 212 - 24
10 3 5 321 32 - 324
4 4 1 6 41 432 43 4324
vertex 4:
7 5 8 7 11 12 1243 124
6 6 3 2 241 2432 243 24
9 3 6 5 3241 32 3243 324
4 4 1 6 41 432 43 4324
At 0th iteration it nil give you the direct distances between any 2
nodes
D0= 0 5
50 0 15 5
30 0 15
15 5 0
0 5
50 0 15 5 p[3,2]= 1
D1= 30 35 0 15 p[4,2]= 1
15 20 5 0
15
30
5
5 50 5 15
15
likewise we have to find the value for N iteration (ie) for N nodes.
0 5 20 10 P[1,3] = 2
D2= 50 0 15 5 P[1,4] = 2
30 35 0 15
15 20 5 0
0 5 20 10
D3= 45 0 15 5 P[2,1]=3
30 35 0 15
15 20 5 0
0 5 15 10
20 0 10 5 P[1,3]=4
D4= 30 35 0 15 P[2,3]=4
15 20 5 0
If you want the exact path then we have to refer the matrix p.The
matrix will be,
0042
3040 0 direct path
P= 0100
0100
ALGORITHM :
D=L
For k = 1 to n do
For i = 1 to n do
For j = 1 to n do
D [ i , j ] = min (D[ i, j ], D[ i, k ] + D[ k, j ]
Return D
ANALYSIS:
MULTISTAGE GRAPH
1. A multistage graph G = (V,E) is a directed graph in which the vertices are portioned into K > = 2
disjoint sets Vi, 1 <= i<= k.
2. In addition, if < u,v > is an edge in E, then u < = Vi and V Vi+1 for some i, 1<= i < k.
3. If there will be only one vertex, then the sets Vi and Vk are such that [Vi]=[Vk] = 1.
4. Let ‘s’ and ‘t’ be the source and destination respectively.
5. The cost of a path from source (s) to destination (t) is the sum of the costs of the edger on the path.
6. The MULTISTAGE GRAPH problem is to find a minimum cost path from ‘s’ to ‘t’.
7. Each set Vi defines a stage in the graph. Every path from ‘s’ to ‘t’ starts in stage-1, goes to stage-2
then to stage-3, then to stage-4, and so on, and terminates in stage-k.
8. This MULISTAGE GRAPH problem can be solved in 2 ways.
a) Forward Method.
b) Backward Method.
FORWARD METHOD
PROCEDURE:
V1 V2 V3 V4 V5
4 6
2 2
5 4
9 1
4
7 3 2
7 t
s
3
11 5 5
2
11 6
Maintain a cost matrix cost (n) which stores the distance from any vertex to the destination.
If a vertex is having more than one path, then we have to choose the minimum distance path and the
intermediate vertex, which gives the minimum distance path, will be stored in the distance array
‘D’.
In this way we will find out the minimum cost path from each and every vertex.
Finally cost(1) will give the shortest distance from source to destination.
For finding the path, start from vertex-1 then the distance array D(1) will give the minimum cost
neighbour vertex which in turn give the next nearest vertex and proceed in this way till we reach the
Destination.
For a ‘k’ stage graph, there will be ‘k’ vertex in the path.
In the above graph V1…V5 represent the stages. This 5 stage graph can be solved by using forward
approach as follows,
STEPS: - DESTINATION, D
cost(6) = 7 =>D(6) = 10
cost(2) = 7 =>D(2) = 7
The path through which you have to find the shortest distance.
(i.e.)
D ( 1) = 2
D ( 2) = 7
B.BalaKonda reddy KKCITE Page 22
Design and Analysis of Algorithms Computer Science and Engg.
D ( 7) = 10
D (10) = 12
9 2 3 2
P[1]=1;
P[k]=n;
For j=2 to k-1 do
P[j]=d[p[j-1]];
}
ANALYSIS:
The time complexity of this forward method is O( V + E )
BACKWARD METHOD
if there one ‘K’ stages in a graph using back ward approach. we will find out the cost of each &
every vertex starting from 1st
stage to the kth stage.
We will find out the minimum cost path from destination to source (ie)[from stage k to stage 1]
PROCEDURE:
STEP:
cost(6) = 9 =>D(6)=3
cost(7) = 11 =>D(7)=2
cost(8) = 10 =>D(8)=2
cost(9) = 15 =>D(9)=6
cost(11) = 16 =>D(11)=8
cost(12)=min(c(9,12)+cost(9),c(10,12)+cost(10),c(11,12)+cost(11))
=min(19,16,21)
cost(12) = 16 =>D(12)=10
PATH:
d[j] =r;
}
// find a minimum cost path.
P[1]=1;
P[k]=n;
For j= k-1 to 2 do
P[j]=d[p[j+1]];
}
Let G(V,E) be a directed graph with edge cost c ij is defined such that cij >0 for all i and j and c ij
= ,if <i,j> E.
Let V =n and assume n>1.
The traveling salesman problem is to find a tour of minimum cost.
A tour of G is a directed cycle that include every vertex in V.
The cost of the tour is the sum of cost of the edges on the tour.
The tour is the shortest path that starts and ends at the same vertex (ie) 1.
APPLICATION :
1. Suppose we have to route a postal van to pick up mail from the mail boxes located at ‘n’ different
sites.
2. An n+1 vertex graph can be used to represent the situation.
3. One vertex represent the post office from which the postal van starts and return.
4. Edge <i,j> is assigned a cost equal to the distance from site ‘i’ to site ‘j’.
5. the route taken by the postal van is a tour and we are finding a tour of minimum length.
6. every tour consists of an edge <1,k> for some k V-{} and a path from vertex k to vertex 1.
7. the path from vertex k to vertex 1 goes through each vertex in V-{1,k} exactly once.
8. the function which is used to find the path is
1. Find g(i,) =ci1, 1<=i<n, hence we can use equation(2) to obtain g(i,s) for all s to size 1.
2. That we have to start with s=1,(ie) there will be only one vertex in set ‘s’.
10
15
10
15
20 8 9 13
8 6
12
7
Cost matrix
0 10 15 20
5 0 9 10
6 13 0 12
8 8 9 0
starting position
STEP 1:
g(1,{2,3,4})=min{c12+g(2{3,4}),c13+g(3,{2,4}),c14+g(4,{2,3})}
min{10+25,15+25,20+23}
min{35,35,43}
=35
STEP 2:
g(2,{3,4}) = min{c23+g(3{4}),c24+g(4,{3})}
min{9+20,10+15}
min{29,25}
=25
g(3,{2,4}) =min{c32+g(2{4}),c34+g(4,{2})}
min{13+18,12+13}
min{31,25}
=25
g(4,{2,3}) = min{c42+g(2{3}),c43+g(3,{2})}
min{8+15,9+18}
min{23,27}
=23
STEP 3:
STEP 4:
g{4,} =c41 = 8
g{3,} =c31 = 6
g{2,} =c21 = 5
s = 0.
i =1 to n.
s =1
i =2 to 4
s =2
i 1, 1 s and i s.
g(2,{3,4}) = min{c23+g(3{4}),c24+g(4,{3})}
min{9+20,10+15}
min{29,25}
=25
B.BalaKonda reddy KKCITE Page 29
Design and Analysis of Algorithms Computer Science and Engg.
g(3,{2,4}) =min{c32+g(2{4}),c34+g(4,{2})}
min{13+18,12+13}
min{31,25}
=25
g(4,{2,3}) = min{c42+g(2{3}),c43+g(3,{2})}
min{8+15,9+18}
min{23,27}
=23
s = 3
g(1,{2,3,4})=min{c12+g(2{3,4}),c13+g(3,{2,4}),c14+g(4,{2,3})}
min{10+25,15+25,20+23}
min{35,35,43}
=35
optimal cost is 35
This problem is similar to ordinary knapsack problem but we may not take a fraction of an object.
We are given ‘ N ‘ object with weight Wi and profits Pi where I varies from l to N and also a
knapsack with capacity ‘ M ‘.
The problem is, we have to fill the bag with the help of ‘ N ‘ objects and the resulting profit has to
be maximum.
n
Where Xi are constraints on the solution Xi {0,1}. (u) Xi is required to be 0 or 1. if the object is
selected then the unit in 1. if the object is rejected than the unit is 0. That is why it is called as 0/1,
knapsack problem.
To solve the problem by dynamic programming we up a table T[1…N, 0…M] (ic) the size is N.
where ‘N’ is the no. of objects and column starts with ‘O’ to capacity (ic) ‘M’.
In the table T[i,j] will be the maximum valve of the objects i varies from 1 to n and j varies from O
to M.
If i=l and j < w(i) then T(i,j) =o, (ic) o pre is filled in the table.
If i=l and j w (i) then T (i,j) = p(i), the cell is filled with the profit p[i], since only one object can
be selected to the maximum.
If i>l and j < w(i) then T(i,l) = T (i-l,j) the cell is filled the profit of previous object since it is not
possible with the current object.
If i>l and j w(i) then T (i,j) = {f(i) +T(i-l,j-w(i)),. since only ‘l’ unit can be selected to the
maximum. If is the current profit + profit of the previous object to fill the remaining capacity of the
bag.
Start with the last position of i and j, T[i,j], if T[i,j] = T[i-l,j] then no object of ‘i’ is required so
move up to T[i-l,j].
After moved, we have to check if, T[i,j]=T[i-l,j-w(i)]+ p[I], if it is equal then one unit of object ‘i’ is
selected and move up to the position T[i-l,j-w(i)]
Repeat the same process until we reach T[i,o], then there will be nothing to fill the bag stop the
process.
Consider a Example,
M = 6,
N=3
W1 = 2, W2 = 3, W3 = 4
P1 = 1, P2 =2, P3 = 5
i 1 to N
j 0 to 6
o<2 T1,o =0
i=l, j=2
2 o,= T1,2 = l.
i=l, j=3
3>2,= T1,3 = l.
i=l, j=4
4>2,= T1,4 = l.
i=l, j=5
5>2,= T1,5 = l.
i=l, j=6
6>2,= T1,6 = l.
i=2, j=1
l<3= T(2,1) = T(i-l)
T 2,1 =0