Compute the difference of each data point from their mean vector
mean of dimension 1=47+9+6+10=8mean of dimension 2=47+9+10+6=8mean vector=(8,8)difference of a from mean vector=(−1,−1)difference of b from mean vector=(1,1)difference of c from mean vector=(−2,2)difference of d from mean vector=(2,−2)
He is unlikely to have lung cancer since the probability of him having lung cancer (0.389) is less than the probability of him not having lung cancer (1−0.389=0.611).
(b)
The Bayesian Belief Network can be very difficult to construct
Bayesian Belief Network can be computationally expensive to run for larger and more complex networks
The Bayesian Belief Network cannot contain cyclic relationships
\pagebreak
Q4
(a)
Advantages
Materializations allow for a faster querying time of data
We can query data based on the level of materialization
Disadvantages
Storage space is required for materializing views, which means there can be additional costs
It might take up too much storage space if we materialize too many views
(b)
Iterate the greedy algorithm for k values from 1 to n, until the memory size to materialize the views is larger than the available memory size X. Therefore n would be the largest number of views possible to be materialized within the memory size limit.
Then we choose the value of k, which is the number of views to be materialized, based on the largest benefit, i.e. the largest benefit we could obtain from 1 to n.