Science Fair Projects Ideas - Euclidean minimum spanning tree

All Science Fair Projects

      

Science Fair Project Encyclopedia for Schools!

  Search    Browse    Forum  Coach    Links    Editor    Help    Tell-a-Friend    Encyclopedia    Dictionary     

Science Fair Project Encyclopedia

For information on any area of science that interests you,
enter a keyword (eg. scientific method, molecule, cloud, carbohydrate etc.).
Or else, you can start by choosing any of the categories below.

Euclidean minimum spanning tree


The Euclidean minimum spanning tree or EMST is a minimum spanning tree of a set of points in the plane (or more generally in \Bbb{R}^n), where the weight of the edge between each pair of points is the distance between those two points. In simpler terms, an EMST connects a set of dots using lines such that the total length of all the lines is minimized and any dot can be reached from any other by following the lines. This is much like a more difficult version of the child's game connect-the-dots .

Contents

Algorithms for computing EMSTs

The simplest algorithm to find an EMST, given n points, is to actually construct the complete graph on n vertices, which has n(n - 1) edges, compute each edge weight by finding the distance between each pair of points, and then run a standard minimum spanning tree algorithm on it. Since this graph has O(n2) edges, constructing it already requires O(n2) time; even using a relatively simple minimum spanning tree algorithm such as Prim's algorithm with a Fibonacci heap requires only O(n2) additional time to find the minimum spanning tree of this graph, bringing the total time to O(n2) time. This solution also requires O(n2) space to store all the edges.

A better approach to finding the EMST in a plane is to note that it is a subgraph of every Delaunay triangulation of the n points, a much-reduced set of edges. Although this is not easy to see (a proof is given in the next section), if we are willing to accept it, the algorithm for finding the EMST is clear:

  1. Compute the Delaunay triangulation, which, using a simple randomized algorithm, requires only O(n log n) expected time and O(n) expected space. Because the Delaunay triangulation is a planar graph, and there are no more than 3 times as many edges as vertices in any planar graph, this generates only O(n) edges.
  2. Label each edge with its length.
  3. Run Prim's algorithm (any variant) on it to find a minimum spanning tree. Since there are O(n) edges, this requires at most O((n+n)log n) or O(n log n) time.

The final result is an algorithm taking O(n log n) expected time and expected O(n) space.

The problem can also be generalized to n points in the d-dimensional space \Bbb{R}^d. The first algorithm listed above solves this problem in O(dn2) time, if we use the d-dimensional distance function, and the second algorithm requires the same amount of time in the worst-case, since Delaunay triangulations can have Ω(n2) edges in higher dimensions. In the 1996 paper "A lower bound for randomized algebraic decision trees," Dima Grigoriev and others proved that this general problem requires at least Ω(nlog n) time for n points, but no approach to date has met this lower bound; the best solutions are close to Ω(n2) for large d.

Subset of Delaunay triangulation

The most popular efficient algorithm for computing the EMST depends on the fact that its edges are a subset of the edges in every Delaunay triangulation of the points. While this isn't clear, it's not difficult to prove.

First, there is a useful property about minimum spanning trees that we will use: if we have a cycle of points, such as v1v2v3v4v1, it may so happen that one of the edges between adjacent points in this cycle will have a larger weight than any other. If so, this edge will not be used in any minimum spanning tree.

Second, we'll also be using a property of Delaunay triangulations: if there is a circle with two of the input points on its boundary which contains no other input points, the line between those two points is an edge of every Delaunay triangulation.

Now, suppose we take an edge e between two input points p and q which is not an edge of a Delaunay triangulation. Then, the circle C with e as its diameter must contain some other point r (or else it would prove that e is in a Delaunay triangulation). But then r is closer to both p and q than they are to each other, and so the edge from p to q is the longest edge in the cycle of points pqrp. Therefore, that edge is not in the EMST.

Having shown that every edge not in the Delaunay triangulation is also not in the EMST, the contrapositive follows: that every edge in the EMST is in the Delaunay triangulation.

Expected size

The expected size of the EMST for large numbers of points has been determined by J. Michael Steele . If f is the density of the probability function for picking points, then for large n and d \neq 1 the size of the EMST is approximately

c(d) n^{\frac{d-1}{d}} \int_{\Bbb{R}^d} f(x)^{\frac{d-1}{d}} dx

where c(d) is a constant depending only on the dimension d. The exact value of the constants are unknown but can be estimated from empirical evidence.

Applications

An obvious application of Euclidean minimum spanning trees is to find the cheapest network of wires or pipes to connect a set of places, assuming the links cost a fixed amount per unit length. However, while these give an absolute lower bound on the amount of connection needed, most such networks prefer a k-connected graph to a tree, so that failure of an any individual link will not split the network into parts.

Another application of EMSTs is to approximating the Traveling Salesman Problem on a set of points obeying the triangle inequality, such as a set of points in the plane. This realistic variation of the problem can be solved within a factor of 2 by computing the EMST, doing a walk along its boundary which outlines the entire tree, and then removing any duplicate vertices from this walk.

References

External links

  • Leda, a C++ commercial algorithm library with the ability to compute and draw EMSTs
11-30-2008 18:11:33
The contents of this article is licensed from www.wikipedia.org under the GNU Free Documentation License. Click here to see the transparent copy and copyright details
Science kits, science lessons, science toys, maths toys, hobby kits, science games and books - these are some of many products that can help give your kid an edge in their science fair projects, and develop a tremendous interest in the study of science. When shopping for a science kit or other supplies, make sure that you carefully review the features and quality of the products. Compare prices by going to several online stores. Read product reviews online or refer to magazines.

Start by looking for your science kit review or science toy review. Compare prices but remember, Price $ is not everything. Quality does matter.
Science Fair Coach
What do science fair judges look out for?
ScienceHound
Science Fair Projects for students of all ages
All Science Fair Projects.com Site
All Science Fair Projects Homepage
Search | Browse | Links | From-our-Editor | Books | Help | Contact | Privacy | Disclaimer | Copyright Notice