| Copyright | (c) Andrey Mokhov 2016-2021 |
|---|---|
| License | MIT (see the file LICENSE) |
| Maintainer | andrey.mokhov@gmail.com |
| Stability | experimental |
| Safe Haskell | None |
| Language | Haskell2010 |
Algebra.Graph.NonEmpty
Description
Alga is a library for algebraic construction and manipulation of graphs in Haskell. See this paper for the motivation behind the library, the underlying theory, and implementation details.
This module defines the data type Graph for algebraic graphs that are known
to be non-empty at compile time. To avoid name clashes with Algebra.Graph,
this module can be imported qualified:
import qualified Algebra.Graph.NonEmpty as NonEmpty
The naming convention generally follows that of Data.List.NonEmpty: we use
suffix 1 to indicate the functions whose interface must be changed compared
to Algebra.Graph, e.g. vertices1.
Synopsis
- data Graph a
- toNonEmpty :: Graph a -> Maybe (Graph a)
- vertex :: a -> Graph a
- edge :: a -> a -> Graph a
- overlay :: Graph a -> Graph a -> Graph a
- overlay1 :: Graph a -> Graph a -> Graph a
- connect :: Graph a -> Graph a -> Graph a
- vertices1 :: NonEmpty a -> Graph a
- edges1 :: NonEmpty (a, a) -> Graph a
- overlays1 :: NonEmpty (Graph a) -> Graph a
- connects1 :: NonEmpty (Graph a) -> Graph a
- foldg1 :: (a -> b) -> (b -> b -> b) -> (b -> b -> b) -> Graph a -> b
- isSubgraphOf :: Ord a => Graph a -> Graph a -> Bool
- (===) :: Eq a => Graph a -> Graph a -> Bool
- size :: Graph a -> Int
- hasVertex :: Eq a => a -> Graph a -> Bool
- hasEdge :: Eq a => a -> a -> Graph a -> Bool
- vertexCount :: Ord a => Graph a -> Int
- edgeCount :: Ord a => Graph a -> Int
- vertexList1 :: Ord a => Graph a -> NonEmpty a
- edgeList :: Ord a => Graph a -> [(a, a)]
- vertexSet :: Ord a => Graph a -> Set a
- edgeSet :: Ord a => Graph a -> Set (a, a)
- path1 :: NonEmpty a -> Graph a
- circuit1 :: NonEmpty a -> Graph a
- clique1 :: NonEmpty a -> Graph a
- biclique1 :: NonEmpty a -> NonEmpty a -> Graph a
- star :: a -> [a] -> Graph a
- stars1 :: NonEmpty (a, [a]) -> Graph a
- tree :: Tree a -> Graph a
- mesh1 :: NonEmpty a -> NonEmpty b -> Graph (a, b)
- torus1 :: NonEmpty a -> NonEmpty b -> Graph (a, b)
- removeVertex1 :: Eq a => a -> Graph a -> Maybe (Graph a)
- removeEdge :: Eq a => a -> a -> Graph a -> Graph a
- replaceVertex :: Eq a => a -> a -> Graph a -> Graph a
- mergeVertices :: (a -> Bool) -> a -> Graph a -> Graph a
- splitVertex1 :: Eq a => a -> NonEmpty a -> Graph a -> Graph a
- transpose :: Graph a -> Graph a
- induce1 :: (a -> Bool) -> Graph a -> Maybe (Graph a)
- induceJust1 :: Graph (Maybe a) -> Maybe (Graph a)
- simplify :: Ord a => Graph a -> Graph a
- sparsify :: Graph a -> Graph (Either Int a)
- sparsifyKL :: Int -> Graph Int -> Graph
- box :: Graph a -> Graph b -> Graph (a, b)
Non-empty algebraic graphs
Non-empty algebraic graphs, which are constructed using three primitives:
vertex, overlay and connect. See module Algebra.Graph for algebraic
graphs that can be empty.
We define a Num instance as a convenient notation for working with graphs:
0 ==vertex0 1 + 2 ==overlay(vertex1) (vertex2) 1 * 2 ==connect(vertex1) (vertex2) 1 + 2 * 3 ==overlay(vertex1) (connect(vertex2) (vertex3)) 1 * (2 + 3) ==connect(vertex1) (overlay(vertex2) (vertex3))
Note: the signum method of the type class Num cannot be implemented and
will throw an error. Furthermore, the Num instance does not satisfy several
"customary laws" of Num, which dictate that fromInteger 0 and
fromInteger 1 should act as additive and multiplicative identities, and
negate as additive inverse. Nevertheless, overloading fromInteger, + and
* is very convenient when working with algebraic graphs; we hope that in
future Haskell's Prelude will provide a more fine-grained class hierarchy for
algebraic structures, which we would be able to utilise without violating any
laws.
The Eq instance satisfies the following laws of non-empty algebraic graphs.
overlayis commutative, associative and idempotent:x + y == y + x x + (y + z) == (x + y) + z x + x == xconnectis associative:x * (y * z) == (x * y) * z
connectdistributes overoverlay:x * (y + z) == x * y + x * z (x + y) * z == x * z + y * z
connectcan be decomposed:x * y * z == x * y + x * z + y * z
connectsatisfies absorption and saturation:x * y + x + y == x * y x * x * x == x * x
When specifying the time and memory complexity of graph algorithms, n will
denote the number of vertices in the graph, m will denote the number of
edges in the graph, and s will denote the size of the corresponding Graph
expression, defined as the number of vertex leaves (note that n <= s). If
g is a Graph, the corresponding n, m and s can be computed as follows:
n ==vertexCountg m ==edgeCountg s ==sizeg
Converting a Graph to the corresponding
AdjacencyMap takes O(s + m * log(m)) time and O(s + m) memory. This is also the
complexity of the graph equality test, because it is currently implemented by
converting graph expressions to canonical representations based on adjacency
maps.
The total order Ord on graphs is defined using size-lexicographic comparison:
- Compare the number of vertices. In case of a tie, continue.
- Compare the sets of vertices. In case of a tie, continue.
- Compare the number of edges. In case of a tie, continue.
- Compare the sets of edges.
Here are a few examples:
vertex1 <vertex2vertex3 <edge1 2vertex1 <edge1 1edge1 1 <edge1 2edge1 2 <edge1 1 +edge2 2edge1 2 <edge1 3
Note that the resulting order refines the isSubgraphOf relation and is
compatible with overlay and connect operations:
isSubgraphOf x y ==> x <= yx <= x + y x + y <= x * y
Instances
Basic graph construction primitives
vertex :: a -> Graph a Source #
Construct the graph comprising a single isolated vertex. An alias for the
constructor Vertex.
hasVertexx (vertex y) == (x == y)vertexCount(vertex x) == 1edgeCount(vertex x) == 0size(vertex x) == 1
edge :: a -> a -> Graph a Source #
Construct the graph comprising a single edge.
edge x y ==connect(vertexx) (vertexy)hasEdgex y (edge x y) == TrueedgeCount(edge x y) == 1vertexCount(edge 1 1) == 1vertexCount(edge 1 2) == 2
overlay :: Graph a -> Graph a -> Graph a Source #
Overlay two graphs. An alias for the constructor Overlay. This is a
commutative, associative and idempotent operation.
Complexity: O(1) time and memory, O(s1 + s2) size.
hasVertexz (overlay x y) ==hasVertexz x ||hasVertexz yvertexCount(overlay x y) >=vertexCountxvertexCount(overlay x y) <=vertexCountx +vertexCountyedgeCount(overlay x y) >=edgeCountxedgeCount(overlay x y) <=edgeCountx +edgeCountysize(overlay x y) ==sizex +sizeyvertexCount(overlay 1 2) == 2edgeCount(overlay 1 2) == 0
overlay1 :: Graph a -> Graph a -> Graph a Source #
Overlay a possibly empty graph (from Algebra.Graph) with a non-empty
graph. If the first argument is empty, the function returns the second
argument; otherwise it is semantically the same as overlay.
Complexity: O(s1) time and memory, and O(s1 + s2) size.
overlay1emptyx == x x /=empty==> overlay1 x y == overlay (fromJust $ toNonEmpty x) y
connect :: Graph a -> Graph a -> Graph a Source #
Connect two graphs. An alias for the constructor Connect. This is an
associative operation, which distributes over overlay and obeys the
decomposition axiom.
Complexity: O(1) time and memory, O(s1 + s2) size. Note that the number
of edges in the resulting graph is quadratic with respect to the number of
vertices of the arguments: m = O(m1 + m2 + n1 * n2).
hasVertexz (connect x y) ==hasVertexz x ||hasVertexz yvertexCount(connect x y) >=vertexCountxvertexCount(connect x y) <=vertexCountx +vertexCountyedgeCount(connect x y) >=edgeCountxedgeCount(connect x y) >=edgeCountyedgeCount(connect x y) >=vertexCountx *vertexCountyedgeCount(connect x y) <=vertexCountx *vertexCounty +edgeCountx +edgeCountysize(connect x y) ==sizex +sizeyvertexCount(connect 1 2) == 2edgeCount(connect 1 2) == 1
overlays1 :: NonEmpty (Graph a) -> Graph a Source #
Overlay a given list of graphs. Complexity: O(L) time and memory, and O(S) size, where L is the length of the given list, and S is the sum of sizes of the graphs in the list.
overlays1 [x] == x
overlays1 [x,y] == overlay x y
connects1 :: NonEmpty (Graph a) -> Graph a Source #
Connect a given list of graphs. Complexity: O(L) time and memory, and O(S) size, where L is the length of the given list, and S is the sum of sizes of the graphs in the list.
connects1 [x] == x
connects1 [x,y] == connect x y
Graph folding
foldg1 :: (a -> b) -> (b -> b -> b) -> (b -> b -> b) -> Graph a -> b Source #
Generalised graph folding: recursively collapse a Graph by
applying the provided functions to the leaves and internal nodes of the
expression. The order of arguments is: vertex, overlay and connect.
Complexity: O(s) applications of the given functions. As an example, the
complexity of size is O(s), since const and + have constant costs.
foldg1vertexoverlayconnect== id foldg1vertexoverlay(flipconnect) ==transposefoldg1 (const1) (+) (+) ==sizefoldg1 (== x) (||) (||) ==hasVertexx
Relations on graphs
isSubgraphOf :: Ord a => Graph a -> Graph a -> Bool Source #
The isSubgraphOf function takes two graphs and returns True if the
first graph is a subgraph of the second.
Complexity: O(s + m * log(m)) time. Note that the number of edges m of a
graph can be quadratic with respect to the expression size s.
isSubgraphOf x (overlayx y) == True isSubgraphOf (overlayx y) (connectx y) == True isSubgraphOf (path1xs) (circuit1xs) == True isSubgraphOf x y ==> x <= y
(===) :: Eq a => Graph a -> Graph a -> Bool infix 4 Source #
Structural equality on graph expressions. Complexity: O(s) time.
x === x == True x + y === x + y == True 1 + 2 === 2 + 1 == False x + y === x * y == False
Graph properties
size :: Graph a -> Int Source #
The size of a graph, i.e. the number of leaves of the expression. Complexity: O(s) time.
size (vertexx) == 1 size (overlayx y) == size x + size y size (connectx y) == size x + size y size x >= 1 size x >=vertexCountx
hasVertex :: Eq a => a -> Graph a -> Bool Source #
Check if a graph contains a given vertex. Complexity: O(s) time.
hasVertex x (vertex y) == (x == y)
edgeList :: Ord a => Graph a -> [(a, a)] Source #
The sorted list of edges of a graph. Complexity: O(s + m * log(m)) time and O(m) memory. Note that the number of edges m of a graph can be quadratic with respect to the expression size s.
edgeList (vertexx) == [] edgeList (edgex y) == [(x,y)] edgeList (star2 [3,1]) == [(2,1), (2,3)] edgeList .edges1==nub.sort.toListedgeList .transpose==sort.mapswap. edgeList
Standard families of graphs
clique1 :: NonEmpty a -> Graph a Source #
The clique on a list of vertices. Complexity: O(L) time, memory and size, where L is the length of the given list.
clique1 [x] ==vertexx clique1 [x,y] ==edgex y clique1 [x,y,z] ==edges1[(x,y), (x,z), (y,z)] clique1 (xs<>ys) ==connect(clique1 xs) (clique1 ys) clique1 .reverse==transpose. clique1
stars1 :: NonEmpty (a, [a]) -> Graph a Source #
The stars formed by overlaying a non-empty list of stars.
Complexity: O(L) time, memory and size, where L is the total size of the
input.
stars1 [(x, [] )] ==vertexx stars1 [(x, [y])] ==edgex y stars1 [(x, ys )] ==starx ys stars1 ==overlays1.fmap(uncurrystar)overlay(stars1 xs) (stars1 ys) == stars1 (xs<>ys)
tree :: Tree a -> Graph a Source #
The tree graph constructed from a given Tree data structure.
Complexity: O(T) time, memory and size, where T is the size of the
given tree (i.e. the number of vertices in the tree).
tree (Node x []) ==vertexx tree (Node x [Node y [Node z []]]) ==path1[x,y,z] tree (Node x [Node y [], Node z []]) ==starx [y,z] tree (Node 1 [Node 2 [], Node 3 [Node 4 [], Node 5 []]]) ==edges1[(1,2), (1,3), (3,4), (3,5)]
mesh1 :: NonEmpty a -> NonEmpty b -> Graph (a, b) Source #
Construct a mesh graph from two lists of vertices. Complexity: O(L1 * L2) time, memory and size, where L1 and L2 are the lengths of the given lists.
mesh1 [x] [y] ==vertex(x, y) mesh1 xs ys ==box(path1xs) (path1ys) mesh1 [1,2,3] ['a', 'b'] ==edges1[ ((1,'a'),(1,'b')), ((1,'a'),(2,'a')) , ((1,'b'),(2,'b')), ((2,'a'),(2,'b')) , ((2,'a'),(3,'a')), ((2,'b'),(3,'b')) , ((3,'a'),(3,'b')) ]
torus1 :: NonEmpty a -> NonEmpty b -> Graph (a, b) Source #
Construct a torus graph from two lists of vertices. Complexity: O(L1 * L2) time, memory and size, where L1 and L2 are the lengths of the given lists.
torus1 [x] [y] ==edge(x,y) (x,y) torus1 xs ys ==box(circuit1xs) (circuit1ys) torus1 [1,2] ['a', 'b'] ==edges1[ ((1,'a'),(1,'b')), ((1,'a'),(2,'a')) , ((1,'b'),(1,'a')), ((1,'b'),(2,'b')) , ((2,'a'),(1,'a')), ((2,'a'),(2,'b')) , ((2,'b'),(1,'b')), ((2,'b'),(2,'a')) ]
Graph transformation
removeVertex1 :: Eq a => a -> Graph a -> Maybe (Graph a) Source #
Remove a vertex from a given graph. Returns Nothing if the resulting
graph is empty.
Complexity: O(s) time, memory and size.
removeVertex1 x (vertexx) == Nothing removeVertex1 1 (vertex2) == Just (vertex2) removeVertex1 x (edgex x) == Nothing removeVertex1 1 (edge1 2) == Just (vertex2) removeVertex1 x>=>removeVertex1 x == removeVertex1 x
replaceVertex :: Eq a => a -> a -> Graph a -> Graph a Source #
The function replaceVertex x y replaces vertex x with vertex y in a
given Graph. If y already exists, x and y will be merged.
Complexity: O(s) time, memory and size.
replaceVertex x x == id replaceVertex x y (vertexx) ==vertexy replaceVertex x y ==mergeVertices(== x) y
mergeVertices :: (a -> Bool) -> a -> Graph a -> Graph a Source #
Merge vertices satisfying a given predicate into a given vertex. Complexity: O(s) time, memory and size, assuming that the predicate takes constant time.
mergeVertices (constFalse) x == id mergeVertices (== x) y ==replaceVertexx y mergeVerticeseven1 (0 * 2) == 1 * 1 mergeVerticesodd1 (3 + 4 * 5) == 4 * 1
splitVertex1 :: Eq a => a -> NonEmpty a -> Graph a -> Graph a Source #
Split a vertex into a list of vertices with the same connectivity. Complexity: O(s + k * L) time, memory and size, where k is the number of occurrences of the vertex in the expression and L is the length of the given list.
splitVertex1 x [x] == id
splitVertex1 x [y] == replaceVertex x y
splitVertex1 1 [0,1] $ 1 * (2 + 3) == (0 + 1) * (2 + 3)
induce1 :: (a -> Bool) -> Graph a -> Maybe (Graph a) Source #
Construct the induced subgraph of a given graph by removing the
vertices that do not satisfy a given predicate. Returns Nothing if the
resulting graph is empty.
Complexity: O(s) time, memory and size, assuming that the predicate takes
constant time.
induce1 (constTrue ) x == Just x induce1 (constFalse) x == Nothing induce1 (/= x) ==removeVertex1x induce1 p>=>induce1 q == induce1 (\x -> p x && q x)
induceJust1 :: Graph (Maybe a) -> Maybe (Graph a) Source #
Construct the induced subgraph of a given graph by removing the vertices
that are Nothing. Returns Nothing if the resulting graph is empty.
Complexity: O(s) time, memory and size.
induceJust1 (vertexNothing) ==NothinginduceJust1 (edge(Justx)Nothing) ==Just(vertexx) induceJust1 .fmapJust==JustinduceJust1 .fmap(\x -> if p x thenJustx elseNothing) ==induce1p
simplify :: Ord a => Graph a -> Graph a Source #
Simplify a graph expression. Semantically, this is the identity function, but it simplifies a given expression according to the laws of the algebra. The function does not compute the simplest possible expression, but uses heuristics to obtain useful simplifications in reasonable time. Complexity: the function performs O(s) graph comparisons. It is guaranteed that the size of the result does not exceed the size of the given expression.
simplify == idsize(simplify x) <=sizex simplify 1===1 simplify (1 + 1)===1 simplify (1 + 2 + 1)===1 + 2 simplify (1 * 1 * 1)===1 * 1
sparsify :: Graph a -> Graph (Either Int a) Source #
Sparsify a graph by adding intermediate Left Int vertices between the
original vertices (wrapping the latter in Right) such that the resulting
graph is sparse, i.e. contains only O(s) edges, but preserves the
reachability relation between the original vertices. Sparsification is useful
when working with dense graphs, as it can reduce the number of edges from
O(n^2) down to O(n) by replacing cliques, bicliques and similar densely
connected structures by sparse subgraphs built out of intermediate vertices.
Complexity: O(s) time, memory and size.
sort.reachablex ==sort.rights.reachable(Rightx) . sparsifyvertexCount(sparsify x) <=vertexCountx +sizex + 1edgeCount(sparsify x) <= 3 *sizexsize(sparsify x) <= 3 *sizex
sparsifyKL :: Int -> Graph Int -> Graph Source #
Sparsify a graph whose vertices are integers in the range [1..n], where
n is the first argument of the function, producing an array-based graph
representation from Data.Graph (introduced by King and Launchbury, hence
the name of the function). In the resulting graph, vertices [1..n]
correspond to the original vertices, and all vertices greater than n are
introduced by the sparsification procedure.
Complexity: O(s) time and memory. Note that thanks to sparsification, the resulting graph has a linear number of edges with respect to the size of the original algebraic representation even though the latter can potentially contain a quadratic O(s^2) number of edges.
sort.reachablek ==sort.filter(<= n) .flipreachablek . sparsifyKL nlength(vertices$ sparsifyKL n x) <=vertexCountx +sizex + 1length(edges$ sparsifyKL n x) <= 3 *sizex
Graph composition
box :: Graph a -> Graph b -> Graph (a, b) Source #
Compute the Cartesian product of graphs. Complexity: O(s1 * s2) time, memory and size, where s1 and s2 are the sizes of the given graphs.
box (path1[0,1]) (path1['a','b']) ==edges1[ ((0,'a'), (0,'b')) , ((0,'a'), (1,'a')) , ((0,'b'), (1,'b')) , ((1,'a'), (1,'b')) ]
Up to isomorphism between the resulting vertex types, this operation is
commutative, associative, distributes over overlay, and has
singleton graphs as identities. Below ~~ stands for equality up to an
isomorphism, e.g. (x, ()) ~~ x.
box x y ~~ box y x box x (box y z) ~~ box (box x y) z box x (overlayy z) ==overlay(box x y) (box x z) box x (vertex()) ~~ xtranspose(box x y) == box (transposex) (transposey)vertexCount(box x y) ==vertexCountx *vertexCountyedgeCount(box x y) <=vertexCountx *edgeCounty +edgeCountx *vertexCounty