Skip to content

Latest commit

 

History

History
141 lines (116 loc) · 5.89 KB

README.md

File metadata and controls

141 lines (116 loc) · 5.89 KB

The GoSearch Programming Language

GoSearch is a dialect of the Go programming language which implements the search keyword.

Given two facts:

  • The node from which no decisions have yet been made (henceforth known as the First Choice Generator, or FCG)
  • A block of code which returns a generator channel that defines iteration of the children of a given node.

...then any depth-first search may be conducted on the user's behalf.

Given either/or both of the following facts:

  • A block of code that accepts a slice of user defined type and a candidate node and returns a boolean indicating whether that node is grounds for rejection of that subbranch.
  • A block of code that accepts a slice of user defined type and returns a boolean indicating whether a complete solution has been found.

...then any backracking algorithm may be conducted on the user's behalf.

The structure of a search block is as follows:

search FCG; typeof(FCG)[; Concurrency Level] {
	children:
		...
	[
	accept:
		...
	reject:
		...
	]
}

Where Concurrency Level is an optional integer value > 0 and defines the number of concurrent subbranch searches allowed by the search engine. accept and reject are optional code blocks intended for defining bactracking algorithms.

The type of node that this algorithm is searching through is required merely due to a technical difficulty in implementing this feature on my own with no real access to the Go compiler maintainers.

Where did this come from?

This code was a constructive demonstration of my master's thesis. The thesis pointed out that languages (such as Prolog) offered search as a first class citizen, but that these languages were often considered obscure and too scientific (...such as Prolog). The research attempts to bring easy to implement, and efficient, graph search to imperative/procedural programming languages in such a way that no programmer would ever dread such algorithms again.

Building

The following downloads a bootstrap compiler, compiles this compiler, and runs the motivating_example.

git clone [email protected]:christopher-henderson/GoSearch.git
cd GoSearch
mkdir bootstrap
cd bootstrap
# Replace this with a different architecture or platform as is necessary.
curl -O -L https://golang.org/dl/go1.10.2.linux-amd64.tar.gz
tar zxvf go1.10.2.linux-amd64.tar.gz
export GOROOT_BOOTSTRAP=$(pwd)/go
cd ../src
./make.bash
cd ..
bin/go run motivating_example.go

If you are on Mac, then you can find the appropriate bootstrap compiler at go1.10.2.darwin-amd64.tar.gz.

The target compiler will be available at GoSearch/bin/go.

Motivating Examples

A small collection of examples may be found in the examples directory. The following is a complete implementation of the NQueens problem, solved using a number of goroutines equal to the number of CPUs available to the system at runtime:

package main

import (
	"log"
	"time"
	"runtime"
)

type Queen struct {
	Column int
	Row int
}

func NQueens(N int) {
	search Queen{0, 0}; Queen; runtime.NumCPU() {
	children:
		column := node.Column + 1
		c := make(chan Queen, 0)
		// If the parent is in the final column
		// then there are no children.
		if column > N {
			close(c)
			return c
		}
		go func() {
			defer close(c)
			for r := 1; r < N+1; r++ {
				c <- Queen{column, r}
			}
		}()
		return c
	accept:
		if len(solution) == N {
			// stdout is expensive, so you
			// can get a hefty speedup by
			// commenting this out.
			log.Println(solution)
			return true
		}
		return false
	reject:
		row, column := node.Row, node.Column
		for _, q := range solution {
			r, c := q.Row, q.Column
			if row == r ||
				column == c ||
				row+column == r+c ||
				row-column == r-c {
				return true
			}
		}
		return false
	}
}

func main() {
	log.SetFlags(log.Lshortfile)
	s := time.Now()
	NQueens(8)
	log.Println(time.Now().Sub(s))
}

Efficiency

@TODO pull richer explanation and comparison from the paper. Faster than most, but not the fastest. It loses to Google Optimization Research labs. The Rust version, however, spanks Google OR.

  • Search and backtracking are stack powered algorithms. This engine allocates that stack to the heap and manages it manually rather than relying on the call stack.
  • The concurrent graph search is implemented by a naive global round robin work stealing algorithm, where each time work is stolen a copy of the current solution is incurred. This causes concurrent speedups to be sublinear with respect to number of physical CPUs available (for example, 3.80x speedup for a 4 cores).
  • No additional time complexity beyond that of the user's target algorithm complexity is incured. The exception is the cost of copying the solution data structure.
  • If you want a much - MUCH - faster (30x faster on my laptop for NQueens = 15) version of this code see RustSearch.

Compatibility

This compiler is based off of the Go project as of release 1.10.2 (71bdbf431b79dff61944f22c25c7e085ccfc25d5). Due to the reservation of search, children, accept, and reject in the language the API with standard Go is broken. The ABI, however, remains intact. The result is that code written in the GoSearch dialect may be compiled by this project and then later linked into normal Go using the standard compiler.

Tests

The Go testing framework uses the go/parser pacakge, which is more-or-less a mirror of the parser package internal to the compiler. I'm not exactly sure why, but it's easy to imagine that it's trying to give richer errors or isolation in a testing environment. I haven't gotten to updating the go/parser package with this new syntax, so as a result unit tests to not currently work in this dialect. If you really want to use this code, submit an issue and I'll give it go so that we can all test.