Sixth Data - Data storage and computing engine
+Sixth Data - Data storage and computing engine
+Table of Contents
+ +1 General
+1 General
- This program is free software: released under Creative Commons Zero -(CC0) license - +(CC0) license
- Program author:
-
-
- Svjatoslav Agejenko - -
- Homepage: https://svjatoslav.eu - -
- Email: mailto://svjatoslav@svjatoslav.eu - -
+ - Svjatoslav Agejenko +
- Homepage: https://svjatoslav.eu +
- Email: mailto://svjatoslav@svjatoslav.eu +
1.1 Source code
+1.1 Source code
-
-
- Download latest snapshot in TAR GZ format - +
- Download latest snapshot in TAR GZ format -
- Browse Git repository online - +
- Browse Git repository online -
- Clone Git repository using command: +
+Clone Git repository using command: +
git clone https://www2.svjatoslav.eu/git/sixth-data.git +
-
-
-
-- See JavaDoc. - +
- See JavaDoc.
2 Vision / goal
+2 Vision / goal
Provide hackable, versioned, optimized, distributed, geometrical, -arbitrary dimensional (hypercube based) data storage and computation -engine (as inspired by the brain) for general purpose visual computing +arbitrary dimensional (hypercube based) data storage and computation +engine (as inspired by the brain) for general purpose visual computing environment called Sixth.
@@ -261,49 +342,42 @@ language it would be used to provide -3 Inspiration
+3 Inspiration
-3.1 Brain
+3.1 Brain
- Brain appears to be natural geometrical/parallel data storage and computational engine: - +
- https://www.quantamagazine.org/the-brain-maps-out-ideas-and-memories-like-spaces-20190114/ +
3.2 CM-1 Connection Machine
+3.2 CM-1 Connection Machine
https://en.wikipedia.org/wiki/Connection_Machine @@ -321,53 +395,48 @@ computational units into virtual 5D network. See:
we can pre-distribute data across computation units and perform -parallel geometrical computation. +parallel geometrical computation.
4 Reasons for hypercube as a so called first class citizen
+4 Reasons for hypercube as a so called first class citizen
- Hypercube is quite general purpose data structure that naturally -encapsulates wide variety data and problems. - +encapsulates wide variety data and problems. -
- Nicely captures apparent properties of the brain. - +
- Nicely captures apparent properties of the brain. -
- Naturally supports distributed and parallel geometrical data storage -and computation. - +
- Naturally supports distributed and parallel geometrical data storage +and computation. -
- Dedicated hardware like CM-1 can be built around hypercube concept +
- Dedicated hardware like CM-1 can be built around hypercube concept that results in data, computation process and hardware, all beautifully fitting together while complementing each other -strengths. - +strengths.
- Hypercube stored data (and computation process) has geometry by its nature and should fit nicely with "3D first" user interface ideology -of the parent Sixth project. - +of the parent Sixth project.
5 Geometrical computation idea
+5 Geometrical computation idea
5.1 Distributed computation and data storage
+5.1 Distributed computation and data storage
Lots of problems can be translated to geometry (use any shapes and as many dimensions as you need). Solution(s) to such problems could be then found via geometrical search/comparison/lookup results. As a -bonus, such geometrical *data storage* AND *computation* can be -naturally made in *parallel* and *distributed*. +bonus, such geometrical data storage AND computation can be +naturally made in parallel and distributed.
@@ -377,8 +446,8 @@ part). Question answering is making (relatively simple) lookups
5.2 Mapping hypercube to object-oriented model and relational database
+5.2 Mapping hypercube to object-oriented model and relational database
Object oriented programming is inspired by the way human mind @@ -399,42 +468,34 @@ class declares 4 variables for an object, then corresponding object can be stored as a single point inside 4 dimensional hypercube. Variable values translate to point coordinates in that hypercube. That is: numbers and string can be translated to linear -value that can be used as a coordinate along particular dimension. - +value that can be used as a coordinate along particular dimension.
5.3 Mapping entity relations in hypercube
+5.3 Mapping entity relations in hypercube
Consider we want to create database of:
-
-
- Books. - -
- Authors. - +
- Books. +
- Authors.
- Effort: Amount of time contributed by every author to every book -that he/she wrote. - +that he/she wrote.
Information above can be represented as 3D cube where dimensions are:
-
-
- X: Book - -
- Y: Author - -
- Z: Effort - +
- X: Book +
- Y: Author +
- Z: Effort
@@ -444,149 +505,118 @@ between authors and the books.
6 Current status
+6 Current status
-
-
- More or less defined Vision / goal. - +
- More or less defined Vision / goal. -
- Collected some inspiring ideas. - +
- Collected some inspiring ideas.
- Implemented very simple persistent key-value map.
- Long term goal is to use it as a backing storage engine and implement more advanced features on top of this via layered -architecture. - -
+architecture.
+
7 See also
+7 See also
Interesting or competing projects with good ideas:
-
-
- ChrysaLisp +
- flexible user interface building for interacting with different data
-
-
- Assembler/C-Script/Lisp 64 bit, MIMD, multi CPU, multi threaded, -multi core, multi user Parallel OS. With GUI, Terminal, OO -Assembler, Class libraries, C-Script compiler, Lisp interpreter, -Debugger, and more… - -
-
- - CM-1 Connection Machine - - -
- Gemstone/S
-
-
-
- Completely distributed smalltalk based computing -system. - -
-
- Glamorous Toolkit
- Moldable development environment. It is a live notebook. It is a flexible search interface. It is a fancy code editor. It is a software analysis platform. It is a data visualization engine. All -in one. - +in one.
+
7.1 Computation on multi dimensional data
+-
-
- database in the form of a knowledge graph that uses machine -reasoning to simplify data processing challenges for AI -applications. https://grakn.ai/ - +
- Array Databases: Concepts, Standards, Implementations + +
- TileDB
+
-
+
- Analyze and share complex multi-dimensional data at scale +
+
+ - CM-1 Connection Machine
7.2 Distributed, reliable, parallel computing systems
+-
+
- ChrysaLisp
+
-
+
- Assembler/C-Script/Lisp 64 bit, MIMD, multi CPU, multi threaded, +multi core, multi user Parallel OS. With GUI, Terminal, OO +Assembler, Class libraries, C-Script compiler, Lisp interpreter, +Debugger, and more… +
+
+ - Gemstone/S
+
-
+
- Completely distributed smalltalk based computing +system. +
- http://phantomos.org/
- Programs run forever. System crash or reboot does not destroy -state of running program. - -
+state of running program.
+
-
-
- Multi-user object database for Squeak - -
-
-
- Completely distributed operating system/virtual machine: - +
- Completely distributed operating system/virtual machine: +
7.3 Rules based machine reasoning
+-
-
- Analyze and share complex multi-dimensional data at scale - -
-
+
- database in the form of a knowledge graph that uses machine +reasoning to simplify data processing challenges for AI +applications. https://grakn.ai/ +
Created: 2021-03-16 Tue 20:48
+ +