Graph database integrations (2020)

LynxKite is not a graph database. As a batch analytical tool it complements graph databases nicely. A graph database can answer interactive queries and handle updates. LynxKite can train models on the whole graph or compute metrics for every vertex.

While integration is already possible via standard data formats, we plan to make this even simpler by adding direct import/export functionality for popular graph databases. We are also considering deeper integrations and partnerships.

Status: (2020-01-07) Import from Neo4j is done. Not started on others yet.

Improved exploration UI (2020)

LynxKite already has a solid UI for exploring graphs. We plan to look into how to make it even better.

Status: (2020-04-27) Good improvements in LynxKite 3.2.0. The large-scale redesign is in planning stage.

NetworKit Integration (2020)

Thanks to Sphynx (see below), it is now possible to access functionality from third party libraries from LynxKite. NetworKit, a performant and rich graph library is a great candidate for integration. With this project we hope to further extend the breadth of graph algorithms available to LynxKite users.

Status: (2020-04-27) Not yet started.

Open-sourcing (2020 Q2)

We are currently in the legal and technical process of open-sourcing LynxKite.

Status: (2020-06-24) LynxKite 4.0 is released open source!

Better progress indicator (2020)

LynxKite has two systems for indicating progress already. But neither gives you a clear indication of how much work has been completed and how much more you have to wait. We want to address this.

Status: (2020-04-27) Much improved in LynxKite 3.2.1. Depending on feedback we may either stop or continue.

Performance improvements (2020 Q1)

Apache Spark is great for distributed computations on large datasets. It is not a great fit for quick computations on small and medium-size datasets. We are addressing that with Sphynx, our next-generation backend. Sphynx is a high-performance graph analytics execution engine written in Go. Starting from version 3.0, LynxKite can choose to execute operations either on Apache Spark or on Sphynx.

The Sphynx implementations have a number of advantages:

Over the quarter we will add Sphynx implementations for more and more operations. We have seen 100× speedups in laboratory benchmarks. We cannot add Sphynx implementations for every operation. But our goal is to cover enough operations that the user experience is vastly improved.

Status: (2020-04-27) Released in LynxKite 3.2.0.

Wizards (2020 Q1)

New users cannot build complex workspaces in LynxKite. But that is where LynxKite’s power shines. We are building “wizards” to put the power of complex workspaces at the fingertips of new users.

Wizard mockup

Advanced users can turn their workspaces into easy-to-use wizards. LynxKite will also come bundled with a selection of useful wizards.

Status: (2020-04-27) Released in LynxKite 3.2.0.

Automatic attribute prediction from graphs (2020 Q1)

A common data science project is supervised learning: figuring out the attributes of people or other entities based on a smaller number of known examples. This gets more complicated on graphs, because we cannot consider the vertices of the graph individually. We have to work with the graph as a whole to make the most of the connections between vertices.

LynxKite has a number of tools already for solving such node attribute prediction problems. We want to make it easier to use these tools by offering a fully automated workflow. Just take a CSV, choose what you want to predict, and everything happens automatically.

Status: (2020-04-27) Released on

Steiner tree optimization (2020 Q1)

Graphs often feature in combinatorial optimization problems. The Prize-Collecting Steiner Tree Problem is of practical importance for finding minimum-cost fiber network layouts. We are adding it to LynxKite.

Status: (2020-04-27) Released in LynxKite 3.2.0.

Graph neural networks (2020 Q1)

We have invested in neural networks that work with graphs from the start. We believe they are key to finding patterns in densely connected datasets. Our immediate next step is to add two important GNN applications to LynxKite: node embedding and missing attribute prediction. Expect to see more graph AI capabilities coming down the road.

Status: (2020-04-27) Released in LynxKite 3.2.0.