LynxKite already has a solid UI for exploring graphs. But we have a lot of ideas for improvements. We would like to merge the visualization parameters and the visualization itself into a single rich UI. This would make visualizations in wizards more powerful.
Status: (2021-10-06) In planning stage.
We plan to add import and export boxes for Gremlin-based graph databases, such as Cosmos DB and JanusGraph.
Status: (2021-10-06) Early prototyping.
In the past year our focus was open-sourcing, performance, features, and user experience. A number of our dependencies have made big improvements in the meantime and it’s time to upgrade. We want to upgrade Apache Spark, SBT, Scala, Play Framework, and a number of JavaScript libraries.
Status: (2021-10-06) Released in LynxKite 4.3.0.
Vegas looked like a good choice for defining custom visualizations at the time. But it had no new releases since 2017, and in fact Scala is not a popular option for data science.
We plan to switch to defining visualizations in the very actively maintained Vega-Lite JSON syntax.
This will be a breaking change. There are almost no users of the Vegas syntax. Rewriting custom visualizations manually will be fairly easy.
Status: (2021-10-06) Released in LynxKite 4.3.0.
LynxKite is not a graph database. As a batch analytical tool it complements graph databases nicely. A graph database can answer interactive queries and handle updates. LynxKite can train models on the whole graph or compute metrics for every vertex.
While integration is already possible via standard data formats, we plan to make this even simpler by adding direct import/export functionality for popular graph databases. We are also considering deeper integrations and partnerships.
Status: (2020-10-08) LynxKite 4.1.0 works well with Neo4j. See our blog post. We have no immediate plans for other graph databases, but are open to suggestions.
Thanks to Sphynx (see below), it is now possible to access functionality from third party libraries from LynxKite. NetworKit, a performant and rich graph library is a great candidate for integration. With this project we hope to further extend the breadth of graph algorithms available to LynxKite users.
Status: (2021-02-01) LynxKite 4.2.0 includes 42 algorithms from NetworKit. See our Twitter thread about it.
We are currently in the legal and technical process of open-sourcing LynxKite.
Status: (2020-06-24) LynxKite 4.0 is released open source!
LynxKite has two systems for indicating progress already. But neither gives you a clear indication of how much work has been completed and how much more you have to wait. We want to address this.
Status: (2020-04-27) Much improved in LynxKite 3.2.1. Depending on feedback we may either stop or continue.
Apache Spark is great for distributed computations on large datasets. It is not a great fit for quick computations on small and medium-size datasets. We are addressing that with Sphynx, our next-generation backend. Sphynx is a high-performance graph analytics execution engine written in Go. Starting from version 3.0, LynxKite can choose to execute operations either on Apache Spark or on Sphynx.
The Sphynx implementations have a number of advantages:
Over the quarter we will add Sphynx implementations for more and more operations. We have seen 100× speedups in laboratory benchmarks. We cannot add Sphynx implementations for every operation. But our goal is to cover enough operations that the user experience is vastly improved.
Status: (2020-04-27) Released in LynxKite 3.2.0.
New users cannot build complex workspaces in LynxKite. But that is where LynxKite’s power shines. We are building “wizards” to put the power of complex workspaces at the fingertips of new users.
Advanced users can turn their workspaces into easy-to-use wizards. LynxKite will also come bundled with a selection of useful wizards.
Status: (2020-04-27) Released in LynxKite 3.2.0.
A common data science project is supervised learning: figuring out the attributes of people or other entities based on a smaller number of known examples. This gets more complicated on graphs, because we cannot consider the vertices of the graph individually. We have to work with the graph as a whole to make the most of the connections between vertices.
LynxKite has a number of tools already for solving such node attribute prediction problems. We want to make it easier to use these tools by offering a fully automated workflow. Just take a CSV, choose what you want to predict, and everything happens automatically.
Status: (2020-04-27) Released on try.lynxkite.com.
Graphs often feature in combinatorial optimization problems. The Prize-Collecting Steiner Tree Problem is of practical importance for finding minimum-cost fiber network layouts. We are adding it to LynxKite.
Status: (2020-04-27) Released in LynxKite 3.2.0.
We have invested in neural networks that work with graphs from the start. We believe they are key to finding patterns in densely connected datasets. Our immediate next step is to add two important GNN applications to LynxKite: node embedding and missing attribute prediction. Expect to see more graph AI capabilities coming down the road.
Status: (2020-04-27) Released in LynxKite 3.2.0.