Skip to content

awslabs/nx-neptune


nx_neptune

CI Upload Python Package

This project offers a NetworkX-compatible backend for Neptune Analytics, enabling users to offload graph algorithm workloads to AWS with no code changes. By using familiar NetworkX APIs, developers can seamlessly scale graph computations on-demand through Neptune Analytics. This approach combines the simplicity of local development with the performance and scalability of a fully managed AWS graph analytics service.

Supported Algorithms

For details of all supported NetworkX algorithms see algorithms.md

Preview Status: Alpha Release

We're making the nx_neptune plugin library an open-source project, and are releasing it now as an Alpha Preview to the community to gather feedback, and actively collaborate on the project roadmap. We welcome questions, suggestions, and contributions from all community members. At this point in development, the project has not been fully released to the public and is recommended for testing purposes only. We're tracking its production readiness for general availability on the roadmap.

Installation

Install it from PyPI

pip install nx_neptune

Build and install from package wheel

# Package the project from source:
python -m pip wheel -w dist .

# Install with Jupyter dependencies from wheel: 
pip install "dist/nx_neptune-0.4.4-py3-none-any.whl"

Installation

To install the required nx_neptune dependencies:

git clone git@github.com:awslabs/nx-neptune.git
cd nx-neptune

# install from source directly
make install

Prerequisite

Before using this backend, ensure the following prerequisites are met:

AWS IAM Permissions

The IAM role or user accessing Neptune Analytics must have the following permissions:

These permissions are required to read, write, and manage graph data via queries on Neptune Analytics:

  • neptune-graph:ReadDataViaQuery
  • neptune-graph:WriteDataViaQuery
  • neptune-graph:DeleteDataViaQuery

These permissions are required to start/stop a Neptune Analytics graph:

  • neptune-graph:StartGraph
  • neptune-graph:StopGraph

These permissions are required to save/restore a Neptune Analytics snapshot:

  • neptune-graph:CreateGraphSnapshot (for save)
  • neptune-graph:RestoreGraphFromSnapshot (for restore)
  • neptune-graph:DeleteGraphSnapshot (for delete)
  • neptune-graph:TagResource

These permissions are required to import/export between S3 and Neptune Analytics:

  • s3:GetObject (for import)
  • s3:PutObject (for export)
  • s3:ListBucket (for export)
  • s3:DeleteBucket (for delete)
  • kms:Decrypt
  • kms:GenerateDataKey
  • kms:DescribeKey

In Addition to the S3 import/export permissions, to read from/write to an existing S3 Tables datalake:

  • athena:StartQueryExecution
  • athena:GetQueryExecution

The ARN with the above permissions must be added to your environment variables

Python Runtime

  • Python 3.11 is required.
  • Ensure your environment uses Python 3.11 to maintain compatibility with dependencies and API integrations.

Note: As part of the preview status, we are recommending that you run the library using Python 3.11.

Usage

import networkx as nx

G = nx.Graph()
G.add_node("Bill")
G.add_node("John")
G.add_edge("Bill", "John")

r = nx.pagerank(G, backend="neptune")

And run with:

# Set the NETWORKX_GRAPH_ID environment variable
export NETWORKX_GRAPH_ID=your-neptune-analytics-graph-id
python ./nx_example.py

Alternatively, you can pass the NETWORKX_GRAPH_ID directly:

NETWORKX_GRAPH_ID=your-neptune-analytics-graph-id python ./nx_example.py

Without a valid NETWORKX_GRAPH_ID, the examples will fail to connect to your Neptune Analytics instance. Make sure your AWS credentials are properly configured and your IAM role/user has the required permissions (ReadDataViaQuery, WriteDataViaQuery, DeleteDataViaQuery).

Running tests

Unit tests can be run with make, this runs all tests in the test folder:

make test

Integration tests are included in the integ_test folder and run examples against an existing instance of Neptune Analytics, by passing the graph identifier available in the AWS account.

export NETWORKX_GRAPH_ID=g-test12345
make integ-test

You can set BACKEND=False to run the test suite using NetworkX without nx-neptune as the backend.

Jupyter Notebook Integration

For interactive exploration and visualization, you can use the Jupyter notebook integration.

Features

The notebooks directory contains interactive demonstrations of using Neptune Analytics with NetworkX:

Uploading a package wheel

We recommend uploading your package as a wheel to Jupyter Notebooks.

# Package the project from source:
python -m pip wheel -w dist .
# creates dist/nx_neptune-0.4.4-py3-none-any.whl

Installation

To install the required dependencies for the Jupyter notebook (including the Jupyter dependencies):

# Install with Jupyter dependencies from wheel: 
pip install "dist/nx_neptune-0.4.4-py3-none-any.whl[jupyter]"

Running the Jupyter Notebook

A full tutorial is available to run in Neptune Jupyter Notebooks.

To run the Jupyter notebooks:

  1. Set your Neptune Analytics Graph ID as an environment variable:

    export NETWORKX_GRAPH_ID=your-neptune-analytics-graph-id
  2. You will also need to specify the IAM roles that will execute S3 import or export:

    export NETWORKX_ARN_IAM_ROLE=arn:aws:iam::AWS_ACCOUNT:role/IAM_ROLE_NAME
    export NETWORKX_S3_IMPORT_BUCKET_PATH=s://S3_BUCKET_PATH
    export NETWORKX_S3_EXPORT_BUCKET_PATH=s://S3_BUCKET_PATH
  3. Launch Jupyter Notebook:

    jupyter notebook notebooks/
  4. You can also set the Graph ID directly in the notebook using:

    %env NETWORKX_GRAPH_ID=your-neptune-analytics-graph-id

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

About

NetworkX.org backend using Amazon Neptune Analytics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

No packages published

Contributors 7