Skip to content

A performance-focused library implementing algorithms for simulating network diffusion processes, written in Cython.

License

Notifications You must be signed in to change notification settings

eliotwrobson/CyNetDiff

Repository files navigation

CyNetDiff

PyPI version tests docs PyPI - Python Version Ruff Project Status: Active – The project has reached a stable, usable state and is being actively developed. License: MIT DOI

Network diffusion processes aim to model the spread of trends through social networks, represented using graphs. Experimental work with these models usually involves simulating these processes many times over large graphs, which can be computationally very expensive. To address this, CyNetDiff is a Cython module implementing the independent cascade and linear threshold models. Development has been focused on performance, while still giving an intuitive, high-level interface to assist in research tasks. To learn more about these specific models, read this book chapter or the preliminaries page in the documentation.

Quick Start

Installation

pip install cynetdiff

Note: The installation includes a build step that requires having a C++ complier installed.

Basic Usage

We can run models over graphs we define, using pre-defined weighting schemes. Here is a simple example:

import random
import networkx as nx
from cynetdiff.utils import networkx_to_ic_model

# Randomly generate the graph
n = 1_000
p = 0.05
graph = nx.fast_gnp_random_graph(n, p)

# Randomly choose seed nodes
k = 10
nodes = list(graph.nodes)
seeds = random.sample(nodes, k)

# Set the activation probability uniformly and set seeds
model, _ = networkx_to_ic_model(graph, activation_prob=0.2)
model.set_seeds(seeds)

# Run a single diffusion process until completion
model.advance_until_completion()

# Get the number of nodes activated
model.get_num_activated_nodes()

The output from the last line is the number of nodes activated in a single simulation of the model. To get the average number of activated nodes across n_sim = 1_000 simulations, we can replace the last line in the above with the following:

n_sim = 1_000
total = 0.0

for _ in range(n_sim):
    # Resetting the model doesn't change the initial seed set used.
    model.reset_model()
    model.advance_until_completion()
    total += model.get_num_activated_nodes()

avg = total / n_sim

Advanced Usage

See the documentation.

Project Status

This project is still considered in an alpha stage of development. As such, the API could still change to facilitate easier community adoption.

All feedback is greatly appreciated!

Contributing

Contributions are always welcome! Take a look at the contributing guide.

About

A performance-focused library implementing algorithms for simulating network diffusion processes, written in Cython.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages