Close Menu
  • Home
  • Opinion
  • Region
    • Africa
    • Asia
    • Europe
    • Middle East
    • North America
    • Oceania
    • South America
  • AI & Machine Learning
  • Robotics & Automation
  • Space & Deep Tech
  • Web3 & Digital Economies
  • Climate & Sustainability Tech
  • Biotech & Future Health
  • Mobility & Smart Cities
  • Global Tech Pulse
  • Cybersecurity & Digital Rights
  • Future of Work & Education
  • Trend Radar & Startup Watch
  • Creator Economy & Culture
What's Hot

Chinese language Researchers Develop Semi-Strong-State EV Battery with 620-Mile Vary

March 7, 2026

UL and IMR to design Eire’s first 3D-printed liquid rocket engine

March 7, 2026

AsiaStartupExpo Q1 2026 Highlights the New Actuality of Founder–Investor Dialogue in Asia – KoreaTechDesk

March 7, 2026
Facebook X (Twitter) Instagram LinkedIn RSS
NextTech NewsNextTech News
Facebook X (Twitter) Instagram LinkedIn RSS
  • Home
  • Africa
  • Asia
  • Europe
  • Middle East
  • North America
  • Oceania
  • South America
  • Opinion
Trending
  • Chinese language Researchers Develop Semi-Strong-State EV Battery with 620-Mile Vary
  • UL and IMR to design Eire’s first 3D-printed liquid rocket engine
  • AsiaStartupExpo Q1 2026 Highlights the New Actuality of Founder–Investor Dialogue in Asia – KoreaTechDesk
  • Irish information safety start-up Evervault raises $25m
  • Google Launches TensorFlow 2.21 And LiteRT: Sooner GPU Efficiency, New NPU Acceleration, And Seamless PyTorch Edge Deployment Upgrades
  • Oukitel WP63 Turns a Cellphone Into the Final Out of doors Companion, Full with a Constructed-in Firestarter
  • Samsung Advances Galaxy AI and Its Related Ecosystem at MWC 2026
  • Full-time graduate employment falls once more—for third yr in a row
Saturday, March 7
NextTech NewsNextTech News
Home - AI & Machine Learning - A Manufacturing-Model NetworKit 11.2.1 Coding Tutorial for Massive-Scale Graph Analytics, Communities, Cores, and Sparsification
AI & Machine Learning

A Manufacturing-Model NetworKit 11.2.1 Coding Tutorial for Massive-Scale Graph Analytics, Communities, Cores, and Sparsification

NextTechBy NextTechMarch 7, 2026No Comments5 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
A Manufacturing-Model NetworKit 11.2.1 Coding Tutorial for Massive-Scale Graph Analytics, Communities, Cores, and Sparsification
Share
Facebook Twitter LinkedIn Pinterest Email


On this tutorial, we implement a production-grade, large-scale graph analytics pipeline in NetworKit, specializing in pace, reminiscence effectivity, and version-safe APIs in NetworKit 11.2.1. We generate a large-scale free community, extract the biggest linked part, after which compute structural spine alerts by way of k-core decomposition and centrality rating. We additionally detect communities with PLM and quantify high quality utilizing modularity; estimate distance construction utilizing efficient and estimated diameters; and, lastly, sparsify the graph to cut back price whereas preserving key properties. We export the sparsified graph as an edgelist so we are able to reuse it in downstream workflows, benchmarking, or graph ML preprocessing.

!pip -q set up networkit pandas numpy psutil


import gc, time, os
import numpy as np
import pandas as pd
import psutil
import networkit as nk


print("NetworKit:", nk.__version__)
nk.setNumberOfThreads(min(2, nk.getMaxNumberOfThreads()))
nk.setSeed(7, False)


def ram_gb():
   p = psutil.Course of(os.getpid())
   return p.memory_info().rss / (1024**3)


def tic():
   return time.perf_counter()


def toc(t0, msg):
   print(f"{msg}: {time.perf_counter()-t0:.3f}s | RAM~{ram_gb():.2f} GB")


def report(G, title):
   print(f"n[{name}] nodes={G.numberOfNodes():,} edges={G.numberOfEdges():,} directed={G.isDirected()} weighted={G.isWeighted()}")


def force_cleanup():
   gc.gather()


PRESET = "LARGE"


if PRESET == "LARGE":
   N = 120_000
   M_ATTACH = 6
   AB_EPS = 0.12
   ED_RATIO = 0.9
elif PRESET == "XL":
   N = 250_000
   M_ATTACH = 6
   AB_EPS = 0.15
   ED_RATIO = 0.9
else:
   N = 80_000
   M_ATTACH = 6
   AB_EPS = 0.10
   ED_RATIO = 0.9


print(f"nPreset={PRESET} | N={N:,} | m={M_ATTACH} | approx-betweenness epsilon={AB_EPS}")

We arrange the Colab atmosphere with NetworKit and monitoring utilities, and we lock in a steady random seed. We configure thread utilization to match the runtime and outline timing and RAM-tracking helpers for every main stage. We select a scale preset that controls graph measurement and approximation knobs so the pipeline stays massive however manageable.

t0 = tic()
G = nk.mills.BarabasiAlbertGenerator(M_ATTACH, N).generate()
toc(t0, "Generated BA graph")
report(G, "G")


t0 = tic()
cc = nk.parts.ConnectedComponents(G)
cc.run()
toc(t0, "ConnectedComponents")
print("parts:", cc.numberOfComponents())


if cc.numberOfComponents() > 1:
   t0 = tic()
   G = nk.graphtools.extractLargestConnectedComponent(G, compactGraph=True)
   toc(t0, "Extracted LCC (compactGraph=True)")
   report(G, "LCC")


force_cleanup()

We generate a big Barabási–Albert graph and instantly log its measurement and runtime footprint. We compute linked parts to grasp fragmentation and rapidly diagnose topology. We extract the biggest linked part and compact it to enhance the remainder of the pipeline’s efficiency and reliability.

t0 = tic()
core = nk.centrality.CoreDecomposition(G)
core.run()
toc(t0, "CoreDecomposition")
core_vals = np.array(core.scores(), dtype=np.int32)
print("degeneracy (max core):", int(core_vals.max()))
print("core stats:", pd.Sequence(core_vals).describe(percentiles=[0.5, 0.9, 0.99]).to_dict())


k_thr = int(np.percentile(core_vals, 97))


t0 = tic()
nodes_backbone = [u for u in range(G.numberOfNodes()) if core_vals[u] >= k_thr]
G_backbone = nk.graphtools.subgraphFromNodes(G, nodes_backbone)
toc(t0, f"Spine subgraph (ok>={k_thr})")
report(G_backbone, "Spine")


force_cleanup()


t0 = tic()
pr = nk.centrality.PageRank(G, damp=0.85, tol=1e-8)
pr.run()
toc(t0, "PageRank")


pr_scores = np.array(pr.scores(), dtype=np.float64)
top_pr = np.argsort(-pr_scores)[:15]
print("High PageRank nodes:", top_pr.tolist())
print("High PageRank scores:", pr_scores[top_pr].tolist())


t0 = tic()
abw = nk.centrality.ApproxBetweenness(G, epsilon=AB_EPS)
abw.run()
toc(t0, "ApproxBetweenness")


abw_scores = np.array(abw.scores(), dtype=np.float64)
top_abw = np.argsort(-abw_scores)[:15]
print("High ApproxBetweenness nodes:", top_abw.tolist())
print("High ApproxBetweenness scores:", abw_scores[top_abw].tolist())


force_cleanup()

We compute the core decomposition to measure degeneracy and establish the community’s high-density spine. We extract a spine subgraph utilizing a excessive core-percentile threshold to concentrate on structurally vital nodes. We run PageRank and approximate betweenness to rank nodes by affect and bridge-like conduct at scale.

t0 = tic()
plm = nk.neighborhood.PLM(G, refine=True, gamma=1.0, par="balanced")
plm.run()
toc(t0, "PLM neighborhood detection")


half = plm.getPartition()
num_comms = half.numberOfSubsets()
print("communities:", num_comms)


t0 = tic()
Q = nk.neighborhood.Modularity().getQuality(half, G)
toc(t0, "Modularity")
print("modularity Q:", Q)


sizes = np.array(checklist(half.subsetSizeMap().values()), dtype=np.int64)
print("neighborhood measurement stats:", pd.Sequence(sizes).describe(percentiles=[0.5, 0.9, 0.99]).to_dict())


t0 = tic()
eff = nk.distance.EffectiveDiameter(G, ED_RATIO)
eff.run()
toc(t0, f"EffectiveDiameter (ratio={ED_RATIO})")
print("efficient diameter:", eff.getEffectiveDiameter())


t0 = tic()
diam = nk.distance.EstimatedDiameter(G)
diam.run()
toc(t0, "EstimatedDiameter")
print("estimated diameter:", diam.getDiameter().distance)


force_cleanup()

We detect communities utilizing PLM and document the variety of communities discovered on the big graph. We compute modularity and summarize community-size statistics to validate the construction fairly than merely trusting the partition. We estimate world distance conduct utilizing efficient diameter and estimated diameter in an API-safe method for NetworKit 11.2.1.

t0 = tic()
sp = nk.sparsification.LocalSimilaritySparsifier(G, 0.7)
G_sparse = sp.getSparsifiedGraph()
toc(t0, "LocalSimilarity sparsification (alpha=0.7)")
report(G_sparse, "Sparse")


t0 = tic()
pr2 = nk.centrality.PageRank(G_sparse, damp=0.85, tol=1e-8)
pr2.run()
toc(t0, "PageRank on sparse")
pr2_scores = np.array(pr2.scores(), dtype=np.float64)
print("High PR nodes (sparse):", np.argsort(-pr2_scores)[:15].tolist())


t0 = tic()
plm2 = nk.neighborhood.PLM(G_sparse, refine=True, gamma=1.0, par="balanced")
plm2.run()
toc(t0, "PLM on sparse")
part2 = plm2.getPartition()
Q2 = nk.neighborhood.Modularity().getQuality(part2, G_sparse)
print("communities (sparse):", part2.numberOfSubsets(), "| modularity (sparse):", Q2)


t0 = tic()
eff2 = nk.distance.EffectiveDiameter(G_sparse, ED_RATIO)
eff2.run()
toc(t0, "EffectiveDiameter on sparse")
print("efficient diameter (orig):", eff.getEffectiveDiameter(), "| (sparse):", eff2.getEffectiveDiameter())


force_cleanup()


out_path = "/content material/networkit_large_sparse.edgelist"
t0 = tic()
nk.graphio.EdgeListWriter("t", 0).write(G_sparse, out_path)
toc(t0, "Wrote edge checklist")
print("Saved:", out_path)


print("nAdvanced large-graph pipeline full.")

We sparsify the graph utilizing native similarity to cut back the variety of edges whereas retaining helpful construction for downstream analytics. We rerun PageRank, PLM, and efficient diameter on the sparsified graph to verify whether or not key alerts stay constant. We export the sparsified graph as an edgelist so we are able to reuse it throughout periods, instruments, or further experiments.

In conclusion, we developed an end-to-end, scalable NetworKit workflow that mirrors actual large-network evaluation: we began from technology, stabilized the topology with LCC extraction, characterised the construction by way of cores and centralities, found communities and validated them with modularity, and captured world distance conduct by way of diameter estimates. We then utilized sparsification to shrink the graph whereas retaining it analytically significant and saving it for repeatable pipelines. The tutorial supplies a sensible template we are able to reuse for actual datasets by changing the generator with an edgelist reader, whereas retaining the identical evaluation levels, efficiency monitoring, and export steps.


Take a look at the Full Codes right here. Additionally, be at liberty to observe us on Twitter and don’t overlook to hitch our 120k+ ML SubReddit and Subscribe to our Publication. Wait! are you on telegram? now you possibly can be a part of us on telegram as nicely.


Elevate your perspective with NextTech Information, the place innovation meets perception.
Uncover the newest breakthroughs, get unique updates, and join with a world community of future-focused thinkers.
Unlock tomorrow’s tendencies immediately: learn extra, subscribe to our publication, and develop into a part of the NextTech neighborhood at NextTech-news.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
NextTech
  • Website

Related Posts

Google Launches TensorFlow 2.21 And LiteRT: Sooner GPU Efficiency, New NPU Acceleration, And Seamless PyTorch Edge Deployment Upgrades

March 7, 2026

Google AI Releases Android Bench: An Analysis Framework and Leaderboard for LLMs in Android Growth

March 7, 2026

Microsoft Releases Phi-4-Reasoning-Imaginative and prescient-15B: A Compact Multimodal Mannequin for Math, Science, and GUI Understanding

March 7, 2026
Add A Comment
Leave A Reply Cancel Reply

Economy News

Chinese language Researchers Develop Semi-Strong-State EV Battery with 620-Mile Vary

By NextTechMarch 7, 2026

solid-state battery that gives a 30% enhance in vitality density in comparison with main lithium-ion…

UL and IMR to design Eire’s first 3D-printed liquid rocket engine

March 7, 2026

AsiaStartupExpo Q1 2026 Highlights the New Actuality of Founder–Investor Dialogue in Asia – KoreaTechDesk

March 7, 2026
Top Trending

Chinese language Researchers Develop Semi-Strong-State EV Battery with 620-Mile Vary

By NextTechMarch 7, 2026

solid-state battery that gives a 30% enhance in vitality density in comparison…

UL and IMR to design Eire’s first 3D-printed liquid rocket engine

By NextTechMarch 7, 2026

The partnership information comes with official acceptance into the celebrated UK-based Race2Space…

AsiaStartupExpo Q1 2026 Highlights the New Actuality of Founder–Investor Dialogue in Asia – KoreaTechDesk

By NextTechMarch 7, 2026

Elevating enterprise capital in Asia now calls for greater than a compelling…

Subscribe to News

Get the latest sports news from NewsSite about world, sports and politics.

NEXTTECH-LOGO
Facebook X (Twitter) Instagram YouTube

AI & Machine Learning

Robotics & Automation

Space & Deep Tech

Web3 & Digital Economies

Climate & Sustainability Tech

Biotech & Future Health

Mobility & Smart Cities

Global Tech Pulse

Cybersecurity & Digital Rights

Future of Work & Education

Creator Economy & Culture

Trend Radar & Startup Watch

News By Region

Africa

Asia

Europe

Middle East

North America

Oceania

South America

2025 © NextTech-News. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms Of Service
  • Advertise With Us
  • Write For Us
  • Submit Article & Press Release

Type above and press Enter to search. Press Esc to cancel.

Subscribe For Latest Updates

Sign up to best of Tech news, informed analysis and opinions on what matters to you.

Invalid email address
 We respect your inbox and never send spam. You can unsubscribe from our newsletter at any time.     
Thanks for subscribing!