cross-posted from: https://mander.xyz/post/49566880

This is a graph of Peertube instances following each other. There are 942 nodes and 10067 edges.

Eigenvector centrality

On Peertube, an instance X can follow an instance Y to let its users see all the videos posted on Y. This graph is a directed graph.

Color and size of nodes depends on how big their Eigenvector centrality is. Nodes which have 0 centrality are blue and small, nodes with bigger centrality are big and red.

What centrality represents? Instances which are not followed by anyone have 0 centrality. Instances (A) with a lot of followers (B) have bigger centrality. If those followers (B) themselves have followers ©, it means centrality of A will be even higher.

Does it mean anything in context of Peertube? I’m not sure. Considering chain of three instances: (A) <- (B) <- ©, when (A) posts a video, does it appear in ©? Probably not. But if it was so, then centrality would’ve mean this: Videos posted on instances with high centrality spread across entire network, while videos posted on instances with 0 centrality are not visible anywhere else.

Here are top 10 instances and their centrality:

How to repeat this graph visualization

  1. Download latest Peertube instances.csv and interactions.csv files here: https://www.kaggle.com/datasets/marcdamie/fediverse-graph-dataset-reduced
  2. Import them to Gephi;
  3. Apply Giant Component filter to remove nodes which are not connected to biggest network;
  4. Apply ForceAtlas 2 layout;
  5. Run Eigenvector centrality Statistics (directed). It will add a new column to nodes table;
  6. Apply Nodes - Color - Ranking - Eigenvector centrality;
  7. Apply Nodes - Size - Ranking - Eigenvector centrality;
  8. Configure Preview and export.

Gephi

P.S. On colorful image used as thumbnail of this post nodes are colored by Modularity (community detection).

  • grue@lemmy.world
    link
    fedilink
    arrow-up
    8
    ·
    edit-2
    14 days ago

    Does it mean anything in context of Peertube? I’m not sure. Considering chain of three instances: (A) <- (B) <- (C), when (A) posts a video, does it appear in (C)? Probably not. But if it was so, then centrality would’ve mean this: Videos posted on instances with high centrality spread across entire network, while videos posted on instances with 0 centrality are not visible anywhere else.

    What I want to know is how to accomplish the inverse of that: not which instance to upload a video to in order to have it spread across the network, but which instance to browse from in order to be able to find all the videos in the network.

    IMO, this is the biggest problem with PeerTube: so much of the content is silo’d in relatively isolated instances that its’ hard for potential viewers to find.

    (BTW, you have to escape your (C) with a backslash, like \(C), to stop it from being turned into a copyright symbol.)

      • grue@lemmy.world
        link
        fedilink
        arrow-up
        1
        ·
        edit-2
        14 days ago

        Yes, sort of, but I’d also be curious to know what fraction of the total unique videos network-wide those numbers are, and also how disjoint the sets of videos are between instances (in order to try to find the minimum number of instances you’d need to browse to be able to have “nearly all” videos available).

        Also, it’s worth noting that the first two instances on there are very small and don’t allow signup. Dalek Zone is the first one that looks relatively “public,” but you’re already down 150K videos if you’re browsing from there.

        • podbrushkin@mander.xyzOP
          link
          fedilink
          arrow-up
          2
          ·
          14 days ago

          If there were real communities of instances in Peertube, they would’ve been visible in those pictures above, but looks like there are none. Everyone is kinda connected to everyone, no place for significant disjoint sets of videos. I hope one day at least NSFW instances will start moving away from the main hairy ball.

  • atro_city@fedia.io
    link
    fedilink
    arrow-up
    2
    ·
    14 days ago

    I don’t understand the term “centrality”. It seems to actually be showing followship or reachability. Centrality sounds like more people use it instead of other instances or percentage of content that is from that instance alone. 100% centrality would thus mean the instance only has its own videos and doesn’t make other videos available, while 0% centrality means it only makes videos from other instances available.

    • podbrushkin@mander.xyzOP
      link
      fedilink
      arrow-up
      1
      ·
      14 days ago

      I agree, “reachability” better explains this metric. But “centrality” is a well-defined term from graph theory. Also, center usually is easiest to reach from any given point (e.g. in cities). As for amount of active users, local videos or number of outgoing “follows” relationships for any instance - those metrics do not need a colorful picture and they nicely fit into this table: https://instances.joinpeertube.org/instances

      • atro_city@fedia.io
        link
        fedilink
        arrow-up
        1
        ·
        14 days ago

        From that perspective, centrality makes sense 🤔 Maybe it’s just me but for this context, it seems very counter-intuitive. It may be right for graph theory, but when talking about the fediverse, not so much IMO.

        • podbrushkin@mander.xyzOP
          link
          fedilink
          arrow-up
          2
          ·
          14 days ago

          for this context, it seems very counter-intuitive

          That’s very probable. Wait till I will get a graph where edges represent “blocks” relationships, will run same algorithm and will again say it’s centrality.