Unit 3 Notes - Unit3
Unit 3 Notes - Unit3
Unit -3
Based on – “Matthew Ganis, Avinash Kohirkar, Social Media
Analytics: Techniques and Insights for Extracting Business Value Out
of Social Media Pearson 2016”
Presenter Name – Shiv Tripathi
April 2025
Network Structures
Definition:
• Network structures define how individual entities (nodes)
are connected through relationships (edges) in a network.
1.Centralized Networks:
1.Have a dominant node that acts as the main point of interaction.
2.Examples: Facebook pages with millions of followers interacting
with one central brand account.
3.Pros: Efficient for information dissemination.
4.Cons: Vulnerable to single-point failure.
2.Decentralized Networks:
1.Composed of multiple hubs, ensuring no single node dominates.
2.Examples: Reddit communities, where multiple subreddits
function independently but are connected.
3.Pros: More resilient to failures and censorship.
4.Cons: Difficult to control or moderate content.
Types of Network Structures:
1.Distributed Networks:
1.Nodes have similar connectivity levels, making influence more
evenly spread.
2.Examples: Peer-to-peer networks like BitTorrent.
3.Pros: High resilience and fairness in content distribution.
4.Cons: Slower information spread compared to centralized networks.
2.Hierarchical Networks:
1.Structured in levels where information passes through layers before
reaching all nodes.
2.Examples: Corporate organizational charts, where communication
flows from top management to employees.
3.Pros: Clear roles and responsibilities.
4.Cons: Slower decision-making and communication bottlenecks.
Types of Network Structures:
Small-World Networks:
Characterized by short paths between nodes due to the
presence of influential hubs.
Examples: LinkedIn, where "six degrees of separation"
helps professionals connect globally.
Definition:
• Equivalence in networks refers to the similarity between
nodes based on their relationships and roles within the
network. It helps in categorizing users based on their
influence and connectivity patterns.
Types of Equivalence:
1.Structural Equivalence:
1.Two nodes have identical connections with the same set of
nodes.
2.Example: Employees reporting to the same manager have
similar structural equivalence in a corporate hierarchy.
3.Application: Used in job recommendation systems to find
similar professionals.
2.Automorphic Equivalence:
1.Nodes hold the same structural role but may not have identical
connections.
2.Example: CEOs of different companies play similar roles but are
not connected to the same employees.
3.Application: Identifying similar market leaders across industries.
Types of Equivalence:
Regular Equivalence:
• Nodes perform similar functions within different network
substructures.
• Example: Social media influencers in different niches
(fashion vs. tech influencers) share equivalent roles.
• Application: Understanding content creators’ impact in
different domains.
Equivalence in Social Media Analytics:
Definition:
Homophily is the tendency of similar individuals to form
connections with each other. It plays a critical role in shaping
online communities and determining content reach.
Types of Homophily:
1.Status-Based Homophily:
1.Based on demographic attributes like age, gender, or education.
2.Example: LinkedIn users forming groups based on job roles.
2.Value-Based Homophily:
1.Based on shared interests, beliefs, or opinions.
2.Example: Twitter users following accounts that align with their
political views.
Impact of Homophily in Social Media Analytics:
Definition:
Clustering refers to the formation of tightly connected groups within a network.
Key Metrics:
• Clustering Coefficient: Measures the degree to which nodes cluster
together.
• Modularity: Determines the strength of division in a network.
Definition:
A technique where existing users recruit new participants, growing
the sample exponentially.
Steps in Snowball Sampling:
1.Select initial subjects.
2.Ask them to refer more users.
3.Repeat the process to expand the dataset.
Applications:
• Influencer outreach programs.
• Identifying underground communities (e.g., dark web
networks).
• Analyzing niche social media trends.
Contact Tracing
Definition:
Tracking interactions between nodes to understand how information
or diseases spread.
Types:
• Forward Tracing: Tracking who an infected person has interacted
with.
• Backward Tracing: Identifying the original source of an infection
or rumor.
Applications:
• Misinformation tracking on social media.
• Understanding viral marketing campaigns.
• Epidemiology (e.g., COVID-19 tracking apps).
Random Walks in Networks
Definition:
A process where a path is randomly taken between nodes,
often used to analyze connectivity and structure.
Applications:
1.Search Engine Algorithms: Google’s PageRank for
ranking web pages.
2.Recommendation Systems: Netflix’s content
discovery algorithm.
3.Fraud Detection: Spotting abnormal transaction
behaviors.
Ego-Centered Networks
Applications:
• Social Media: Understanding a user's direct connections
(friends, followers).
• Marketing: Targeting individuals with high influence in
ego networks.
Example:
• In LinkedIn, your direct connections form your ego
network. The mutual connections among your contacts
determine network density.
• Diagram: Ego at the center with spokes connecting to
alters; some alters are interconnected.
Dominance Hierarchies
Definition: These are external data sources that provide information about
interactions or affiliations, not collected directly from users but observed or
documented by others.
Types:
• Web analytics logs
• CRM data
• Government or academic databases
Applications:
• Data enrichment: Combine third-party records with user-generated data.
• Behavior analysis: Understanding patterns beyond social platforms.
Example:
• Using government data on education to correlate social media discussions about
literacy.
• Diagram: Two circles: one for social media data, another for third-party records,
with overlap representing combined insights.
Affiliation Networks
Applications:
• Event analysis: Identify user groups attending similar
events.
• Recommendation systems: Suggesting content based
on shared group memberships.
Example:
• Users in a Facebook group are connected via their
common membership.
• Diagram: Two columns of nodes: left (users), right
(groups/events), with lines connecting them.
Citation Networks