U 2osn
U 2osn
--------------------------------------------------------------------------------------------------------------------------------------
-------------------------------------------------------------------------------------------------------------------------------------
1.Information Diffusion
Information diffusion in social media refers to the process through which content
(e.g., messages, news, images, videos) spreads across networks of individuals or entities through
interactions and connections. Social media platforms like Facebook, Twitter, Instagram, and TikTok
play a critical role in disseminating information rapidly, often reaching global audiences in real-time.
2. Diffusion Process
o Information starts with a source node and spreads through the network via edges as
users interact with or share content.
o The process is influenced by user behaviors, network structure, and the content's
characteristics.
3. Cascading Effect
o Influential nodes (high centrality users) amplify diffusion due to their larger reach
and engagement levels.
5. Virality
o Refers to the phenomenon when content spreads rapidly across the network, often
exponentially.
o Factors contributing to virality include emotional triggers, relatability, and network
effects.
o Each active node (user) has a single chance to activate its neighboring nodes
(friends/followers) with a certain probability.
3. Epidemic Models
▪ Susceptible (S): Users who are not yet exposed to the information.
4. Hybrid Models
1. Content Characteristics
o Format: Visual and video-based content tends to diffuse more rapidly than plain text.
2. Network Structure
o Bridge Nodes: Users that connect multiple groups or communities facilitate broader
diffusion.
3. User Behavior
4. Platform Algorithms
o Example: Facebook's News Feed, Instagram’s explore page, or TikTok’s "For You"
page.
3. Political Campaigning
o Political entities use social media for awareness, engagement, and mobilizing
support.
o Example: #MeToo movement, Black Lives Matter, and climate change campaigns.
o Studying diffusion patterns helps in identifying and controlling the spread of fake
news or rumors.
o False or misleading content can spread faster than accurate information, causing
societal harm.
o Users are often exposed only to like-minded content, limiting diverse perspectives.
3. Overload of Information
4. Manipulation of Diffusion
o Bots, fake accounts, and malicious campaigns can distort diffusion patterns.
5. Lack of Privacy
1. Linear Diffusion
Definition: Linear diffusion occurs when information spreads through a linear network, where each
node has a limited number of connections.
Example: A rumor spreads through a small town, where each person tells a few friends.
Characteristics: Linear diffusion is often slow and predictable, with information spreading in a linear
fashion.
2. Exponential Diffusion
Definition: Exponential diffusion occurs when information spreads rapidly through a network, where
each node has many connections.
Characteristics: Exponential diffusion is often rapid and unpredictable, with information spreading
exponentially.
3. Threshold Diffusion
Definition: Threshold diffusion occurs when information spreads through a network, but only after a
certain threshold of adoption is reached.
Example: A new technology is adopted by a small group of early adopters, but only becomes widely
accepted after a certain threshold of adoption is reached.
4. Hierarchical Diffusion
Definition: Hierarchical diffusion occurs when information spreads through a hierarchical network,
where information flows from top to bottom.
Example: A company announces a new policy, which is communicated from top management to
employees.
5. Contagion Diffusion
Definition: Contagion diffusion occurs when information spreads through a network, where each
node is influenced by its neighbors.
Example: A disease spreads through a population, where each person is more likely to become
infected if they have contact with an infected person.
Definition: Social network diffusion occurs when information spreads through a social network,
where information flows through relationships and connections.
Example: A piece of news spreads through a social media platform, where users share and comment
on the news.
Definition: Spatial diffusion occurs when information spreads through a geographic area, where
information flows from one location to another.
Example: A new product is introduced in a city, and then spreads to surrounding cities and towns.
8. Temporal Diffusion
Definition: Temporal diffusion occurs when information spreads over time, where information flows
from one time period to another.
Example: A new technology is introduced, and then spreads over time as more people adopt it.
3. Herd Behaviour
Herd behavior refers to the phenomenon where individuals follow the actions of others,
often without considering their own preferences or rational judgment.
During the dot-com bubble of the late 1990s, many investors bought stocks of technology
companies without thoroughly evaluating their financials or business models. They followed the
herd, assuming that if everyone else was investing, it must be a good idea. As a result, stock prices
skyrocketed, only to crash later, leaving many investors with significant losses.
• Social influence: Individuals follow the actions of others due to social pressure or the desire
to conform.
• Lack of critical thinking: Individuals may not critically evaluate the information or situation
before making a decision.
• Emotional decision-making: Herd behavior is often driven by emotions, such as fear, greed,
or excitement.
• Rapid spread: Herd behavior can spread quickly through a group or population, often
through social networks or media.
• Informational Herding: Individuals follow the actions of others because they believe others
have better information.
• Social Herding: Individuals follow the actions of others because they want to conform to
social norms or avoid social disapproval.
• Reputational Herding: Individuals follow the actions of others because they want to associate
themselves with successful or prestigious individuals or groups.
• Uncertainty: When individuals are uncertain about the best course of action, they may follow
the herd.
• Social Influence: The influence of social norms, peer pressure, and authority figures can
contribute to herd behavior.
• Emotions: Strong emotions such as fear, greed, or excitement can drive herd behavior.
• Lack of Critical Thinking: When individuals do not critically evaluate information, they may
follow the herd.
• Stock Market Bubbles: The rapid increase in stock prices during the dot-com bubble and the
subsequent crash.
• Fashion Trends: The rapid adoption of fashion trends, such as clothing styles or accessories.
• Social Media: The rapid spread of information, memes, or challenges on social media
platforms.
• Financial Panics: The rapid withdrawal of funds from banks or financial institutions during
times of financial stress.
• Market Volatility: Herd behavior can lead to rapid changes in market prices, resulting in
volatility.
• Financial Losses: Herd behavior can result in significant financial losses, such as during stock
market crashes.
• Reduced Innovation: Herd behavior can stifle innovation, as individuals may be less likely to
challenge conventional wisdom.
• Social Conformity: Herd behavior can lead to social conformity, where individuals prioritize
fitting in over critical thinking.
4. Information Cascades:
• Social Influence Cascades: Information spreads through social networks due to social
influence.
• Informational Cascades: Information spreads due to its perceived value or accuracy.
• Reputation-Based Cascades: Information spreads due to the reputation of the source.
• Social Media Rumors: False information spreads rapidly through social media platforms.
• Financial Bubbles: Investors rapidly adopt and spread information about a particular stock or
asset.
• Product Adoption: Consumers rapidly adopt and recommend a new product.
• Network Structure: The structure of the social network influences the spread of information.
• Source Credibility: The credibility of the source influences the adoption of the information.
• Social Influence: Social influence from peers and opinion leaders influences the adoption of
the information.
The diffusion of innovations is the process by which new ideas, technologies, or products are
adopted and spread within a society or organization.
Key Concepts
• Innovation: A new idea, technology, or product that is perceived as new by the individual or
organization.
• Adoption: The decision to use or implement an innovation.
• Diffusion: The process by which an innovation is communicated and adopted by members of
a social system.
• Social System: A group of individuals who share a common culture, norms, and values.
Innovation
An innovation is a new idea, technology, or product that is perceived as new by the individual or
organization.
Types of Innovations:
Characteristics of Innovations:
Adoption
- Types of Adoption:
- Compatibility: The degree to which the innovation is consistent with existing values, norms, and
practices.
Diffusion
- Types of Diffusion:
- Mass Media Diffusion: The innovation is communicated through mass media channels, such as
television or newspapers.
- Social Networks: The structure and composition of social networks can influence the diffusion of
innovations.
- Opinion Leaders: Individuals who are respected and influential within a social system can
influence the diffusion of innovations.
Social System
- Definition: A social system is a group of individuals who share a common culture, norms, and
values.
- Formal Organizations: Organizations with a formal structure and hierarchy, such as businesses or
governments.
- Informal Networks: Networks of individuals who interact and communicate with each other, such
as social media groups or community organizations.
- Factors Influencing Social Systems:
- Culture: The shared values, norms, and practices of a social system can influence the diffusion of
innovations.
- Power Dynamics: The distribution of power and influence within a social system can influence
the diffusion of innovations.
• Relative Advantage: The degree to which the innovation is perceived as better than existing
solutions.
• Compatibility: The degree to which the innovation is consistent with existing values, norms,
and practices.
• Complexity: The degree to which the innovation is difficult to understand or use.
• Triability: The degree to which the innovation can be experimented with or tried out.
• Observability: The degree to which the results of the innovation are visible or observable.
Types of Adopters
1. Innovators: The first 2.5% of adopters, who are willing to take risks and try new things.
2. Early Adopters: The next 13.5% of adopters, who are opinion leaders and early to adopt new
innovations.
3. Early Majority: The next 34% of adopters, who are cautious but willing to adopt new innovations.
4. Late Majority: The next 34% of adopters, who are skeptical and only adopt new innovations after
they have become widely accepted.
5. Laggards: The final 16% of adopters, who are resistant to change and only adopt new innovations
when they are forced to do so.
2. Public Health: The diffusion of innovations can be used to promote healthy behaviors and prevent
the spread of diseases.
3. Organizational Change: Understanding how innovations diffuse through an organization can help
managers develop effective strategies for implementing change.
4. Technology Adoption: The diffusion of innovations can be used to understand how new
technologies are adopted and used by individuals and organizations.
6. Epidemics
Definition
An epidemic is a sudden increase in the number of cases of a disease or illness beyond what is
normally expected in a given area or population.
Types of Epidemics
2. Propagated Epidemic: The disease spreads from person to person, often through direct contact or
airborne transmission.
Stages of an Epidemic
Examples of Epidemics
Controlling Epidemics
1. Influence Maximization:
- Findings: Identified the most influential nodes in the network and showed that targeting these
nodes can maximize influence
2. Information Diffusion:
- Experiment: Analyzed the spread of information on Twitter during the 2009 Iranian election
protests
- Findings: Identified the key factors that influence information diffusion, including the structure of
the network and the characteristics of the information
3. Social Influence:
- Study: "Social Influence and Social Change in Social Networks" by Christakis and Fowler (2007)
- Experiment: Analyzed the spread of obesity and smoking cessation in a social network of over
12,000 individuals
- Findings: Showed that social influence plays a significant role in shaping individual behavior and
that social networks can be used to promote positive social change
4. Network Structure:
- Study: "The Structure of Online Social Networks" by Mislove et al. (2007)
- Experiment: Analyzed the structure of several online social networks, including Flickr and
YouTube
- Findings: Identified common structural features of online social networks, including power-law
degree distributions and high clustering coefficients
- Study: "The Privacy- Security Tradeoff in Online Social Networks" by Gross and Acquisti (2005)
- Experiment: Analyzed the privacy settings and security practices of users on Facebook
- Findings: Showed that users often trade off privacy for security and that online social networks
can be vulnerable to security threats.
These studies demonstrate the importance of experimental research on OSNs to understand their
dynamics, structure, and impact on society.
6.Emotional Contagion:
- Experiment: Manipulated the emotional content of Facebook users' news feeds to study
emotional contagion
- Findings: Showed that emotional contagion occurs on social media, and that users' emotions can
be influenced by the emotional content of their news feeds.
7. Social Comparison:
- Experiment: Surveyed Facebook users about their social comparison behaviors on the platform
- Findings: Showed that social comparison is a common behavior on social media, and that it can
lead to negative emotions and decreased self-esteem
8. Network Effects:
- Experiment: Studied the spread of information on Twitter and found that network effects play a
significant role in shaping the diffusion of information
- Findings: Showed that the structure of the network and the characteristics of the information
being shared both influence the spread of information on social media
9. Personalization:
- Findings: Showed that personalization can increase user engagement, but that it can also lead to
filter bubbles and decreased diversity of information
- Study: "The Spread of Fake News on Social Media" by Allcott and Gentile (2017)
- Experiment: Studied the spread of fake news on Facebook and Twitter during the 2016 US
presidential election
- Findings: Showed that fake news can spread quickly on social media, and that it can have
significant consequences for democracy and public discourse.
8. Behaviour Analytics
Behavior analytics is the process of analyzing and interpreting human behavior, often
through the use of data analytics and machine learning techniques.
• Web Behavior Analytics: Analyzing user behavior on websites and web applications.
• Mobile Behavior Analytics: Analyzing user behavior on mobile devices and applications.
• Customer Behavior Analytics: Analyzing customer behavior and interactions with a company
or product.
• Social Media Behavior Analytics: Analyzing user behavior on social media platforms.
• Google Analytics: A web analytics tool that provides insights into user behavior on websites
and applications.
• Mixpanel: A product analytics tool that provides insights into user behavior on mobile and
web applications.
• Adobe Analytics: A web analytics tool that provides insights into user behavior on websites
and applications.
• SAS Customer Intelligence: A customer analytics tool that provides insights into customer
behavior and preferences.
9. Sentiment Analysis
Sentiment analysis is a natural language processing (NLP) technique used to determine the
emotional tone or sentiment behind a piece of text, such as a review, tweet, or comment.
• Ambiguity and Context: Sarcasm, irony, and figurative language can make sentiment analysis
challenging.
• Domain Adaptation: Sentiment analysis models may not generalize well across different
domains or industries.
• Handling Imbalanced Data: Sentiment datasets can be imbalanced, with more positive or
negative examples than neutral ones.
• NLTK: A popular Python library for NLP tasks, including sentiment analysis.
• TextBlob: A simple Python library for sentiment analysis and text classification.
• Stanford CoreNLP: A Java library for NLP tasks, including sentiment analysis.
• IBM Watson Natural Language Understanding: A cloud-based API for sentiment analysis and
text analysis.
10. Sampling
In other words, sampling involves choosing a representative group from a larger group, so that the
characteristics of the smaller group can be used to make conclusions about the larger group.
1. Cost and time efficiency: Collecting data from an entire population can be expensive and
time-consuming. Sampling allows researchers to collect data from a smaller group, which is
more feasible.
2. Increased accuracy: Sampling can help reduce errors and biases that can occur when
collecting data from an entire population.
1. Random Sampling: Selecting a random subset of users or posts from the social
media platform.
3. Snowball Sampling: Starting with a small group of users and asking them to
recruit their friends or followers to participate in the study.
Sampling Methods
1. API-based Sampling: Using social media APIs to collect data from a random or
stratified sample of users or posts.
2. Web Scraping: Collecting data from social media websites using web scraping
techniques.
Challenges
2. Bias: Avoiding biases in the sampling process, such as selection bias or non-
response bias.
3. Data Quality: Ensuring that the collected data is accurate, complete, and relevant.
Best Practices
1. Clearly define the population: Identify the specific social media platform, user
group, or topic of interest.
4. Monitor and adjust: Continuously monitor the sampling process and adjust as
needed to ensure that the sample is representative and unbiased.
Probability Sampling
Non-Probability Sampling
Identify the specific social media platform(s) and user group(s) you want to study.
Decide on the type of sampling frame you want to use, such as:
Use APIs, web scraping, or other data collection methods to gather data from the social media
platform(s).
Use the chosen sampling method to select the required number of users, posts, or hashtags from the
dataset.
# Step 8: Validate the Sample
▪ Population: All Twitter users who have tweeted about a specific topic (e.g.,
#climatechange)
▪ Sampling frame: Twitter API to collect tweets and user information
▪ Sampling method: Random sampling of 1,000 users who have tweeted
about #climatechange
▪ Sample size: 1,000 users
Challenges of Sampling
• Bias: Sampling bias can occur if the sample is not representative of the
population.
• Error: Sampling error can occur due to chance, resulting in a sample that is not
representative of the population.
• Generalizability: The results of a sample may not be generalizable to the larger
population.
• Cost and Time: Sampling can be time-consuming and expensive, especially for
large populations.
• Data Quality: Poor data quality can affect the accuracy of the sample.
• Non-Response: Non-response can occur if some individuals or groups do not
respond to the sample.
• Sampling Frame: The sampling frame may not be accurate or up-to-date.
Applications of Sampling