Data Science Ethics - Part - B
Data Science Ethics - Part - B
ETHICS
• Ethics is about the shared values and moral principles that govern our behavior in
society.
• Ethics is based not on laws but on widely accepted norms of what is "right vs.
wrong".
• Data Ethics is a new branch of ethics that "studies and evaluates moral
problems related to data, algorithms and corresponding practices".
• Ethics Culture is about operationalizing applied ethics to make sure that our
ethical principles and practices are adopted in a consistent and scalable manner
across the entire organization.
• Transparency ensures that data and AI actions are understandable (interpretable) to users, explaining the what
and why behind decisions.
• Fairness - focuses on ensuring AI treats all people fairly, addressing any systemic or implicit socio-technical
biases in data and systems.
• Reliability & Safety - ensures that AI behaves consistently with defined values, minimizing potential harms or
unintended consequences.
• Privacy & Security - is about understanding data lineage, and providing data privacy and related
protections to users.
• Inclusiveness - is about designing AI solutions with intention, adapting them to meet a broad range of human
needs & capabilities.
ETHICS CHALLENGE
• With data collection, actions will likely involve personal data or personally identifiable information
(PII) for identifiable living individuals.
• This includes diverse items of non-personal data that collectively identify an individual.
• Ethical challenges can relate to data privacy, data ownership, and related topics like informed
consent and intellectual property rights for users.
• With algorithm design, actions will involve collecting & curating datasets, then using them to train &
deploy data models that predict outcomes or automate decisions in real-world contexts.
• Ethical challenges can arise from dataset bias, data quality issues, unfairness ,and misrepresentation in
algorithms - including some issues that are systemic in nature.
1. DATA OWNERSHIP
• Data collection often involves personal data that can identify the data subjects.
Data ownership is about control and user rights related to the creation, processing
,and dissemination of data.
• Informed consent defines the act of users agreeing to an action (like data
collection) with a full understanding of relevant facts including the purpose,
potential risks, and alternatives.
• Did the user (data subject) give permission for data capture and usage?
• Did the user understand the purpose for which that data was captured?
• Did the user understand the potential risks from their participation?
3. INTELLECTUAL PROPERTY
• Data privacy or information privacy refers to the preservation of user privacy and
protection of user identity with respect to personally identifiable information.
• Data Quality looks at the validity of the curated dataset used to develop our
algorithms, checking to see if features and records meet requirements for the level
of accuracy and consistency needed for our AI purpose.
• The Illusion of Free Choice occurs when system "choice architectures" use decision-
making algorithms to nudge people towards taking a preferred outcome while
seeming to give them options and control. These dark patterns can cause social and
economic harm to users. Because user decisions impact behavior profiles, these
actions potentially drive future choices that can amplify or extend the impact of
these harms.
• PROFESSIONAL CODES
• ETHICS CHECKLISTS
• ETHICS REGULATIONS
• ETHICS CULTURE
1. PROFESSIONAL CODES
• Professional Codes offer one option for organizations to "incentivize" members
to support their ethical principles and mission statement.
• They are only as good as the voluntary compliance from members; however,
many organizations offer additional rewards and penalties to motivate compliance
from members.
2. ETHICS CHECKLISTS – Connect principles and practices
Example:
• Privacy Audit Checklist - provides general guidance for information handling practices
from legal and social exposure perspectives.
• 22 questions for ethics in data and AI - more open-ended framework, structured for initial
exploration of ethical issues in design, implementation, and organizational, contexts.
3. ETHICS REGULATIONS
• Note that there remains an intangible gap between compliance (doing enough to meet "the letter of the
law") and addressing systemic issues (like ossification, information asymmetry, and distributional
unfairness) that can speed up the weaponization of AI.
• The latter requires collaborative approaches to defining ethics cultures that build emotional
connections and consistent shared values across organizations in the industry.
• This calls for more formalized data ethics cultures in organizations - allowing anyone to
pull the Andon cord (to raise ethics concerns early in the process) and making ethical
assessments (e.g., in hiring) a core criteria team formation in AI projects.