Can AI Be Ethical? The Bias Problem
AI now decides who gets hired, who gets loans, and who gets parole. Explore how bias enters these systems and what we can do.
- How bias enters AI systems and why it is hard to remove
- Frameworks for responsible AI development
- Case studies: hiring algorithms, predictive policing, healthcare triage
- The global landscape of AI regulation in 2026
1. Where AI bias comes from
Can AI Be Ethical? The Bias Problem
AI now decides who gets hired, who gets loans, and who gets parole. Explore how bias enters these systems and what we can do.
How bias enters machine learning systems
Bias can enter at three main stages:
- Data collection: the training set is incomplete or unrepresentative
- Labeling: people encode subjective judgment into the target variable
- Deployment: the model is used in a setting different from the one it was trained on
A useful analogy: training data is like a classroom textbook. If the textbook leaves out whole chapters of history, the student who learns from it will be confident and still wrong.
Common sources of bias include:
- Historical bias: past discrimination appears in records
- Sampling bias: some groups are underrepresented
- Measurement bias: the thing being measured is not the thing we really care about
- Proxy bias: a harmless-looking variable stands in for a sensitive trait
- Feedback loops: model decisions change the world, and the new data reinforces the same pattern
Why bias is hard to remove
Removing one source of bias can expose another. If you rebalance data, you may increase noise. If you remove a sensitive feature like race or gender, the model can still infer it from ZIP code, school, or employment history. In fairness research, this is called proxy discrimination.
There is also a tradeoff between different fairness goals. A model can satisfy one fairness metric and violate another. For example, equal false positive rates and equal calibration are often impossible to satisfy at the same time when base rates differ across groups. That result was formalized in the fairness literature by researchers including Jon Kleinberg, Sendhil Mullainathan, and Manish Raghavan in 2016.
So the real question is not, “Can we make AI perfectly neutral?” It is, “Which harms are we trying to prevent, for whom, and under what context?”
A simple worked example
Suppose a hiring model is trained on 10,000 past applications. If 80 percent of the historical hires were men, the model may learn that male-coded signals correlate with success. Even if gender is removed, the model may still use signals like certain extracurriculars, career gaps, or school networks as proxies.
That is why fairness work starts with problem definition, not just model tuning.
2. The fairness toolbox
Core fairness metrics in AI
Demographic parity
- The model selects people at similar rates across groups
- Useful when selection itself is the concern
- Can be misleading if groups differ in underlying risk or eligibility
Equalized odds
- False positive rates and false negative rates are similar across groups
- Useful when mistakes have unequal costs
- Often harder to satisfy in practice
Calibration
- A predicted score means the same thing across groups
- Important for risk scores and triage systems
Individual fairness
- Similar people should receive similar predictions
- Hard to define because “similar” depends on the task
Analogy: demographic parity is like making sure every classroom gets the same number of gold stars. Equalized odds is like making sure the grading mistakes are equally common in every classroom.
3. Three high-stakes case studies
Case study 1: hiring algorithms
What can go wrong:
- Historical hiring data reflects old discrimination
- Resume features can act as proxies for gender, age, or class
- Automated ranking can hide the reason a candidate was rejected
What helps:
- Use job-relevant features only
- Audit outcomes by subgroup
- Keep humans accountable for final decisions
Case study 2: predictive policing
What can go wrong:
- Arrest data is not the same as crime data
- Patrol intensity changes what gets recorded
- Feedback loops amplify existing surveillance patterns
Case study 3: healthcare triage
What can go wrong:
- Spending is used as a proxy for need
- Under-treated patients look healthier than they are
- Risk scores can worsen inequities if they ignore access differences

What these cases teach
The hardest part is not detecting bias after the fact. It is noticing which variable is acting as a proxy before harm spreads. That takes domain experts, not just machine learning engineers.
4. Responsible AI in practice
Responsible AI development framework
-
Problem framing
- What decision is being automated?
- Who can be harmed?
- Is automation necessary?
-
Data review
- Who is represented?
- What is missing?
- Are labels reliable?
-
Model evaluation
- Test accuracy and fairness together
- Compare error rates across groups
- Stress-test edge cases
-
Human oversight
- Set escalation rules
- Require review for high-impact cases
- Make overrides visible
-
Post-deployment monitoring
- Track drift
- Re-audit regularly
- Log complaints and appeals
Practical safeguards
- Use representative test sets, not only average accuracy
- Set thresholds separately when justified by the use case and law
- Keep an appeal path for people affected by automated decisions
- Log model inputs, outputs, and overrides
- Reassess the system when the population or policy changes
A useful analogy: responsible AI is less like a single exam and more like a flight checklist. The plane can be technically capable and still unsafe if one step is skipped.
Documentation matters
Model cards and datasheets help teams answer three questions:
- What was this system built for?
- What data shaped it?
- When should it not be used?
That last question is often the most honest one.
5. Regulation and the road ahead in 2026
Global AI regulation snapshot in 2026
European Union
- AI Act in force since 1 August 2024
- Phased obligations for banned uses, general-purpose AI, and high-risk systems
- Strong emphasis on documentation, risk management, and human oversight
United States
- Sector-by-sector rules and enforcement
- Executive Order 14110 issued on 30 October 2023
- Local rules such as New York City Local Law 144 for hiring tools
United Kingdom
- Regulator-led approach across sectors
- Guidance from bodies such as the Information Commissioner's Office and the Financial Conduct Authority
China
- Rules for recommendation systems, deep synthesis, and generative AI services
- Strong content and platform governance
International
- Council of Europe AI Convention adopted in 2024
- OECD AI Principles remain a widely used policy reference
What good teams do now
- Build compliance into product design
- Keep audit trails and model documentation
- Test for bias before launch and after updates
- Include legal, policy, engineering, and domain experts
- Give users a way to contest automated decisions
The goal is not to make AI perfectly innocent. The goal is to make it accountable, testable, and reversible when it causes harm.
Keep going with Slate
Pick up where this left off in your own voice session.