fixing todos
origin

# Conflicts:
#	docs/site_libs/bootstrap/bootstrap-dark.min.css
#	docs/site_libs/bootstrap/bootstrap.min.css
#	todos.Rmd
Nics-Github committed Oct 2, 2024
2 parents 03c58d9 + cc0b3c0 commit 7c725b8
Showing 13 changed files with 1,356 additions and 239 deletions.
7 changes: 5 additions & 2 deletions _freeze/site_libs/revealjs/dist/theme/quarto.css

Large diffs are not rendered by default.

@@ -38,7 +38,7 @@ window.QuartoLineHighlight = function () {
divSourceCode.forEach((el) => {
if (el.hasAttribute(kCodeLineNumbersAttr)) {
const codeLineAttr = el.getAttribute(kCodeLineNumbersAttr);
el.removeAttribute("data-code-line-numbers");
el.removeAttribute(kCodeLineNumbersAttr);
if (handleLinesSelector(deck, codeLineAttr)) {
// Only process if attr is a string to select lines to highlights
// e.g "1|3,6|8-11"
@@ -165,17 +165,17 @@ window.QuartoLineHighlight = function () {
if (typeof highlight.last === "number") {
spanToHighlight = [].slice.call(
codeBlock.querySelectorAll(
":scope > span:nth-child(n+" +
":scope > span:nth-of-type(n+" +
highlight.first +
"):nth-child(-n+" +
"):nth-of-type(-n+" +
highlight.last +
")"
)
);
} else if (typeof highlight.first === "number") {
spanToHighlight = [].slice.call(
codeBlock.querySelectorAll(
":scope > span:nth-child(" + highlight.first + ")"
":scope > span:nth-of-type(" + highlight.first + ")"
)
);
}
118 changes: 76 additions & 42 deletions course-materials/lectures/11_ethics.qmd
@@ -11,106 +11,140 @@ editor: visual

This is a squishy area.

[*Is the use of data causing harm?*]{.smallcaps}
"Use Common Sense" - The authors of our textbook.

## We'll look at two areas
## The Five Ps of Ethical Data Handling.

1. Data Ethics of Big Data.
The Five Ps terminology comes from the Harvard Business Review.

2. Data Ethics of Visualization.
[Here's the link.](https://hbr.org/2023/07/the-ethics-of-managing-peoples-data)

## Copyright and ChatGPT

::: columns
::: {.column width="50%"}
![](https://upload.wikimedia.org/wikipedia/commons/thumb/0/04/ChatGPT_logo.svg/1920px-ChatGPT_logo.svg.png)
:::

::: {.column width="50%"}
- Sarah Silverman, Mona Awad, and Paul Tremblay sue OpenAI for direct infringement.

- Using copyrighted materials to train their models.

- [List of suits is getting longer.](https://originality.ai/blog/openai-chatgpt-lawsuit-list)
:::
:::

## Christopher Wylie
## Provenance

Where does the data originate?

Was it acquired legally?

## GitHub

::: columns
::: {.column width="50%"}
![](https://static01.nyt.com/images/2018/03/18/autossell/18Cambridge1/merlin_135596256_2eaf14bd-c577-441b-ac87-b5d54199d1b4-superJumbo.jpg?quality=75&auto=webp){fig-align="left" width="400"}
![](https://upload.wikimedia.org/wikipedia/commons/a/ae/Github-desktop-logo-symbol.svg){width="354"}
:::

::: {.column width="50%"}
- Cambridge Analytica improperly gets data on 50 million Facebook users.
::: columns
Microsoft (owner of GitHub) uses code in repos to train Copilot.

- Ethical on public repositories?

- Uses data to manipulate users for votes in the 2016 election.
- Private repositories?

- Similar story with Brexit.
- Copilot costs \$20 a month or is free.
:::
:::
:::

## Prediction of Race
## Purpose

GitHub user solivella made an [R package](https://github.com/kosukeimai/wru) to predict race based on surname and location.
Would the source of the data agree with how it is being used?

API from the US Census Bureau.
What if the data is re-purposed?

## Making Gaydar
## ![](https://upload.wikimedia.org/wikipedia/commons/4/48/OKCupid_Logo.svg){width="243"}

![](https://static01.nyt.com/images/2017/10/10/science/10SCI-GAYDAR1/00GAYDAR1-superJumbo.jpg?quality=75&auto=webp){fig-align="center" width="1500"}
::: incremental
- If that data is scraped from the site by a third party and made available, is that ethical?

[Michal Kosinski and Yilun Wang create a gaydar.](https://www.nytimes.com/2017/10/09/science/stanford-sexual-orientation-study.html)
- OkCupid, including usernames, age, gender, location, what kind of
relationship (or sex) they’re interested in, personality traits

Want to show that is it possible to make something public with data that should be private to the individual.
- "The data is already public." - Emil Kirkegaard
:::

## Lab 3
## Protection

[Spotify](https://www.vox.com/culture/22814121/spotify-wrapped-2021-algorithm-data-privacy)
How is the data being protected?

## Algorithms Reflect the bias of their creator
Who is responsible for destroying it?

> A piece of data itself has no positive or negative moral value, but the way we manipulate it does. It's hard to imagine a more contentious project than programing **ethics into our algorithms**; to do otherwise, however, and allow algorithms to monitor themselves, is to invite the quicksand of **moral equivalence**.
## College Students

## Books
::: columns
::: {.column width="50%"}
![Smith College](https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQm_RWI9VTHbnc4TXZTOpWZxf-nX3KdhlyATA&s)
:::

::: {.column width="50%"}
::: columns
::: {.column width="58%"}
![](https://images-na.ssl-images-amazon.com/images/S/compressed.photo.goodreads.com/books/1456091964i/28186015.jpg){fig-align="center"}
There is data that Smith collected from you.

The repos in this class are data that you are creating.
:::
:::
:::

## Data Visualizations.
## Preparation

How is the data cleaned?

Charts can be manipulated either intentionally or unintentionally.
Are data sets being combined to preserve anonymity?

Here are [some lessons learned from *How to Lie with Statistics*](https://towardsdatascience.com/lessons-from-how-to-lie-with-statistics-57060c0d2f19) by Darrell Huff.
Is the accuracy of the data verified?

Here is a [site I wanted to show you forever ago.](https://www.statisticshowto.com/probability-and-statistics/descriptive-statistics/misleading-graphs/)
## Lab 7ish

Here's another, [if we have time.](https://www.buzzfeednews.com/article/katienotopoulos/graphs-that-lied-to-us)
[![](https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQAGMOJ6BcHCxFFTln4-ZM9AQ9jaul5yUThUg&s)](https://aratrace.com/7-sisters-trail-race-4/)

## Telling a story
## Privacy

Data Visualizations are just telling a story.
Who will have access to data that can be used to ID a person?

Try to make it an honest one.
How will individuals be anonymized?

Who has access to that anonymized data?

## Lab 3

[Spotify](https://www.vox.com/culture/22814121/spotify-wrapped-2021-algorithm-data-privacy)

## Algorithms Reflect the bias of their creator

> A piece of data itself has no positive or negative moral value, but the way we manipulate it does. It's hard to imagine a more contentious project than programing **ethics into our algorithms**; to do otherwise, however, and allow algorithms to monitor themselves, is to invite the quicksand of **moral equivalence**.
## Books

## How to lie with Statistics

::: columns
::: {.column width="50%"}
::: {.column width="33%"}
![](https://images-na.ssl-images-amazon.com/images/S/compressed.photo.goodreads.com/books/1427665814i/51291.jpg)
:::

::: {.column width="50%"}
::: {.column width="33%"}
![](https://images-na.ssl-images-amazon.com/images/S/compressed.photo.goodreads.com/books/1456091964i/28186015.jpg)
:::
::: {.column width="33%"}
![](https://timharford.com/wp-content/uploads/2020/11/data-detective.png)
:::
:::

## Credits

- Some Content was taken from Ben Baumer's Lecture on Ethics

- Christopher Wylie https://www.nytimes.com/2018/03/17/us/politics/cambridge-analytica-trump-campaign.html

- Michal Kosinski and Yilun Wang https://www.nytimes.com/2017/10/09/science/stanford-sexual-orientation-study.html

- New Yorker https://www.newyorker.com/news/daily-comment/the-ai-gaydar-study-and-the-real-dangers-of-big-data
- The Five Ps seem to come from the Harvard Business Review <https://hbr.org/2023/07/the-ethics-of-managing-peoples-data> 9/30/2024

- Suing ChatGPT https://www.theverge.com/2024/2/13/24072131/sarah-silverman-paul-tremblay-openai-chatgpt-copyright-lawsuit