NLP_debias_project

Our project replicates and extends the claims made in the paper Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. The paper highlights the biases that exist in training data and how machine learning models run the risk of amplifying them. Word embedding is a popular framework for representing text data as vectors, and most machine/deep learning models these days make extensive use of it. A word embedding represents each word as a d-dimensional vector, so words with similar semantic meaning end up close to each other in this d-dimensional vector space.
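To make this closeness property concrete, here is a tiny cosine-similarity illustration. The 3-d vectors and the similarity values in the comments are made up for demonstration (real w2vNEWS vectors are 300-dimensional); only the geometry matters.

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors: closer to 1.0 means more similar."""
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Toy 3-d "embeddings"; the values are invented for illustration.
vectors = {
    "king":  np.array([0.9, 0.1, 0.2]),
    "queen": np.array([0.8, 0.2, 0.3]),
    "apple": np.array([0.1, 0.9, 0.1]),
}

print(cosine_similarity(vectors["king"], vectors["queen"]))  # high (~0.98)
print(cosine_similarity(vectors["king"], vectors["apple"]))  # low (~0.24)
```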

Authors

Madhuri Pujari | Harish Chauhan | Dhruv Agarwal

Dataset

w2vNEWS (Word2Vec embedding trained on a corpus of Google News texts) - w2v_gnews_small.txt

  1. This is a word embedding trained on Google News articles which exhibits female/male gender stereotypes to a disturbing extent. This raises concerns because its widespread use often tends to amplify these biases.
  2. It is a 300-dimensional word2vec embedding, which has proven to be immensely useful since it is high quality, publicly available, and easy to incorporate into any application. In particular, we downloaded the embedding pre-trained on the Google News corpus and normalized each word vector to unit length, as is common.
  3. Starting with the 50,000 most frequent words, we selected only lower-case words and phrases consisting of fewer than 20 lower-case characters (words with upper-case letters, digits, or punctuation were discarded).
  4. After this filtering, 26,377 words remained. While we focus on w2vNEWS, gender stereotypes are also present in other embedding data-sets. (A sketch of this loading-and-filtering step follows this list.)
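A minimal loading sketch matching the description above, assuming w2v_gnews_small.txt is in the standard plain-text word2vec format (one `word v1 v2 ... v300` per line, most frequent words first); the helper name and its exact filtering rules are our interpretation of the steps listed.

```python
import numpy as np

def load_and_filter_embedding(path, vocab_limit=50_000, max_len=20):
    """Load a plain-text embedding, keep short, purely lower-case alphabetic
    words, and normalize each vector to unit length."""
    words, vectors = [], []
    with open(path, encoding="utf-8") as f:
        for i, line in enumerate(f):
            if i >= vocab_limit:          # only the most frequent words
                break
            parts = line.rstrip().split(" ")
            word, vec = parts[0], np.array(parts[1:], dtype=np.float32)
            # Discard words with upper-case letters, digits, or punctuation.
            if len(word) < max_len and word.isalpha() and word.islower():
                words.append(word)
                vectors.append(vec / np.linalg.norm(vec))  # unit length
    return words, np.stack(vectors)

words, E = load_and_filter_embedding("w2v_gnews_small.txt")
print(len(words), E.shape)  # roughly 26k words x 300 dims after filtering
```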

Debias Algorithm

The paper utilizes the following methods to successfully dampen the effect of bias in the embedding while still preserving its useful properties, such as the ability to cluster related concepts and to solve analogy tasks.

  1. Identify the gender bias subspace. The authors apply Principal Component Analysis (PCA) to 10 gender-pair difference vectors and show that the majority of the variance in these vectors lies along a single principal axis: the first eigenvalue is significantly larger than the rest (see the first sketch after this list).
  2. Hard de-biasing (neutralize and equalize). To lessen the impact of bias, the authors introduce the neutralize-and-equalize method. It removes gender-neutral words from the gender subspace and makes the words in each equality pair equidistant from all words outside this subspace (see the second sketch after this list).
  3. Soft bias correction. Equalize removes certain distinctions that are valuable in some applications. The soften algorithm instead reduces the differences between these sets while maintaining as much similarity to the original embedding as possible, with a parameter that controls this trade-off.
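A sketch of step 1, building on `words` and `E` from the loading sketch above. The pairs shown are an illustrative subset of the paper's definitional gender pairs, and taking the top principal component as the bias direction assumes a one-dimensional bias subspace, as the README's step 1 suggests.

```python
import numpy as np
from sklearn.decomposition import PCA

# Word -> vector lookup over the filtered embedding from the earlier sketch.
vec = dict(zip(words, E))

pairs = [("she", "he"), ("her", "his"), ("woman", "man"),
         ("herself", "himself"), ("daughter", "son"),
         ("mother", "father"), ("girl", "boy"), ("female", "male")]

# Difference vector of each definitional pair; PCA over these shows that
# a single direction explains most of the variance.
diffs = np.stack([vec[a] - vec[b] for a, b in pairs])
pca = PCA(n_components=len(pairs))
pca.fit(diffs)

print(pca.explained_variance_ratio_)  # the first ratio dominates the rest
g = pca.components_[0]                # unit-length gender direction
```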
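And a minimal numpy sketch of step 2 (hard de-biasing) for a one-dimensional bias subspace, assuming unit-length word vectors and the direction `g` found above; the words in the usage lines are illustrative choices, not the paper's full word lists.

```python
import numpy as np

def neutralize(w, g):
    """Project w off the gender direction g and renormalize,
    so a gender-neutral word carries no gender component."""
    w = w - np.dot(w, g) * g
    return w / np.linalg.norm(w)

def equalize(a, b, g):
    """Re-embed an equality pair (a, b) so both words are exactly
    equidistant from every gender-neutral word, differing only along g."""
    mu = (a + b) / 2                     # pair mean
    nu = mu - np.dot(mu, g) * g          # gender-neutral part of the mean
    scale = np.sqrt(1 - np.linalg.norm(nu) ** 2)
    new = []
    for w in (a, b):
        w_b = (np.dot(w, g) - np.dot(mu, g)) * g   # w's offset from mu along g
        new.append(nu + scale * w_b / np.linalg.norm(w_b))
    return new[0], new[1]

# Hypothetical usage with the lookup from the previous sketch:
vec["doctor"] = neutralize(vec["doctor"], g)
vec["grandmother"], vec["grandfather"] = equalize(
    vec["grandmother"], vec["grandfather"], g)
```

Since `nu` and the rescaled in-subspace offset are orthogonal and their squared norms sum to 1, each equalized vector stays unit length, matching the normalization applied when the embedding was loaded.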

Results

Figure 1. Gender Bias - Word Embedding Space

Figure 2. Racial Bias - Word Embedding Space

Figure 3. Debiased Word Embedding Space after applying the algorithm
