Skip to content

Latest commit

 

History

History
28 lines (15 loc) · 1.76 KB

Data Collection and Prep.md

File metadata and controls

28 lines (15 loc) · 1.76 KB

In this project I will be using the Chicago Face Database as my main dataset.

Download link: https://www.chicagofaces.org/

A C K N O W L E D G E M E N T

CFD: Ma, Correll, & Wittenbrink (2015). The Chicago Face Database: A Free Stimulus Set of Faces and Norming Data. Behavior Research Methods, 47, 1122-1135. https://doi.org/10.3758/s13428-014-0532-5.
CFD-MR: Ma, Kantner, & Wittenbrink, (2020). Chicago Face Database: Multiracial Expansion. Behavior Research Methods. https://doi.org/10.3758/s13428-020-01482-5.
CFD-INDIA: Lakshmi, Wittenbrink, Correll, & Ma (2020). The India Face Set: International and Cultural Boundaries Impact Face Impressions and Perceptions of Category Membership. Frontiers in Psychology, 12, 161. https://doi.org/10.3389/fpsyg.2021.627678.

The dataset is composed of various sections.

I will be focused on the main CFD folder.

enter image description here

The CFD folder contains a total of 597 pictures of individuals of 4 different ethnics groups: Asian, Black, Latin and White. Each group is divided in Females and Males. For each ethnic group we have male and female:

Ethnic Groups

I am going to collect two types of values for each subcategory:

  1. Observed frequencies: are the actual number of individuals aged over 60 in each ethnic subcategory in your dataset. This task is done manually 😣

  2. Expected frequencies: are what you would expect if there was no age bias.