Skip to content

The main goal of this project is to answer the questions "Which state within the USA had the most sightings over the period?" and "Within this state which county was responsible for the majority of these sightings?", using Excel and Tableau.

Notifications You must be signed in to change notification settings

DkOwusuA004/Bigfoot-Sightings

Repository files navigation

Bigfoot-Sightings

Table of Contents
  1. Business Understanding
  2. Data Understanding

The main goal of this project is to answer the questions "Which state within the USA had the most sightings over the period?" and "Within this state which county was responsible for the majority of these sightings?".

Business Understanding

Scope

This analysis takes place between 1950 and 2018 within the geographical location of the USA.

Credibility Criteria

The Bigfoot sightings were classified into 3 report types:

Class A - These reports involve clear sightings in circumstances where misinterpretation or misidentification of other animals can be ruled out with greater confidence e.g. there are several footprint cases that are very well documented.

Class B - These reports involve incidents where a possible sighting was observed at a great distance or in poor lighting conditions e.g. a report where nothing was seen but distinct and characteristic sounds of sasquatches were heard.

Class C - These reports have a high potential for inaccuracy e.g. most second-hand reports, and any third-hand reports, or stories with an untraceable source.

In this analysis I consider sightings to be credible if they are within Classes A & B.

Data Understanding

Data source

The data source I am using is the Bigfoot_cleaned Excel file which I obtained through my Introduction to Tableau module in my Data Analytics Essentials course.

I believe this data to be reliable because firstly, the data was sourced from WWW.BFRO.NET which is the only scientific research organization exploring the bigfoot/sasquatch mystery. The data used in my analysis is from their comprehensive database of credible sightings and related reports which is maintained by an all-volunteer network of bigfoot/sasquatch researchers, archivists, and investigators in the United States and Canada.

Variables of Interest

The key variables I want to analyse within this project are as follows:

  1. Number of sightings
  2. States in which sightings occured
  3. Counties in which sightings occured

Data Types

My data contains:

  • Report Type - String
  • Report Class - String
  • Submitted Date - Date and Time
  • Year - String
  • Season - String
  • Month - Date and Time
  • State - String
  • County - String

Visualisation Requirements

I want to first use a horizontal bar chart in order to visualise the state with the most sightings. Then I want to use a symbol map in order to visualise which counties within the selected state had the most sightings.

Comparative Analysis

I want to compare sightings within the selected state in 3 different ways:

  1. The season in which reports were made
  2. The month in which reports were made
  3. The class types of reports made

In order to identify any patterns or trends with when these reports were made.

Approach

Data Collection

To collect the data used for this analysis I downloaded an Excel file within the Cisco 'Introduction to Tableau module' in my Data Analytics Essentials course.

Data Cleaning

First I put a filter on the entire date in order to make the cleaning process easier, I then removed the 'Submitted Date' column as this was irrelevant to my analysis. Then I used conditional formatting on the year column to highlight any data rows outside the range of 1950 - 2018. After this, I filtered all the class C reports and removed them from the dataset as they were not deemed credible enough for the analysis. I then filtered all the 'unknown' cells in the 'Season' column and removed these rows of data as they would've affected my comparative analysis later during the project, I also filtered all the '(blanks)' cells in the month column in order to remove the lines of data that didn't have a specific month as these lines of data would affect my upcoming comparative analysis.

Analysis

Using a pivot table I put the state column in the row and values fields and then I sorted the data descending by 'Count of State' which showed me that Washington had the highest number of sightings (506).

I then created a second pivot table. This time I put 'State' and then 'County' in the Rows field and then I put 'Count of County' in the Values field in order to show sightings per county. I then sorted the data descending by 'Count of County', and then I filtered on the state of Washington in the Row field, as this was the state with the most sightings, which showed me that 'Pierce' was the county with the most sightings.

Visualisations

Bigfoot Sightings Dashboard 1

Bigfoot Sightings Dashboard 2

Interpretation

Within the first dashboard are 2 graphs; 'Bigfoot sightings by State' and 'Washington sightings by County'. The 'Bigfoot sightings by State' shows us that Washington produced the most Bigfoot sightings (503) between 1950-2018. This answers the first question "Which state within the USA had the most sightings over the period?".

In order to answer the second question; "Within this state which county was responsible for the majority of these sightings?", we have to look at the 'Washington sightings by County' graph. This graph shows us that within Washington, the county with the highest number of sightings is Pierce County with 68 sightings in the period.

Regarding the comparative analysis we can look at tables 'Sightings by season', 'Sightings by month' and Sightings by class'.

Firstly, by looking at the 'Sightings by season' graph, we can see that 40.36% of sightings (203) were reported during Summer, this could be due to the following:

  1. Increased Outdoor Activity
  2. During the summer season, a pronounced surge in vacationing and recreational pursuits is observed. Many individuals, particularly outdoor enthusiasts such as campers and hikers, tend to partake in these activities during this period, capitalizing on the more predictable weather conditions. This propensity raises the probability of individuals finding themselves in locations where reported Bigfoot sightings have historically occurred. Furthermore, the summer months represent an optimal time for wildlife observation. Devotees of nature and photography often allocate an increased duration to exploring natural habitats, thereby augmenting the chances of encountering diverse fauna. It is noteworthy that such wildlife encounters, in certain instances, may be misconstrued as potential Bigfoot sightings.

  3. Better Weather Conditions/Extended Daylight Hours
  4. The elevated temperatures of the summer season render outdoor activities notably more comfortable and enticing. Consequently, individuals are inclined to explore natural habitats, thereby heightening the prospects of wildlife encounters or reporting unconventional sightings. Moreover, the longer duration of daylight during summer extends the time available for outdoor activities. This prolonged visibility window creates additional opportunities for individuals to discern peculiar phenomena and subsequently report such sightings.

  5. Wildlife Activity/Migration and Movement Patterns
  6. In the summer season, numerous wildlife species undergo heightened activity, influenced by factors like breeding seasons, abundant food availability, and favourable weather conditions. This surge in wildlife activity is prone to result in increased observations and reports from individuals engaged in outdoor activities, particularly in regions recognised for Bigfoot sightings. Furthermore, locales characterised by both documented Bigfoot sightings and escalated wildlife activity may witness elevated instances of human-wildlife interactions. This heightened interaction dynamic amplifies the likelihood of individuals reporting sightings, contributing to the convergence of reported encounters in these areas.

Next, looking at the 'sightings by month' graph, we can see that the month of August had 88 sightings reported out of a possible 503 (11.73%). This graph shows us that the summer sightings were greatly influenced by the month of August as the 2nd and 3rd highest months after August were October (68) and September (67)

Finally, the 'sightings by class' graph shows that the majority (58.65%) of reports made were reports involving incidents where a possible sighting was observed at a great distance or in poor lighting conditions

Potential Biases

The most prevalent type of bias that could've affected the conclusion of this project is information bias; many people may have had sightings that could very well fall under Class A or B reports however, for an array of reasons they may not have filed a report. In addition to this, there is the risk of hoax and prank reporting which affects the validity of the data collected, ultimately skewing the data and creating inaccuracies within the analysis.

Technologies

Excel Logo

Tableau Public Logo

Status

Credits

Bigfoot Sightings

About

The main goal of this project is to answer the questions "Which state within the USA had the most sightings over the period?" and "Within this state which county was responsible for the majority of these sightings?", using Excel and Tableau.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages