Predicting Single Family Home Prices in Pittsburgh

Topic

This project aims to build models that better predict the sale prices of single-family homes in Pittsburgh. We will supplement real estate data from sources such as Redfin with a variety of other Pittsburgh data sources.

Overview

In today’s hot real estate market, it is more difficult than ever to find value in a home purchase. Our product leverages non-conventional data streams to better predict the actual selling price of homes. Better predictions matter because they let real estate investors identify properties that are potentially undervalued relative to the market and find value where other investors may not think to look.

Current approaches come from companies like Redfin, Zillow, and Realtor.com, which use a mix of resources to predict home prices. We decided to use Redfin's data. Based on its publicly available data, Redfin predicts home prices from the number of bedrooms and bathrooms, the year the home was built, and the neighborhood. Our model takes as input the basic information that is publicly available on Redfin, plus additional crime, local school, and census data. Its output is a predicted home selling price.
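As a rough illustration of the inputs described above, the sketch below attaches neighborhood-level features to each listing. It assumes the extra data can be joined on a neighborhood name, and every field name and value here (`crime_rate`, `school_rating`, `median_income`, the sample rows) is hypothetical rather than taken from the project's datasets.

```python
from typing import Any, Dict, List


def build_feature_rows(
    listings: List[Dict[str, Any]],
    neighborhood_features: Dict[str, Dict[str, Any]],
) -> List[Dict[str, Any]]:
    """Attach neighborhood-level features to each listing.

    Listings whose neighborhood has no extra data are skipped
    (an assumption; imputing values would be another option).
    """
    rows = []
    for listing in listings:
        extra = neighborhood_features.get(listing["neighborhood"])
        if extra is None:
            continue
        rows.append({**listing, **extra})
    return rows


# Made-up listings with the basic Redfin-style fields named in the text.
listings = [
    {"bedrooms": 3, "bathrooms": 2.0, "year_built": 1950,
     "neighborhood": "neighborhood_a", "sale_price": 250_000},
    {"bedrooms": 2, "bathrooms": 1.0, "year_built": 1925,
     "neighborhood": "neighborhood_b", "sale_price": 180_000},
    {"bedrooms": 4, "bathrooms": 2.5, "year_built": 1990,
     "neighborhood": "neighborhood_c", "sale_price": 320_000},
]

# Hypothetical crime, school, and census features per neighborhood.
neighborhood_features = {
    "neighborhood_a": {"crime_rate": 12.5, "school_rating": 7, "median_income": 61_000},
    "neighborhood_b": {"crime_rate": 20.1, "school_rating": 5, "median_income": 48_000},
}

rows = build_feature_rows(listings, neighborhood_features)
print(len(rows), "rows with combined features")
```

Each resulting row holds both the listing fields and the neighborhood fields, with `sale_price` as the target a model would be trained to predict.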

Goal

Our criterion for success is accuracy, measured by comparing predicted values to actual values and calculating the percentage of predictions that fall within a certain range of the actual values. “According to Redfin, its estimates are approximately 74% accurate within 5% of the sales price for listed homes.” We are running our own Redfin baseline model to verify Redfin’s accuracy claim. The goal is for our improved model to be more accurate than that baseline model, and we will consider it successful if it performs better than the Redfin model.
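The accuracy measure described above can be sketched as the share of predictions that land within a tolerance of the actual sale price. This is a minimal sketch rather than the project's evaluation code: the function name `within_tolerance_accuracy` is hypothetical, the 5% default simply mirrors the figure in the Redfin quote, and the prices are made up for illustration.

```python
from typing import Sequence


def within_tolerance_accuracy(
    predicted: Sequence[float],
    actual: Sequence[float],
    tolerance: float = 0.05,
) -> float:
    """Return the fraction of predictions within `tolerance` of the actual price."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if len(actual) == 0:
        raise ValueError("at least one sale is required")
    hits = sum(
        1
        for p, a in zip(predicted, actual)
        if abs(p - a) <= tolerance * abs(a)
    )
    return hits / len(actual)


# Made-up sale prices and predictions, for illustration only.
actual_prices = [200_000, 310_000, 150_000, 425_000]
baseline_preds = [215_000, 300_000, 149_000, 470_000]
improved_preds = [204_000, 305_000, 160_000, 430_000]

baseline_acc = within_tolerance_accuracy(baseline_preds, actual_prices)
improved_acc = within_tolerance_accuracy(improved_preds, actual_prices)
print(f"baseline: {baseline_acc:.0%}, improved: {improved_acc:.0%}")
```

Under the success criterion stated above, the improved model would count as successful when its score exceeds the baseline's score on the same set of sales.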

Team Responsibilities

Hannah Fairfield: Baseline linear regression model, Redfin dataset

Sai Rajuladevi: Redfin dataset, clustering, scikit-learn modeling, AutoML modeling

Kraig Sheetz: Crime dataset, Schools dataset

Cole Thomas: AutoML modeling, scikit-learn modeling, Census Bureau dataset

Results

View the report PDF to see our detailed findings.

Final Report

Presentation

View the presentation slides.

Presentation

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

1. Fork the Project
2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
3. Commit your Changes (git commit -m 'Add some AmazingFeature')
4. Push to the Branch (git push origin feature/AmazingFeature)
5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Sai Rajuladevi: https://www.linkedin.com/in/sai-rajuladevi/
