Journal of Information Systems Education, 35(2), 144-147, Spring 2024
https://doi.org/10.62273/YZHB9002
145
interested in analyzing trends regarding real estate market
valuations and other key relevant indicators to better serve its
clients more effectively. For example, Orchard Grove can assist
sellers in determining the optimal selling price for their
property. This ensures that sellers receive the best value for
their investments while attracting numerous potential buyers.
Orchard Grove Realty’s market value estimations also provide
buyers with a clear understanding of the current market
conditions, helping them identify properties that align with their
budget and expectations. Data analysts and scientists use
visualizations to tell the story of the data. Orchard Grove Realty
also understands the importance of technology in the real estate
industry. The firm leverages advanced tools and technologies
to enhance their market analysis capabilities. By utilizing data-
driven approaches and innovative software, Orchard Grove can
provide even more accurate estimations and insights into
market trends, ensuring that clients receive the most up-to-date
and reliable information.
Orchard Grove maintains a large data set covering 13 years
of real-world measures that relate to real estate market value
including days on market, list price, and selling price among
others. Orchard Grove has asked you, as one of their
experienced data analysts, to help generate real estate metrics
and visualizations in Tableau to provide more market insight.
As such, the overall purpose of your analysis will be to describe
the valuation of real estate property trends by developing a
Tableau Storyboard. You will report your findings to your
supervisor, the real estate broker, in the form of a Tableau
storyboard with visuals outlining your analysis and
recommendations. You may be asked to write a report or
present your findings to your boss.
3. CASE DATA
3.1 Data Description
The data set consists of 96,625 residential properties sold
during 2008-2021. The variables and definitions that will be
used in the case can be found in Table 1. Note: Your data file
may not have all of the variables so you may have to create
calculated fields for any missing variables.
3.2 Data Cleansing
Before beginning the data analysis process, it is important to
undertake data cleansing techniques to improve data quality and
remove errors or inconsistencies. The data should be normalized
by converting different units of measurement into a standardized
unit. For example, some homeowner association dues are
expressed as monthly costs while others are yearly.
All data errors should be corrected through a process of
identifying and rectifying erroneous data, in particular,
addressing extreme or invalid values. Any inconsistencies or
conflicts within text variables should be resolved. For example,
the homeowner association frequency variable (HOA Freq)
might say dues are collected “Monthly” or “Per Month” and so
those text entries should be edited for uniformity. Furthermore,
it is recommended to review the variable names to ensure
consistent naming conventions.
4. ASSIGNMENT
As one of Orchard Grove’s experienced data analysts, you are
tasked with creating visualizations to address the following real
estate questions in order to provide a market analysis of your
data set. Students could also be directed to various real estate
YouTube instructional videos or specific websites in order to
better understand real estate terminology, current issues and
concerns. Some website examples include but are certainly not
limited to the following:
https://www.opendoor.com/articles/real-estate-terms-you-
should-know (Opendoor Team, 2019) or
https://www.investopedia.com/mortgage/real-estate-investing-
guide (The Investopedia Team, 2022).
Price of the property when sold
Assessor’s Parcel Number -
identification number assigned to parcels
of property by the tax assessor of a
Multiple Listing Service number
Number of fireplaces in the property
Postal code used to provide a location of
a property to the United States Postal
Number of bathrooms on the property
Number of bedrooms on the property
Type of financing used by the buyer to
purchase the property
Number of spaces for cars to fit in the
garage
Homeowner Association fee charged at a
given frequency
Frequency of payments to the
Homeowner Association
The initial suggested selling price of the
property
Date property is listed on the market for
sale
Size of the land according to boundary
lines
The type of lot size, i.e., acres or square
feet
Whether the property has a pool or not,
Y=Yes/N=No
Type of roof on the property
School district where the property is
located
Whether the property has a solar or not,
Y=Yes/N=No
Size of the property in square feet
Year the property was first built
Number of days the property was on the
market before being sold
Ratio of selling price to listing price
Table 1. Variables and Definitions