All Student Theses and Dissertations

Fitting a Linear Regression Model and Forecasting in R in the Presence of Heteroskedascity with Particular Reference to Advanced Regression Technique Dataset on kaggle.com.

Samuel Mbah Nde, Governors State UniversityFollow

Publication Date

Summer 2017

Document Type

Thesis

Degree Name

Master of Science

Department

Mathematics

First Advisor

Andrius Tamulis, Ph.D.

Second Advisor

Anne Morlet, Ph.D.

Third Advisor

Jing Zhang, Ph.D.

Abstract

Since ancient times, men have built and sold houses. But just how much is a house worth? The challenge is to be able to use information about a house such as its location, and the area on which it is built to predict its price. Such predicted prices can be of great importance to any participant in the real estate business be it an agent, a buyer, seller or a bank to make intelligent decisions and the profit that come with such decisions. Since every company’s success depends on its ability to accurately predict financial outcomes, its profitability will depend on how well it can forecast economic outcomes. The goal of this thesis is to demonstrate how to use the forecasting tools of the software R to forecast house prices. To achieve this, we use random forest, correlation plots and scatter plots to select variables to include to use in building a model using the information in one of the data sets (training data set) and then test the effectiveness of the model on another set (test data set). Then, we explore the relationships between these variables and decide whether it is appropriate to build linear models(lm) or a generalized linear models(glm). Finally, we build our model on the dataset making sure to avoid an overly complex or overfit model. Noting that our model suffers from unconditional heteroskedasticity, we discuss its goodness of fit. Then we use the model to predict sales prices for the point in the testing data set.

Recommended Citation

Nde, Samuel Mbah, "Fitting a Linear Regression Model and Forecasting in R in the Presence of Heteroskedascity with Particular Reference to Advanced Regression Technique Dataset on kaggle.com." (2017). All Student Theses and Dissertations. 99.
https://opus.govst.edu/theses/99

Download

Included in

Numerical Analysis and Computation Commons

COinS

OPUS Open Portal to University Scholarship

All Student Theses and Dissertations

Fitting a Linear Regression Model and Forecasting in R in the Presence of Heteroskedascity with Particular Reference to Advanced Regression Technique Dataset on kaggle.com.

Publication Date

Document Type

Degree Name

Department

First Advisor

Second Advisor

Third Advisor

Abstract

Recommended Citation

Included in

Browse

Search

Author Corner

Links

OPUS Open Portal to University Scholarship

All Student Theses and Dissertations

Fitting a Linear Regression Model and Forecasting in R in the Presence of Heteroskedascity with Particular Reference to Advanced Regression Technique Dataset on kaggle.com.

Author

Publication Date

Document Type

Degree Name

Department

First Advisor

Second Advisor

Third Advisor

Abstract

Recommended Citation

Included in

Share

Browse

Search

Author Corner

Links