Skip to the content.

ST-558-Project-2

brief intro

This repo is created by Collin Knezevich and shared with Jiyue Zhang to do the project2. The project2 is primary use the variables in Online News Popularity Data Set to predict the number of shares. This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity). Therefore, the share variable will be the response variable in our model. In this project, we will use linear regression, random forest and boosted tree model to predict the number of shares.

R packages that we need to use

tidyverse, caret, randomForest, gbm, shiny, rmarkdown

You can find our analysis for the different types of articles here:

render code

rmarkdown::render("ST-558-Project-2-Report.Rmd",
                   output_format = "github_document",
                   output_options = list(
                       html_preview = FALSE
                   ))