By Xiaotong Ding (Claire), with Greg Page

A practical tool that enables a modeler to remove non-informative predictors during the variable selection process of data modeling

In this article, we introduce a powerful function called nearZeroVar(). This function, which comes from R's caret package, helps a modeler identify and remove non-informative predictors during the variable selection process of data modeling.

Identification of near-zero variance predictors

For starters, the nearZeroVar() function identifies constant predictors, that is, predictors with only one unique value across samples. In addition, nearZeroVar() diagnoses predictors as having “near-zero variance” when they possess very few unique values relative to…
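To make the idea concrete, here is a minimal sketch in plain Python of the kind of check nearZeroVar() performs. It is an illustration, not caret's actual implementation. It assumes the two criteria described in the caret documentation: a predictor is flagged when the ratio of its most common value's frequency to its second most common value's frequency exceeds a cutoff (caret's default freqCut is 95/5) and its percentage of unique values falls at or below a second cutoff (caret's default uniqueCut is 10). The names near_zero_var and flag_columns, and the toy data, are hypothetical.

```python
from collections import Counter


def near_zero_var(column, freq_cut=95 / 5, unique_cut=10):
    """Return (zero_var, nzv) flags for a single predictor column.

    freq_cut and unique_cut mirror caret's documented defaults; the exact
    comparison operators are an assumption of this sketch.
    """
    counts = Counter(column).most_common()  # [(value, count), ...] by count
    n_samples = len(column)

    # A constant predictor has exactly one unique value.
    zero_var = len(counts) == 1

    # Frequency of the most common value over the second most common value.
    freq_ratio = counts[0][1] / counts[1][1] if len(counts) > 1 else 0.0

    # Unique values as a percentage of the number of samples.
    percent_unique = 100 * len(counts) / n_samples

    nzv = zero_var or (freq_ratio > freq_cut and percent_unique <= unique_cut)
    return zero_var, nzv


def flag_columns(data):
    """Return the names of columns flagged as zero or near-zero variance."""
    return [name for name, col in data.items() if near_zero_var(col)[1]]


# Toy data: four predictors observed on 100 samples.
data = {
    "constant": [1] * 100,              # one unique value
    "rare_flag": [0] * 98 + [1] * 2,    # dominated by a single value
    "balanced": [0] * 50 + [1] * 50,    # two evenly split values
    "continuous": list(range(100)),     # every value is unique
}

flagged = flag_columns(data)
print(flagged)
```

In this toy data, only the constant column and the heavily imbalanced rare_flag column are flagged. The balanced binary column and the all-unique column pass because their frequency ratio is 1, well below the cutoff.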


By Xiaotong Ding (Claire), with Greg Page

Introduction:

In the digital world around us, we are surrounded by recommendation systems. The entertainment recommendations that we see from streaming movie and music services, the item suggestions presented to us when we shop online, and the targeted ads that appear after our web searches are all specifically geared towards us. Based on our browsing history, purchase patterns, “Likes” on social media, product reviews, demographics, and myriad other quantifiable data points, marketers continually refine their algorithms to make these targeted recommendations increasingly relevant for us — and lucrative for them.

In order to…


by Xiaotong “Claire” Ding

Introduction & Purpose

The purpose of this analysis is to use logistic regression to identify risk factors for angina. Angina, which is marked by severe pain in the chest, results from reduced blood flow to the heart and can be a symptom of coronary artery disease. The dataset used for the analysis can be found on Kaggle.

In the sections below, we describe the exploratory data analysis, the variable selection for the model, and an approach to handling categorical variables whose levels differ in statistical significance.

The Data

The angina.csv dataset includes…

Xiaotong Ding (Claire)

Master's student in Applied Business Analytics, Boston University
