Welcome to Homepage of
Statistical Foundations of Data Science



Computer Code
A. Link for Creating Word Cloud in Chapter 1
B. Some Computer Code for Chapter 3
- R-Package for scaled LASSO written by Tingni Sun
C. Some Computer Code for Chapter 8
- R-Package for SIS written by Jianqing Fan, Yang Feng, Diego Franco Saldana, Richard Samworth, Yichao Wu
- R Package VariableScreening for High-Dimensional Screening for Semiparametric Longitudinal Regression written by Runze Li, Liying Huang and John Dziak
Data Sets
A. Data sets for Chapter 2
- Hong Kong Environment Data Set [Data] [Data Description]
- Macroeconomic Time Series Data Set [Monthly Macroeconomics Data] [Source and Details] [Transformed Macroeconomics Data] [Details of transformation and Meanings of variables]
- Zillow House Price Data [Training Data] [Testing Data] [Source and Details]
B. Data sets for Chapter 3
- Macroeconomic Time Series Data Set [Monthly Macroeconomics Data] [Source and Details] [Transformed Macroeconomics Data] [Details of transformation and Meanings of variables]
- Zillow House Price Data [Training Data] [Testing Data] [Source and Details]
C. Data sets for Chapter 5
- Mammographic Mass Data Used in Section 5.6 [Data] [Data Description]
D. Data sets for Chapter 6
E. Data sets for Chapter 7
- European American SNP Data Used in Section 8.2.5 [SNPs] [Response] [Data Description]
F. Data sets for Chapter 8
- Market Data Used in Section 8.8 [Data] [Data Description]
- The Cardiomyopathy Microarray Data Used in Exercise 8.8 [Data] [Gene names] [Data Description]
- The Rat Eye Expression Data Used in Exercise 8.9 [Data] [Data Description]
G. Data sets for Chapter 9
- Mice Protein Expression Data Used in Excise 9.16 [Data & Description] [Cleaned Version]
H. Data sets for Chapter 10
- Macroeconomic Time Series Data Set [Monthly Macroeconomics Data] [Source and Details][Transformed Macroeconomics Data] [Details of transformation and Meanings of variables]
I. Data sets for Chapter 11
- Macroeconomic Time Series Data Used in Exercise 11.3 [Data & Description]
J. Data sets for Chapter 12
- Email Spam Data Used in Exercise 12.18 [Data & Description]
- Zillow House Price Data Used in Exercise 12.19 [Training Data] [Testing Data] [Source and Details]
- Mice Protein Expression Data Used in Excise 12.20 [Data & Description]
K. Data sets for Chapter 13
- Mice Protein Expression Data Used in Excise 13.2 [Data & Description]
L. Data sets for Chapter 14