Save 45% off ODSC East, it's just a few months away!

days

:

:

for an extra 20% off, use the code: ODSC20
Go
Max Kuhn, Director - Pfizer

Max Kuhn, Director - Pfizer

Title :

Bio: I am a Ph.D. statistician with experience in a few different domains: pharmaceutical non-clinical statistics, molecular diagnostic R&D, assay development, manufacturing support/Six Sigma and clinical statistics (in order of my interests). I have worked both as an individual contributor as well as directing moderate sized groups (currently 2 people, 12 previously). I prefer problems where creativity is a key to problem solving. For example, predictive modeling (i.e. machine learning) is an area where traditional statistics may be limiting; as long as you can prove that a solution performs well, any idea is on the table. Complex problems interest me. A friend once remarked that I thrive on making order out of chaos and, to some extent, this is true. One of the most overlooked skills in these situations is the ability to remained focused on the core objective. This is especially true when we have as access to large amounts of data. Specialties: Predictive modeling/machine learning/pattern recognition | Computational biology and chemistry | High dimensional biology | The design and analysis of experiments

2 Visualizations

2 Visualizations

Editor’s note: This is the second of a series of posts on the caret package. The featurePlot function is a wrapper for different lattice plots to visualize the data. For example, the following figures show the default plot for continuous outcomes generated using the featurePlotfunction. For classification data sets, the iris data are used for illustration. […]

The caret Package

The caret Package

Editor’s note: This is the first of a long series of posts on the caret package. Introduction The caret package (short for _C_lassification _A_nd _RE_gression _T_raining) is a set of functions that attempt to streamline the process for creating predictive models. The package contains tools for: data splitting pre-processing feature selection model tuning using resampling […]

Optimizing with Nonlinear Programming

Optimizing with Nonlinear Programming

Rafael Ladeira asked on github: I was wondering why it doesn’t implement some others algorithms for search for optimal tuning parameters. What would be the caveats of using a genetic algorithm , for instance, instead of grid or random search? Do you think using some of those powerful optimization algorithms for tuning parameters is a […]

Three Aspects of Predictive Modeling

Three Aspects of Predictive Modeling

These slides were originally posted on appliedpredictivemodeling.com, and were kindly contributed to Open Data Science. Link to presentation: Three Aspects of Predictive Modeling By: Max Kuhn, Ph.D Presentation Overview: “Predictive modeling” definition Some example applications A short overview and example How is this dierent from what statisticians already do? Unmet challenges in applied modeling Predictive […]